Automation & Benchmarking Engineer
Job Description
Key Responsibilities
Design and develop end-to-end automation pipelines for evaluation workflows, including prompt submission, response collection, result aggregation, and reporting.
Integrate evaluation tooling with developer surfaces such as Gemini CLI, VS Code, and GitHub.
Conduct competitive benchmarking against peer AI tools to measure correctness, verbosity, and usefulness.
Build dashboards and visualization reports using Looker Studio, BigQuery, or Python-based tools.
Optimize system performance, automate error logging, and maintain reproducibility across evaluations.
Collaborate with TPMs and data specialists to deliver evaluation automation at scale.
Ensure source code management and deployment compliance in GitLab and Bitbucket environments.
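The evaluation pipeline described in the responsibilities above (prompt submission, response collection, result aggregation, reporting) could be sketched as follows. This is a minimal illustration only: the model call and scoring heuristic are hypothetical stand-ins, not any tool or rubric named in this posting.

```python
from dataclasses import dataclass
from statistics import mean

@dataclass
class EvalResult:
    prompt: str
    response: str
    score: float

def submit_prompt(prompt: str) -> str:
    # Hypothetical stand-in for a real model API call.
    return f"response to: {prompt}"

def score_response(response: str) -> float:
    # Hypothetical scoring heuristic; a real evaluation would
    # measure correctness, verbosity, and usefulness.
    return 1.0 / (1 + len(response.split()))

def run_pipeline(prompts: list[str]) -> tuple[list[EvalResult], dict]:
    results = []
    for p in prompts:
        r = submit_prompt(p)  # prompt submission + response collection
        results.append(EvalResult(p, r, score_response(r)))
    # Result aggregation into a summary report.
    report = {
        "n": len(results),
        "mean_score": mean(res.score for res in results) if results else 0.0,
    }
    return results, report

if __name__ == "__main__":
    results, report = run_pipeline(["What is 2+2?", "Name a prime number."])
    print(report)
```

In practice the aggregated report would feed a dashboard (e.g. exported to BigQuery for Looker Studio), and the pipeline would log errors and pin inputs so runs stay reproducible.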