System Development Engineer
KeySkills
Company Name
Job Description
Gathered and translated requirements from applied scientists into scalable data and ML tooling solutions.
-
Extracted and processed data from multiple in-house systems and open-source domains.
-
Designed and developed Python scripts and UI tools to build and manage end-to-end data pipelines for model training and testing.
-
Curated training and testing datasets, including data annotation, validation, and cleaning to remove noise and inconsistencies.
-
Implemented data and model versioning to ensure reproducibility of experiments.
-
Built and maintained AWS SageMaker batch transform scripts for large-scale model inference.
-
Automated monitoring and orchestration of batch transform and AWS Batch jobs, including chaining dependent workflows.
-
Maintained and fixed internal tools, data pipelines, and Amazon QuickSight dashboards.
-
Prepared science code for production by creating Docker images and deployment-ready packages for engineering handoff.
-
Worked extensively with structured and semi-structured data using Python libraries such as NumPy and Pandas, handling JSON, CSV, and Parquet formats.