Machine Learning Engineer (AI/ML) – TensorFlow Lite, ML Kit, PyTorch

Posted 1 month ago • 14 applicants • Be one of the first to apply!

Our Client - Information Technology & Services company

Mountain View, CA

$75.00 - $90.00/hour

Exact compensation may vary based on skills, experience, and location.

40 hrs/wk

Contract (w2)

Remote work partially (20%)

Travel not required

Start date

July 7, 2025

End date

January 7, 2026

Superpower

Technology

Capabilities

Mobile Development

Data Science and Machine Learning

Technology Product Management

Technology Architecture

Technical Program/Project Management

Preferred skills

Android Development

Machine Learning

Quantization

Android (Operating System)

Generative Artificial Intelligence

AI/ML Inference

Deep Learning

PyTorch (Machine Learning Library)

TensorFlow

Reinforcement Learning

Machine Learning Model Training

Machine Learning Model Monitoring And Evaluation

Preferred industry experience

Information Technology & Services

Experience level

5 - 8 years of experience

Job description

Our Customer is a Silicon Valley-based company that is engaged in researching emerging technologies.

We are seeking a contract Machine Learning Research Engineer to develop advanced on-device machine learning systems that enable secure, adaptive, and scalable intelligence across mobile devices. The role emphasizes building intelligent, adaptive, and privacy-preserving ML systems that operate efficiently within the constraints of mobile environments. The ideal candidate will have strong experience in designing real-time, context-aware inference systems that can respond dynamically to local data patterns and behaviors. This role is a hybrid setup (4 days onsite and 1 day remote/week) in Mountain View, CA.

Responsibilities:

Design, develop, and deploy on-device machine learning models optimized for Android, ensuring low latency and minimal resource consumption.
Build robust and scalable ML pipelines using Android-native frameworks such as: TensorFlow Lite, ML Kit (including GenAI APIs), MediaPipe, and PyTorch Mobile
Implement local signal aggregation and real-time pattern recognition logic to enable responsive in-app actions driven by on-device inference.
Architect systems that support telemetry, secure logging, and privacy-first feedback collection for monitoring and evaluation.
Apply model compression and optimization techniques (e.g., quantization, pruning, distillation) to meet mobile performance constraints.
Develop secure, privacy-first solutions where all data processing and ML inference occur strictly on-device, with no external data exposure.
Enable mechanisms for continuous local learning and model updates using device-resident data and signals, without compromising privacy.
Ensure integration with Android's security model and collaborate with platform and product teams to deploy AI features safely at scale.

Skills and Qualifications:

5-7 years of experience with a Masters degree, 3+ years of experience with a PhD
Proven experience in Android development (Kotlin/Java), with strong understanding of system architecture, resource management, and performance tuning.
Hands-on expertise with on-device ML frameworks including TensorFlow Lite, ML Kit, MediaPipe, and PyTorch Mobile.
Solid foundation in machine learning and signal processing techniques, such as time-series modeling, clustering, classification, and real-time event detection.
Strong knowledge of mobile data handling and Android security practices, including permissions, sandboxing, and secure data storage.
Understanding of privacy-preserving learning techniques and data governance in mobile environments.
Familiarity with secure data handling on Android, including encrypted storage, permissions, sandboxing, and secure compute enclaves.
Experience with telemetry systems and evaluation pipelines for monitoring model performance on-device at scale.

Preferred:

Experience building ML-driven mobile applications in domains requiring user personalization, privacy, or security.
Understanding of real-time data processing and behavioral modeling on resource-constrained edge devices.
Knowledge of on-device learning techniques, federated learning, or personalization methods.
Prior contributions to systems using federated learning, differential privacy, or local fine-tuning of models is a plus
Experience with backend infrastructure for model management (e.g., model registries, update orchestration, logging frameworks) is a plus.
Prior work with anomaly detection or behavioral modeling in resource-constrained environments is a plus.
Experience developing responsive systems capable of monitoring local context and dynamically triggering actions based on model outputs is a plus
Experience optimizing models for ARM architectures is a plus

We offer a competitive salary range for this position. Most candidates who join our team are hired at the median of this range, ensuring fair and equitable compensation based on experience and qualifications.

Contractor benefits are available through our 3rd Party Employer of Record (Available upon completion of waiting period for eligible engagements)

Benefits include: Medical, Dental, Vision, 401k.

An Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.

All applicants applying for U.S. job openings must be legally authorized to work in the United States and are required to have U.S. residency at the time of application.

If you are a person with a disability needing assistance with the application, or at any point in the hiring process, please contact us at support@themomproject.com.

Apply