Android AI/ML Engineer - On-Device
New Yesterday
Location:
Mountain View, CA (On-site preferred, at least 4 days/week)
Position Summary:
We are looking for a highly capable engineer to design and deploy real-time, on-device machine learning solutions for mobile devices. This role focuses on developing privacy-first, resource-optimized ML systems that operate directly on Android hardware, supporting high-impact AI applications in real-world environments.
The ideal candidate brings deep technical expertise in on-device intelligence and mobile ML pipelines, and thrives in fast-paced environments that demand performance, security, and adaptability.
Key Responsibilities:
Design and deploy efficient on-device ML models tailored for Android platforms.
Build end-to-end ML pipelines using: TensorFlow Lite
ML Kit (including GenAI APIs)
MediaPipe
PyTorch Mobile
Optimize models using techniques like quantization, pruning, and distillation to meet mobile performance targets.
Develop context-aware, real-time inference systems and data pipelines on-device.
Implement privacy-first architecture, ensuring all data processing and inference is local.
Collaborate with backend/cloud teams to integrate model orchestration systems (e.g., MCP, Vertex AI, SageMaker) for: Model delivery and remote updates
Telemetry and performance monitoring
A/B testing and rollout strategies
Implement secure storage and encrypted data handling in line with privacy and compliance standards.
Support adaptive model behavior using on-device personalization, federated learning, or similar privacy-preserving techniques.
Technical Requirements:
Must-Have Skills: Strong Android development experience using Kotlin and/or Java
Proficiency with on-device ML tools: TensorFlow Lite
ML Kit
MediaPipe
PyTorch Mobile
Solid understanding of mobile constraints: Real-time inference
Low-latency processing
Model size and resource optimization
Experience in integrating mobile apps with backend/cloud systems for: Model lifecycle management
Secure telemetry and data analytics
Knowledge of Android security best practices, including sandboxing, permissions, encryption, and local data protection
Nice-to-Have Skills: Experience with federated learning, differential privacy, or on-device personalization
Familiarity with cloud infrastructure (e.g., AWS, GCP) and ML deployment workflows
Background in mobile AI features like anomaly detection, behavioral modeling, or privacy-focused applications
Experience with model orchestration platforms such as MCP, Vertex AI, or SageMaker
Education & Experience: Master's degree with 5-7 years of relevant experience, or
PhD with 3 years of relevant experience
Preferred: Less than 10 years of total professional experience
Work Schedule: On-site presence preferred - at least 4 days/week in Mountain View, CA
Standard business hours, minimal overtime except during key sprints
Interview Process: 1 Technical Phone Screen
2 Virtual Technical Interviews
Position Type & Growth Potential: Contract role with high potential for extension
Strong possibility of full-time conversion based on performance and business needs
Core Technical Keywords (for resume alignment): TensorFlow Lite (TFLite)
ML Kit
MediaPipe
PyTorch Mobile
On-device machine learning
Mobile ML pipeline
Edge AI
Model quantization / pruning / distillation
Real-time inference
Federated learning
Differential privacy
Telemetry integration
Secure Android development
Model orchestration
Cloud-integrated ML (e.g., Vertex AI, SageMaker, MCP)
- Location:
- Mountain View, CA, United States
- Job Type:
- FullTime
- Category:
- Computer And Mathematical Occupations