WoRV Team is Hiring!
Job Introduction - Machine Learning Research Scientist
“Responsible for the pre‑training and post‑training of the Foundation Embodied Agent.”
The WoRV (World model for Robotics and Vehicle control) team is tackling robotics and autonomous‑driving challenges by building agents that, like humans, integrate diverse, high‑dimensional information to perceive, decide, and act.
Currently, the ML Research team is driving research and development around two core missions:
- Building Training Recipes for the Foundation Embodied Agent
- In the nascent field of Embodied AI, research and develop new training recipes to create Foundation Embodied Agents that respond robustly to previously unseen environments, rules, robots, or instructions.
- Aim to deliver a general‑purpose driving agent capable of cooperation, reasoning, and adaptation across varied scenarios.
- Solving Memory & Efficiency Challenges of the Foundation Embodied Agent
- Most Robotics Foundation Models (RFMs) decide actions solely on the current observation—without any memory—which drastically limits the range of tasks they can perform. Humans act based on memory, so this gap also breaks the assumptions of imitation learning.
- The way Multimodal LLMs (MLLMs) process visual information is often inefficient. We’re researching solutions to these challenges to build a more efficient Embodied Agent.
Research Support
- Ultra‑High‑Performance GPU Cluster: We maintain CORE (Compute‑Oriented Research Environment) CORE Introduction
- On‑premise DGX H100: 12 units (H100×96), over 30 A100 GPUs, and more than 10 V100 GPUs in operation.
- High‑Quality, Fully Human‑Driven Driving Data Pipeline for both simulation and real‑world environments
- Unlimited access to custom datasets and annotations tailored to the research team's needs.
- Extensive Map‑Based Driving Dataset
- Over 200 hours of driving data available, with unlimited access for the research team.
- Comprehensive Conference & Publication Support
- Assistance with attending and submitting to NeurIPS, ICLR, CVPR, ECCV, Interspeech, ACL, etc.
- Full coverage of conference participation expenses upon paper acceptance in international journals.
For more details:
Key Responsibilities
- Broad understanding of AI/ML and deep expertise in specific domains
- Excellent foundational CS knowledge and engineering skills
- Strong grasp of core computer science principles (operating systems, data structures, algorithms, databases, networks, etc.)