Twelve Labs
Senior Software Engineer, ML Data
Seoul, South Korea | Posted 7 days ago | Full-time
What you'd do
- Build data engines for collecting, preprocessing, filtering, and labeling large-scale multimodal datasets (video, image, audio) for LLM and VLM training
- Design and develop data systems for managing and visualizing petabyte-scale multimedia data
- Build high-impact libraries and services for the ML data pipeline from design through operations
What they want
- 3 or more years of practical experience as a backend or data engineer
- Proficiency in Python
- Strong interest in and understanding of ML/AI systems and related data processing
Nice to have
- Experience leading engineering teams as a technical lead
- Experience with large-scale data collection, processing, and storage pipelines using tools such as Spark
- Experience building model-based language or vision-language datasets