Upstage

AI Research Engineer - LLM Inference Optimization

Seoul, South KoreaPosted 11 days ago

Apply on Upstage's career page

What you'd do

Design and implement systems optimizing latency, throughput, cost trade-offs for LLM inference.
Develop model lightweight pipelines minimizing accuracy loss while maximizing hardware acceleration.
Research and apply production inference techniques like Speculative Decoding and Expert Parallelism.

What they want

3+ years model inference optimization research or development experience.
Deep understanding of latest LLM architectures and inference optimization techniques.
Experience using vLLM, SGLang, TensorRT-LLM, or Text Generation Inference engines.

Nice to have

Published papers at international ML/NLP conferences as first or corresponding author.
Contribution experience to vLLM, SGLang, TensorRT-LLM, or Transformers frameworks.
Optimization experience with MoE, long-context, or multimodal model serving.

Seoulstart's read

Visa tier (likely): E-7 specialty / professional (likely)
Korean language: Business-level Korean required
English friendliness: High: JD in English, foreign-friendly signals
Context: Upstage is a well-funded Korean AI startup with strong research focus; expect hybrid work flexibility and strong English collaboration despite business-level Korean requirement.

This is Seoulstart's analysis of the public job description, not the employer's stated policy. Verify visa sponsorship, language requirements, and remote allowances directly with the recruiter.

View the full posting and apply on Upstage's career page

Korea jobs hiring English speakers

AI Research Engineer - LLM Inference Optimization

What you'd do

What they want

Nice to have

Seoulstart's read

Related Seoulstart guides