ByteDance
AI

Research Scientist - Driven Agent Self-Evolution - Global Frontier Tech Recruitment Program - 2027 Start (PhD)

ByteDance · Seattle, WA, US · $202k - $368k

Actively hiring Posted about 1 month ago

Responsibilities

  • Research and develop agent frameworks that continuously learn and improve from execution traces, user feedback, and environmental signals.
  • Build large-scale log analytics pipelines to extract quality signals, usage patterns, and actionable insights from model and agent invocation logs, driving data-informed system and model improvements.
  • Explore and apply frontier techniques in LLM post-training, reasoning, and planning to enhance agent capabilities.
  • Collaborate across algorithm research, platform engineering, and product teams to turn research ideas into production-grade systems at scale.

Basic qualifications

  • Individuals who are completing or have recently completed a Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a closely related discipline.
  • Strong theoretical and practical foundation in machine learning, deep learning, reinforcement learning, or optimization.
  • Research experience in at least one of the following areas: LLM-based agents, planning and reasoning, multi-agent systems, continual/lifelong learning, or LLM post-training (e.g., RLHF, DPO, GRPO, self-play).
  • Strong programming skills in Python and proficiency with ML frameworks (e.g., PyTorch, TensorFlow, JAX).
  • Publication record at top-tier venues (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, AAAI, AAMAS, COLM).
  • Strong problem-solving skills and ability to thrive in a fast-paced, collaborative environment.

Preferred qualifications

  • Publications in areas directly related to agent learning and adaptation, such as tool use, self-improvement, skill discovery, trajectory optimization, reward modeling, or agent evaluation.
  • Research experience in LLM reasoning and planning, including chain-of-thought, tree/graph search, Monte Carlo methods, or inference-time compute scaling.
  • Experience training or fine-tuning large language models, including supervised fine-tuning, preference optimization, or curriculum learning.
  • Hands-on experience building or evaluating LLM-based agent systems (e.g., ReAct, function calling, code generation agents, or multi-agent orchestration).
  • Familiarity with meta-learning, few-shot generalization, or transfer learning in the context of LLM-based systems.
  • Experience with feedback-driven optimization loops, such as online learning, bandit methods, or evolutionary strategies applied to agent improvement.
  • Strong interest in bridging frontier AI research with production-grade engineering — turning papers into systems that work at scale.
  • Internship experience at technology companies or research organizations.
  • Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
  • Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and
  • Exercising sound judgment.

Tags & focus areas

Used for matching and alerts on DevFound
Internship Machine Learning Generative Ai Ai
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.