Internships
Register
Copyright © 2000—2025 JetBrains s.r.o.

Evaluation of LLM-Based Agentic Systems for Software Development Tasks

Description

Large Language Models (LLMs) have vast applications in the software engineering domain. In the AI Agents and Planning team, we focus on agentic systems, where LLMs can interact with external environments, such as IDEs or web browsers, and autonomously work through complex, open-ended tasks. With LLM-based agents, it becomes possible to consider tasks resembling daily developer work (e.g., SWE-bench). However, building agents also brings technical challenges: with interactive environments instead of static data points, data preparation, training, and evaluation require significant effort.

In this project, we invite you to work on technical tasks that arise in AI agents research and directly contribute to our team's ongoing projects (with potential publications). Your tasks could include:

  • Data filtering and preparation (e.g., gather agent trajectories for further fine-tuning or extend the existing dataset)

  • Work on evaluation infrastructure (e.g., support new benchmarks/new agentic scaffolds or scale existing infrastructure)

  • Fine-tuning models (e.g., SFT or DPO using established infrastructure)

Requirements

  • Strong Python programming skills

  • Ability to work with existing codebases and learn unfamiliar frameworks

  • Experience with Docker and familiarity with data analysis and experiment tracking tools

  • Basic understanding of NLP, LLMs, and AI agents


Would be a plus:

  • Experience with our tech stack:

    • Agentic scaffolds: LangGraph

    • Training: VeRL & LLaMA-Factory

    • Inference: vLLM

    • Orchestration: Kubernetes, ZenML

  • Experience with AI agents

  • Experience with LLMs fine-tuning and evaluation

  • Relevant publications

Admission

Internship projects 2025-2026

Contact details

internship@jetbrains.com

Preferred internship location

Germany
Serbia

Area

Data Science
Machine Learning
Research

Internship timing preferences

Flexible start
Full-time preferable