Back to Jobs

[Remote] Research Intern (LLM)

Remote, USA Full-time Posted 2025-11-24

Note: The job is a remote job and is open to candidates in USA. 2077AI Open Source Foundation is looking for a Research & Evaluation Intern to help build advanced QA datasets and evaluate large language models. This role is ideal for students passionate about LLMs, evaluation science, and the intersection of research and applied data work.


Responsibilities

  • Design and construct high-quality, sufficiently challenging QA datasets (graduate/PhD level) inspired by GPQA, HLE, and AI4Sci families, collaborating with a global network of talented researchers
  • Evaluate large language models on reasoning, factuality, and problem-solving benchmarks
  • Develop review pipelines and quality-control criteria for expert-level question generation
  • Analyze model outputs, conduct error taxonomy studies, and summarize insights for internal reports and research papers
  • Collaborate with the 2077AI Foundation’s open-source benchmark teams on public dataset releases

Skills

  • Strong background in computer science, data engineering, artificial intelligence, or related fields, with hands-on experience in large-scale data systems
  • 1+ years of experience with LLMs, prompt engineering, and evaluation frameworks (e.g., LM Eval Harness, OpenCompass)
  • Excellent written and verbal English skills and analytical reasoning
  • Strong execution and team management skills—able to translate high-level objectives into actionable plans and drive team outcomes
  • Experience with formal methods, chain-of-thought evaluation, or curriculum generation
  • Relevant publications in top conferences

Company Overview

  • The 2077AI Foundation, is at the forefront of AI data standardization and progression. It was founded in undefined, and is headquartered in Singapore, SG, with a workforce of 51-200 employees. Its website is https://www.2077ai.com/.

  •   Apply To This Job

    Similar Jobs

    Experienced Part-Time Data Entry Specialist – Remote Work Opportunity with arenaflex for Organized and Detail-Oriented Individuals

    Remote, USA Full-time

    Experienced Remote Customer Service Agent – Delivering Exceptional Travel Experiences and World-Class Support to Passengers at arenaflex

    Remote, USA Full-time

    Remote Care Manager - RN 3 Locations

    Remote, USA Full-time

    Business Development Director, Commercial Enter...

    Remote, USA Full-time

    Data Entry Remote Jobs-JetBlue Airline At Home ...

    Remote, USA Full-time

    Senior Data Scientist - Revenue Intelligence

    Remote, USA Full-time

    Dog Walker / Dog Sitter

    Remote, USA Full-time

    Human Resources Global Services Specialist

    Remote, USA Full-time

    Strategic Accounts Executive - Ambulatory Surge...

    Remote, USA Full-time

    Manager, Customer Strategy - Marketing (Onsite)

    Remote, USA Full-time

    Amazon Customer Service Center (Work At Home) Up to $25/hr

    Remote, USA Full-time

    Informationist II

    Remote, USA Full-time

    Disney Data Entry Part Time Remote Jobs – Work From Home Job

    Remote, USA Full-time

    Text Collector (Project Nimbus)

    Remote, USA Full-time

    Wholesale Director

    Remote, USA Full-time

    [Work From Home] Immediately Need KINDERGARTEN TEACHER in

    Remote, USA Full-time

    Insurance Verification Specialist (Bilingual Spanish)

    Remote, USA Full-time

    Remote Typing Jobs for Beginners Work from Anywhere

    Remote, USA Full-time

    Healthcare Recruiter Allied Health - Remote

    Remote, USA Full-time

    Dell Part Time Data Entry @remote $24/hour At Careermilard

    Remote, USA Full-time