Software Engineer for AI Model Evaluation (Scala/Kotlin/OCaml)

Company: Confidential
Location: Remote
Closing Date: 06/07/2026
Salary: £120 - £200 Per Annum
Hours: Full Time
Type: Permanent

Job Description

A leading AI research organization is evaluating how advanced AI systems perform in specialized engineering domains and is seeking expert software engineers with deep, hands-on experience in Scala, Kotlin, and/or OCaml. In this role, you will apply your expertise to assess complex, real-world technical scenarios, directly shaping how cutting-edge AI performs in these ecosystems. Single-language specialists are strongly encouraged to apply.

Key Responsibilities
  • Apply your language expertise to evaluate technical tasks against real-world professional standards.
  • Review intricate code-level situations and provide precise, structured written assessments.
  • Work inside containerized repositories (Docker), running and interpreting programmatic and CI-style checks to judge whether an engineering environment is sound.
  • Provide clear written rationales explaining your expert judgments.
  • Complete well-defined, time-bounded tasks with explicit evaluation criteria.
Qualifications

We value strong CS fundamentals, fast ramp-up, and high ownership. Engineers who can flex across languages are especially valued, and deep single-language specialists are very welcome. The ideal candidate brings:

  • 5+ years of professional software engineering in one or more of:
    • Scala: functional/OOP JVM development; distributed systems; Spark, Akka, Cats, or ZIO; data engineering contexts.
    • Kotlin: JVM and Android development; coroutines; modern backend services.
    • OCaml: typed functional programming; compilers, tooling, or formal/systems work.
  • Comfort working in Docker/CI-based repo environments and reading automated checks, or the ability to ramp up on these quickly.
  • Ability to articulate, clearly and in writing, not just what code does but why it's correct or idiomatic.
  • Welcomed: engineers from top product, data, or research engineering backgrounds. Non-traditional digital footprints are fine; a GitHub profile is not required.
Why This Work Matters

Scala, Kotlin, and OCaml power distributed data systems, modern mobile and backend platforms, and high-assurance software. These are domains where AI systems still have limited depth, and your assessments directly influence how those systems learn to operate in them.

Work Terms
  • Compensation: $120-$200/hour, based on depth and experience.
  • Expected commitment: 20-40 hrs/week.
  • Task flow is variable: work tends to arrive in waves, and there can be a lag between task batches. We're looking for people who stay flexible, including availability on weekends when needed, and who take ownership of keeping the project moving forward.
  • No proprietary tooling required; tasks are completable without employer-provided systems.