Job Description
A leading AI research organization is evaluating how advanced AI systems perform in specialized engineering domains, seeking expert software engineers with deep, hands-on experience in Scala, Kotlin, and/or OCaml. In this role, you will apply your expertise to assess complex, real-world technical scenarios, directly shaping how cutting-edge AI performs in these ecosystems. Single-language specialists are strongly encouraged to apply.
Key Responsibilities- Apply your language expertise to evaluate technical tasks against real-world professional standards.
- Review intricate code-level situations and provide precise, structured written assessments.
- Work inside containerized repositories (Docker), running and interpreting programmatic and CI-style checks to judge whether an engineering environment is sound.
- Provide clear written rationales explaining your expert judgments.
- Complete well-defined, time-bounded tasks with explicit evaluation criteria.
We value strong CS fundamentals, fast ramp-up, and high ownership. Engineers who can flex across languages are especially valued, and deep single-language specialists are very welcome. The ideal candidate brings:
- 5+ years of professional software engineering in one or more of:
- Scala , functional/OOP JVM development; distributed systems; Spark, Akka, Cats, or ZIO; data engineering contexts.
- Kotlin , JVM and Android development; coroutines; modern backend services.
- OCaml , typed functional programming; compilers, tooling, or formal/systems work.
- Comfort working in Docker/CI-based repo environments and reading automated checks, or the ability to ramp on these fast.
- Ability to articulate not just what code does but why it''s correct or idiomatic, clearly in writing.
- Welcomed: engineers from top product, data, or research engineering backgrounds; non-traditional digital footprints are fine, a GitHub profile is not required.
Scala, Kotlin, and OCaml power distributed data systems, modern mobile and backend platforms, and high-assurance software, domains where AI systems still have limited depth. Your assessments directly influence how they learn to operate in them.
Work Terms- Compensation: $120, 200/hour , based on depth and experience.
- Expected commitment: 20, 40 hrs/week .
- Task flow is variable, work tends to arrive in waves, and there can be a lag between task batches. We''re looking for people who stay flexible, including availability on weekends when needed, and who take ownership of keeping the project moving forward.
- No proprietary tooling required, tasks are completable without employer-provided systems.