Back to jobs

Senior LLM Researcher

Job description

Senior LLM Researcher (ML)
Remote -United States
$250,000 - $500,000 + Equity + Flexible PTO + Benefits + Progression

Are you passionate about working on the bleeding edge of AI innovation, and looking for a high impact role within a well-funded startup that's looking to take on OpenAI, whilst working alongside one of the most respected researchers in AI?

This is an excellent opportunity to join a high-caliber team led by proven AI founders and tech entrepreneurs to pioneer the next generation of enterprise SaaS tools, powered by novel AI/ML technologies.

My client is a fast-growing startup with substantial funding and backing from notable tech giants/investment firms, and is looking to disrupt the AI/ML space by developing task-optimized AI models designed to deliver high performance while using significantly less computational power than traditional LLMs.

You will conduct and lead research on the development, training, and deployment of large language models, and work closely with MLOps engineers to design, optimize, and maintain scalable training pipelines for large language models, ensuring their efficient deployment in production environments. In addition, you will design, implement, and evaluate novel model architectures, leveraging techniques from transformer models and beyond to enhance language understanding, generation, and inference capabilities.

This role is a great opportunity to join early in a startup that is excellently positioned to disrupt the company hierarchy within the AI space, in a role where you'll work directly with some of the brightest minds in the industry on truly innovative novel architecture, whilst achieving unmatched career progression.

The Person:

  • PhD / MS in Computer Science, Machine Learning, Computational Linguistics
  • Python, TensorFlow, PyTorch, Hugging Face Transformers, MLflow, Kubeflow
  • Expertise in MLOps best practices, including model versioning, CI/CD pipelines, containerization, and cloudbased deployment of large-scale models.
  • Experience in pretraining or post-training large language models


The Role

  • Work with a leading ML research expert
  • Dedicated GPU cluster for research.
  • Conduct and lead research on the development, training, and deployment of large language models
  • Collaborate closely with MLOps engineers to build, optimize, and maintain scalable training pipelines
  • Design, implement, and evaluate novel model architectures
  • Play a pivotal role in shaping the future of AI applications