AI Training Contributor | Ardeshir Shojaei

In one sentence

A remote contractor role contributing to AI model training and evaluation for leading AI labs, via Mercor’s expert annotation platform.

What I did

Worked on AI training projects focused on improving how language models reason through complex problems. The work involved three main tasks: writing structured reasoning trajectories — step-by-step reasoning chains that models learn from — evaluating model outputs against defined quality criteria, and identifying where reasoning breaks down in code-based tasks.

The projects contributed training data to language model fine-tuning pipelines, specifically in areas of code reasoning and logical problem solving. This type of work sits within the post-training phase of model development, using supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) techniques.

Outcome

Ongoing. Selected by Mercor as a domain expert contractor, contributing to active AI training projects for leading AI labs.