LLM Evaluator (Model Response Analyst)
12000 $Odixcity Consulting
Job Title: LLM Evaluator (Model Response Analyst)
Location: Remote (Worldwide)
Job Summary: We are seeking a detail-oriented and analytical LLM Evaluator to assess, analyze, and improve the performance of large language models (LLMs). In this role, you will evaluate AI-generated content for accuracy, coherence, factual reliability, bias, safety, and alignment with defined guidelines.
Responsibilities:
· Evaluate and rank model-generated text based on complex rubrics covering dimensions such as factuality, coherence, safety, instruction- following, and creativity.
· Review multiple model responses to the same prompt and determine which output a human would prefer, providing justifications for your choices.
· Provide clear, concise feedback to the modeling and training teams regarding recurring failure models observed during evaluation sessions.
· Attempt to “break” the model by crafting prompts designed to elicit biased, harmful, or insecure outputs to help patch safety vulnerabilities.
· Collaborate with the quality assurance team to suggest improvements to evaluation guidelines when you encounter ambiguous or unclassifiable edge cases.
· Participate in regular “cross-checking” sessions with other evaluators to calibrate scoring standards and ensure inter-rater reliability across the global team.
· When a model underperforms, dig deeper than the surface score to hypothesize “why” the model made a specific error (e.g., training data vs. prompt misinterpretation).
· Identify and flag novel or unexpected model behaviors to the research team, contributing to a living library of unique model outputs and failure modes.
Requirements:
· Minimum of 2 years of professional experience in a relevant field such as; Computational Linguistics, Data Analysis, Technical Writing, Quality Assurance (specifically for NLP/AI), or cognitive science.
· Bachelor’s degree in Computer Science, or a relating field.
· Deep understanding of how-to craft prompts to elicit specific behaviors and test model limits.
· Ability to look at a text output and explain “why” it is “good” or “bad” based on logic, tone, factuality, and instruction adherence.
· Experience working with Reinforcement Learning from Human Feedback (RLHF) data collection.
· Proven experience monitoring and improving consistency among evaluation teams. Ability to analyze IAA scores and conduct calibration sessions to align judgement.
· Experience sourcing, cleaning, and annotating datasets specifically for the fine-tuning or evaluating LLMs. Understanding of data distribution and its impact on model performance.
· Familiarity with A/B testing concepts applied to AI. Ability to help design experiments to test if a new model version is truly “better” than the previous one.
6000 $
...Job Title: 3D Modeling Job Type: Full-Time / Contract Location: Remote Job Summary: We are seeking a highly creative... ..., high-quality 3D assets for our digital projects. Job Responsibilities Design and build high-quality 3D models (both organic and...12000 $
...Job Title: Legal Data Analyst Location: Remote (Worldwide) Job Summary: The Legal Data Analyst is responsible for collecting, analyzing, and interpreting legal... ...and refine prompts for large language models (LLMs), and evaluate model outputs for accuracy,...12000 $
...Job Title: Risk Analyst Location: Remote (Worldwide) Job Summary: The Risk Analyst is responsible for identifying, assessing, and mitigating... ...to detect vulnerabilities, evaluate exposure levels, and... ...Design and implement rules, models, and workflows to detect and...12000 $
.... QA Analyst (Quality Assurance Analyst) Job Summary: We are looking for a QA Analyst ensures the quality, consistency... ...content, and operational workflows. This role evaluates annotated datasets, AI model responses, software outputs, and content deliverables to ensure...- We are recruiting to fill the position below: Job Title: Monitoring and Evaluation Assistant Job ID: 2026-7887 Location: Birnin, Kebbi Job type: Full-time
- ...collection system and of the related findings and results. RESPONSIBILITIES Tasks Contribute to the development and update of... ...collection tools, systems and DB for the correct monitoring and evaluation of the projects for internal and external use, in line with...
12000 $
...Job Title: Document Analyst Location: Remote (Worldwide) Job... ...: The Document Analyst is responsible for reviewing, verifying, and... ...engineers and data scientists on model performance. Identify... ...used for model training and evaluation. · Perform quality checks on...- ...resilient communities through our humanitarian and development response in Nigeria. Plan Nigeria works with communities, civil society... ...strengthened in Adamawa and Sokoto states. End of Project Evaluation Objectives Assess how well the project’s strategies addressed...
6000 $
...Job Title: Financial Analyst Location: Remote Job Summary : We are seeking for a... ...making across the organization. You will be responsible for analyzing financial data, preparing... ...financial outcomes. Build financial models and conduct scenario analysis to support...12000 $
...Web Content Analyst Annotator Job Summary: We are seeking for a... ...Analyst Annotator to ensure evaluating, categorizing, and annotating... ...websites to support AI systems responsible for content understanding, ranking... ...and will help train AI models to interpret online information...12000 $
...Job Title: Web Data Analyst Annotator Job Summary: We are seeking... ...and Large Language Models (LLMs). This role combines web... ...driven technologies. Key Responsibilities · Analyze and annotate web... ...websites and online sources. · Evaluate webpage quality, relevance,...- ...the project, and CASFOD from July 2025 to date. The integrated response sought to address the need risks and gaps identified by the... ...as recommended in the MSNA for ECHO HIP 2023 Purpose of the Evaluation The purpose of the end line evaluation is to independently...
- We are recruiting to fill the position below: Job Title: Volunteer – Emergency Response Location: Lagos Job type: Contract
- ...We are looking for detail-oriented English (Nigeria) Audio Evaluators to join an exciting language evaluation project. In this role, you will assess short audio clips in English (Nigerian) , focusing on accent recognition and the naturalness of speech. Your work will...
12000 $
...Job Title: Trust and Safety Analyst Location: Remote (Worldwide) Job Summary: The Trust and Safety Analyst play a critical... ...identifying, assessing, and mitigating online risks. This position is responsible for enforcing community standards, investigating policy...- We are recruiting to fill the position below: Job Title: Procurement Analyst Location: Abuja (FCT) Employment Type: Full-time
12000 $
...Data Operations Analyst Job Summary: We are seeking for a Data Operations Analyst to ensure smooth handling, quality, and management... ...in managing data pipelines and operations. Key Responsibilities · Monitor and manage data pipelines and workflows. · Validate...- We are recruiting to fill the position below: Job Title: Analyst – CVM Analytics Job Identification: 7430 Location: Ikoyi, Lagos Job Category: MTN Level 2 Reports To: Manager – CVM Analytics Division: Marketing
- We are recruiting to fill the position below: Job Title: Junior Quant / Scorecard Analyst Location: Yaba, Lagos Employment Type: Full-time
- We are recruiting to fill the position below: Job Title: Analyst – Data Analytics, Network Job Identification: 7452 Location: Ikoyi, Lagos Reports To: Manager – Network Business Performance and Quality Assurance Division: Network
- We are recruiting to fill the position below: Job Title: Talent Acquisition Analyst Location: Ibese Plant, Ogun Department: Human Resources Reports To: Talent Acquisition & Talent Management Lead
- We are recruiting to fill the position below: Job Title: Analyst – Service Assurance, Digital Services Job Identification: 7466 Location: Ikoyi, Lagos
- We are recruiting to fill the position below: Job Title: Inbound Sales Analyst Location: Lagos , Hybrid Employment Type: Full-time
- We are recruiting to fill the position below: Job Title: Inventory Analyst Lead Location: Lagos
- We are recruiting to fill the position below: Job Title: Inventory Recovery Analyst Location: Lagos
- We are recruiting to fill the position below: Job Title: Production Analyst Location: Lagos
- We are recruiting to fill the position below: Job Title: Quality Control Analyst (ICT) Location: Maryland, Lagos Employment Type: Full-time
- Applications are invited from suitably qualified candidates for the position below: Job Title: System Programmer / Analyst II Location: Amizi, Abia
- We are recruiting to fill the position below: Job Title: Analyst – Sponsorship Product launches Edutainment, Marketing Job Identification: 7431 Location: Ikoyi, Lagos
- We are recruiting to fill the position below: Job Title: HR Data Analyst – Compensation & Rewards Location: Lagos Employment Type: Full Time
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to LLM Evaluator (Model Response Analyst). Be the first to apply!
