Freelance AI Evaluation Engineer (Python/Full-Stack)

Mindrift
Remote Hungary Full-time 🌐 English
MI
Salary: $100k - $100k/year
Experience: Mid-level
Added to JobCollate: March 14, 2026

AI Summary Powered by Gemini

This freelance role involves creating challenging coding test cases for AI systems, requiring strong Python development, Full-Stack experience with React, and a background in test automation. The opportunity offers flexible hours and a competitive hourly rate for a remote position.

Job Description

We're looking for a freelance AI evaluation engineer with experience in software development, test automation, and Full-Stack development to create challenging coding test cases for AI coding systems.RequirementsDegree in Computer Science, Software Engineering, or related fields5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systemsExperience writing tests (functional, integration – not just running them)Docker containers (running evaluations locally in containers)CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)English proficiency - B2BenefitsUp to $50 per hour equivalentEstimated 20 hours of work per projectFlexible work scheduleOriginally posted on Himalayas

Full Description

We're looking for a freelance AI evaluation engineer with experience in software development, test automation, and Full-Stack development to create challenging coding test cases for AI coding systems.RequirementsDegree in Computer Science, Software Engineering, or related fields5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systemsExperience writing tests (functional, integration – not just running them)Docker containers (running evaluations locally in containers)CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)English proficiency - B2BenefitsUp to $50 per hour equivalentEstimated 20 hours of work per projectFlexible work scheduleOriginally posted on Himalayas

Required Skills

Mid-Level-Full-Stack-AI-Developer-(Python-Azure) Python-AI-Engineer Software-Engineer-(AI) Mid-Level-Full-Stack-AI-Engineer Freelance-AI-Developer

Similar Jobs

Systems Analyst

name
Remote Australia, Brazil, Bulgaria, Canada, Denmark, France, Germany, India, Ireland, Japan, Mexico, Netherlands, Poland, Portugal, Singapore, Spain, Sweden, Ukraine, United Arab Emirates, United Kingdom, United States
View Details →

Backend Developer

name
Remote United States
View Details →