AI Prompt Engineer

Brainscape · New York, NY · $12k

Actively hiring · Posted 4 months ago

Job poster: Andrew Cohen

CEO @ Brainscape | Optimizing learning, using cognitive science | SaaS & EdTech advisor and investor

Brainscape, the world's leading web & mobile EdTech study platform, is seeking an AI Prompt Engineer to help us ship and maintain high-quality generative AI features that help millions of learners create better flashcards.

You will be working directly with Brainscape's Knowledge Manager to iterate on LLM prompts, analyze real user data, and ensure our AI output meets a high quality bar - both at launch and as models evolve. The immediate priority is migrating and testing our existing bulk flashcard creation prompts in an updated AI environment with newer GPT models. These prompts power three user-facing features: importing pasted or uploaded content into flashcards, summarizing documents into flashcards, and generating flashcards from a user-described topic. From there, the role expands into ongoing QA, regression testing, and prompt optimization across all of Brainscape's AI features.

This is a part-time contract role (~5-10 hours/week, remote) through the end of 2026, with potential to extend or convert to a permanent position. Hourly rate is $40-$100 (based on experience and location).

Responsibilities

Migrate and test existing bulk flashcard creation prompts in an updated AI environment with newer GPT models - and plan future migrations as OpenAI retires older models
Run test suites and manually review AI outputs for quality and correctness, refining prompts based on the results
Analyze real user data to identify failure patterns and inform prompt improvements
Streamline testing and evaluation workflows to make QA faster and more repeatable
Monitor production quality post-launch and detect regressions as underlying models shift
Build and maintain model evaluation datasets from real user inputs across all AI features
Write new test cases for edge cases, multilingual content, and messy real-world inputs
Document prompt changes, test results, and lessons learned
Work with the Content Team to apply flashcard authoring quality standards

Qualifications

1+ years hands-on prompt engineering experience with LLMs / OpenAI API (systematic testing and iteration, not just casual ChatGPT usage)
Familiarity with Cursor IDE or similar AI-assisted development tools (our work is primarily Python - Cursor experience is more important than raw Python skill)
Some experience with Git version control and collaborating via shared repositories (we use GitLab)
A habit of documenting what you tried, what worked, and why - you don't need a formal QA background, but you naturally keep track of your process
Clear written communication skills
Proactive attitude; ability to work independently and manage your own time
BONUS: Experience building prompt evals, AI quality assurance, or using GPT to grade GPT outputs
BONUS: Experience with regression testing for AI systems or detecting model drift
BONUS: Background in education technology (EdTech) or content creation - especially microlearning, flashcards, or other concise Q&A formats
BONUS: A degree in Computer Science, Information Science, or a similar field

To Apply
Do NOT apply on LinkedIn. Please apply at the following link: https://brainscape.breezy.hr/p/a0807688996d-ai-prompt-engineer

Seniority level

Entry level

Employment type

Contract

Job function

Engineering and Information Technology

Industries

E-Learning Providers
