Generative AI Software Engineer Co-Op

Super Micro Computer, Inc. · San Jose, CA, US · $62k

Date: Feb 12, 2026

Location: San Jose, California, United States

Company: Super Micro Computer

**Job Req ID: 28406**

**About Supermicro:**

Supermicro® is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC, and IoT/Embedded customers worldwide. We are the #5 fastest-growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary:

AI is evolving at an unprecedented pace — new models, new frameworks, new paradigms every month. We're looking for someone who's excited to learn fast and build real AI systems.

As a GenAI Software Engineer Co-Op at Supermicro, you'll work on AI Agents and LLM-powered applications that help customers find the right server solutions. You'll build systems that can reason, retrieve information, use tools, and interact with databases — all running on powerful GPU infrastructure including NVIDIA H100, H200, and GH200.

This is not a research internship where you read papers all day. You'll ship production code that real users interact with.

**Essential Duties and Responsibilities:**

Includes the following essential duties and responsibilities (other duties may also be assigned):

**What You’ll Work On:**

With mentorship from senior engineers, you may help with:

  • AI Agents & Autonomous Systems
  • LLM Applications & RAG
  • Data & Infrastructure

What You’ll Learn:

By the end of the Co-Op, you'll have hands-on experience in:

  • AI Agent architectures: Planning, tool use, multi-step reasoning
  • LLM application development: Prompt engineering, RAG, fine-tuning concepts
  • Modern AI stack: Vector databases, embedding models, rerankers
  • GPU infrastructure: Deploying AI workloads on enterprise hardware
  • Production engineering: Testing, monitoring, iterating on real systems

**Qualifications:**

**What We’re Looking For:**

Minimum Requirements:

  • Called an LLM API — OpenAI, Anthropic, or any LLM provider (not just ChatGPT web UI)
  • Know what a Vector Database is — and why it's used for semantic search
  • Built something with AI — a chatbot, RAG app, agent, or any AI-powered project
  • Good Python skills — can write clean, working code independently

Required

  • Currently pursuing a bachelor’s degree in Computer Science, Software Engineering, Computer Engineering, or a related field
  • Genuine curiosity about AI — you follow the latest developments
  • Fast learner — comfortable with ambiguity and rapid change
  • Self-motivated and able to work independently

Nice to Have

  • Has built a RAG system, chatbot, or AI agent (even a weekend project!)
  • Experience with LLM frameworks (LangChain, LlamaIndex, CrewAI, etc.)
  • Familiarity with vector databases (Qdrant, Pinecone, Milvus, Chroma, etc.)
  • Docker and Linux command line experience
  • Database knowledge (PostgreSQL, SQL)
  • Side projects, hackathons, Kaggle, open-source contributions — all count!

Salary Range

$30/hr

The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

EEO Statement

Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.
