Castalia Systems
AI

Data Scientist/Data Engineer- Mid

Castalia Systems · Charlottesville, VA, US · $401k

Actively hiring Posted 3 days ago

Job Type: Full Time

Workplace Type: Onsite in Charlottesville, VA

Clearance: TS/SCI with a CI polygraph

**Must be a U.S. Citizen

Benefits:** Medical, dental, and vision coverage, 401k matching, generous PTO, paid holidays, professional training opportunities, and even pet insurance to ensure your furry friends are cared for too.

Job Summary

Ever-expanding technology like IoT, machine learning, and artificial intelligence means that there’s more structured and unstructured data available today than ever before. As a Data Scientist/Data Engineer, you know that organizing data can yield pivotal insights when it’s gathered from disparate sources. We need an experienced data professional like you to help our clients find answers in their data to impact important missions—from fraud detection to cancer research to national intelligence. As a Data Scientist/Data Engineer at Castalia Systems, you’ll use your expertise to help build advanced technology solutions and lead data engineering activities on some of the most mission-driven projects in the industry. You’ll guide data engineering activities by overseeing the development and deployment of pipelines and platforms that organize and make disparate data meaningful. Here, you’ll mentor a multi-disciplinary team of analysts, data engineers, developers, and data consumers in a fast-paced, agile environment. You’ll use your expertise in analytical exploration and data examination while you oversee the assessment, design, building, and maintenance of scalable platforms for your clients.

? **Roles and Responsibilities

This role encompasses both Data Science and Data Engineering disciplines. It requires raw coding experience without AI assisted IDEs. A qualified candidate will perform the following duties and responsibilities, but are not limited to:**

  • Creating various ML-based tools or processes such as recommendation engines or automated lead scoring systems.
  • Performs statistical analysis, applies data mining techniques, and builds high quality prediction systems.
  • ?Should be skilled in data visualization and use of graphical applications, including Microsoft Office (Power BI) and Tableau; major data science languages, such as R and Python; managing and margining of disparate data sources, preferably through Python, or SQL; statistical analysis; and data mining algorithms.
  • ?Should have prior experience with large data Multi-INT analytics, ML, and automated predicative analytics.
  • Conducts data analytics, data engineering, data mining, exploratory analysis, predicative analysis, and statistical analysis, and uses scientific techniques to correlate data into graphical, written, visual and verbal narrative products, enabling more informed analytic decisions.
  • Proactively retrieves information from various sources, analyzes it for better understanding about the data set, and builds AI tools that automate certain processes.

Required Qualifications:

  • Bachelor’s degree or equivalent experience
  • 3+ years of experience designing, developing, operating, and maintaining complex data applications
  • ?3+ years of experience with raw coding of Python or Java, PostgreSQL or similar databases
  • ?3+ years of experience developing code for data manipulation, data analysis, or data modeling
  • ?Experience creating and integrating code for retrieving, parsing, and processing data across multiple systems or applications
  • ?Experience developing scripts and programs for converting various types of data into usable formats and supporting project teams to scale, monitor, and operate data platforms in support of specific mission goals
  • ?Experience in the development of algorithms leveraging Python and SQL
  • ?Experience with Python data science and visualization packages
  • ?Experience with leading the development of solutions to complex problems
  • ?Experience working within a TS environment with onsite development tools
  • ?Experience with database administration or data analytics
  • ?Experience with structured and unstructured data
  • ?CompTIA Security+

Preferred Qualifications:

  • 12+ years of experience in Extract, Transform, and Load (ETL) processes
  • 7+ years of experience designing, developing, operationalizing, and maintaining complex data applications
  • ?7+ years of experience with raw coding of Python or Java, PostgreSQL or similar databases
  • ?7+ years of experience developing code for data manipulation, data analysis, or data modeling
  • ?3+ years of experience with AI and ML implementation and development
  • ?Experience with DevSecOps tools, including GitHub
  • ?Experience working within the Intel Community
  • ?Experience with a cloud environment, including AWS, Microsoft Azure, or Google Cloud
  • ?Experience with Streamlit, Tkinter
  • ?Experience with Agile software development

Requirements/Work Environment

  • Normal office environment.

Travel

  • Minimal travel

Company Description

Castalia Systems is a proven business partner providing mission critical solutions to the Federal Government. We provide cutting edge solutions from Securing and Managing Data to Systems Engineering and Development. Castalia Systems is a pioneer in Artificial Intelligence Design and Application.

With our vast knowledge of our customers’ needs and relevant technology, our team is able to bring successful solutions to every mission. We are one-upping our competitors by providing premium IT solutions and platforms with cutting-edge technology so it’s so evident when you compare us with anyone.

Disclaimer

Castalia Systems is an equal employment opportunity and affirmative action employer and strives to comply with all applicable laws prohibiting discrimination based on race, color, creed, sex, sexual orientation, age, national origin, or ancestry, physical or mental disability, veteran status, marital status, HIV-positive status, as well as any other category protected by federal, state, or local laws. All such discrimination is unlawful, and all persons involved in the operations of the company are prohibited from engaging in this type of conduct.

Tags & focus areas

Used for matching and alerts on DevFound
Fulltime Ai Machine Learning Data Science Data Engineer

Next step

Ready to Join the Team?

Apply once with DevFound. We'll route your profile to Castalia Systems and keep you informed when matching AI roles go live.

  • Single profile, multiple curated AI opportunities
  • No spam roles — only vetted AI positions
  • You choose which roles to apply to
Sign up to apply

No CV uploads. We never share your profile without your consent.

Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.