Amazon.com
AI

Data Scientist, Amazon Books

Amazon.com · Madrid, MD, ES

Actively hiring Posted 4 months ago

DESCRIPTION

Amazon is looking for an analytical Data Scientist to tackle critical data quality challenges with our Amazon Books team. You'll dive deep into our vast Books Catalog data to uncover root causes of data issues and their downstream impacts, directly influencing how hundreds of millions of customers discover their next great read.

At Amazon Books we believe that reading is essential for a healthy society. As such, we aim to inspire readers by making it easy to read more and get more out of reading. We do this by creating an unmatched book discovery experience for our customers worldwide. We enable customers to discover new books, authors and genres through smart search tools, intelligent interactions and sophisticated recommendations, and we need your help to ensure our data foundation supports these experiences.

If you are looking for an opportunity to solve complex analytical problems in a fast-paced environment working within a smart and passionate team, this might be the role for you. You will conduct sophisticated analyses to identify data quality issues, perform root cause analysis to understand systemic problems, and build prototypes that demonstrate potential solutions. You will work at the intersection of data science, business intelligence, and product development to drive data-driven decisions that improve our catalog quality.

Key job responsibilities

In this role you will:

  • Conduct deep-dive analyses of Books Catalog data to identify quality issues, patterns, and anomalies that impact downstream applications and customer experiences
  • Perform rigorous root cause analysis using statistical techniques and data mining to understand the underlying drivers of data quality problems
  • Build analytical prototypes and proof-of-concepts using ML and agentic technology that demonstrate potential approaches to resolve identified issues
  • Collaborate with scientists, engineers, and product teams to communicate findings and influence data quality strategies
  • Design and implement scalable data extraction and analysis pipelines to monitor catalog health and track improvements over time
  • Translate complex analytical findings into clear, actionable insights for both technical and non-technical stakeholders
  • Stay current with data science methodologies and apply best practices to ensure reproducible, high-quality analysis

A day in the life

Day-to-day work varies, but on a typical day you will:

  • Run exploratory data analyses to investigate specific data quality concerns or validate hypotheses about catalog issues
  • Build visualizations and statistical models to quantify the impact of data problems on customer-facing applications
  • Prototype potential solutions using scripting languages and collaborate with engineering teams to assess feasibility
  • Present findings to stakeholders including product managers, subject matter experts, and engineering leaders, incorporating their feedback into your analysis
  • Participate in team meetings to review metrics, share insights, and contribute to strategic planning for catalog improvements

About the team

The team consists of a collaborative group of scientists, product leaders, and dedicated engineering teams. Our aim is to maintain the world's most accurate and descriptive set of books metadata, where every title in our catalog is uniquely characterized via a set of high-quality, concise attributes. We believe this is a foundational capacity for any bookstore. We work with sister teams to leverage our systems to drive a diverse array of customer experiences that enable customers to easily identify their ideal next read.

BASIC QUALIFICATIONS

  • Experience with machine learning/statistical modeling data analysis tools and techniques, and parameters that affect their performance
  • Experience applying theoretical models in an applied environment
  • Experience working as a Data Scientist
  • Experience with data scripting languages (e.g., SQL, Python, R, or equivalent) or statistical/mathematical software (e.g., R, SAS, Matlab, or equivalent)
  • Experience diving into data to discover hidden patterns and of conducting error/deviation analysis
  • Experience effectively communicating complex concepts through written and verbal communication

PREFERRED QUALIFICATIONS

  • Currently has, or is in the process of obtaining, a Master's degree or above in Math, Statistics, Computer Science, or related science field
  • Experience with AWS services including S3, Redshift, Sagemaker, EMR, Kinesis, Lambda, and EC2
  • Experience working in a fast-paced environment similar to a high-tech start-up

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Tags & focus areas

Used for matching and alerts on DevFound
Fulltime Data Science Ai
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.