Will AI replace Data Scientists?
On paper, AI could touch ~75% of the work in Data Scientists — and unlike most jobs, it's already showing up in the real workday, not just the theory.
O*NET-SOC 15-2051
How your 35 core tasks split
Top = what GPT-4 judged AI could speed up. Bottom = how much AI was actually used for these tasks (Anthropic's March 2026 report, usage from Aug & Nov 2025). The gap is the real story.
Back in 2023, GPT-4 judged AI could, in theory, assist with a high share of this job's tasks (~75%). By late 2025, real-world AI use had reached about 46% of its task activity (already common). The gap between that 2023 forecast and today is the real story.
Where this job sits among 738 jobs
Each dot is one of 738 U.S. jobs. Right = AI can do more of it. Up = AI is actually used more.
The signals here line up
Theoretical reach (~75%), real-world use (~46%) and the task-level picture mostly agree — so this read is more reliable than for jobs where the signals contradict each other. Even so, AI-risk estimates shift by model (a 2026 study saw the "high-risk" share swing 2.7%–51.5%), so treat these as directional, not destiny.
See all 35 tasks, ratedBased on real task-level AI scores — click to collapse
- Maintain library of model documents, templates, or other reusable knowledge assets.
- Create business intelligence tools or systems, including design of related databases, spreadsheets, or outputs.
- Maintain or update business intelligence tools, databases, dashboards, systems, or methods.
- Provide technical support for existing reports, dashboards, or other tools.
- Create or review technical design documentation to ensure the accurate development of reporting solutions.
- Conduct or coordinate tests to ensure that intelligence is consistent with defined needs.
- Design and validate clinical databases, including designing or testing logic checks.
- Prepare appropriate formatting to data sets as requested.
- Design forms for receiving, processing, or tracking data.
- Develop technical specifications for data management programming and communicate needs to information technology staff.
- Write work instruction manuals, data capture guidelines, or standard operating procedures.
- Generate standard or custom reports summarizing business, financial, or economic data for review by executives, managers, clients, and other stakeholders.
- Synthesize current business intelligence or trend data to support recommendations for action.
- Manage timely flow of business intelligence information to users.
- Collect business intelligence data from available industry reports, public information, field reports, or purchased sources.
- Analyze competitive market strategies through analysis of related product, market, or share trends.
- Identify or monitor current and potential customers, using business intelligence tools.
- Disseminate information regarding tools, reports, or metadata enhancements.
- Identify and analyze industry or geographic trends with business strategy implications.
- Communicate with customers, competitors, suppliers, professional organizations, or others to stay abreast of industry or business trends.
- Analyze technology trends to identify markets for future product development or to improve sales of existing products.
- Process clinical data, including receipt, entry, verification, or filing of information.
- Generate data queries, based on validation checks or errors and omissions identified during data entry, to resolve identified problems.
- Develop project-specific data management plans that address areas such as coding, reporting, or transfer of data, database locks, and work flow processes.
- Monitor work productivity or quality to ensure compliance with standard operating procedures.
- Prepare data analysis listings and activity, performance, or progress reports.
- Confer with end users to define or implement clinical system requirements such as data release formats, delivery schedules, and testing protocols.
- Perform quality control audits to ensure accuracy, completeness, or proper usage of clinical systems and data.
- Analyze clinical data using appropriate statistical tools.
- Evaluate processes and technologies, and suggest revisions to increase productivity and efficiency.
- Track the flow of work forms, including in-house data flow or electronic forms transfer.
- Supervise the work of data management project staff.
- Contribute to the compilation, organization, and production of protocols, clinical study reports, regulatory submissions, or other controlled documentation.
- Read technical literature and participate in continuing education or professional associations to maintain awareness of current database technology and best practices.
- Train staff on technical procedures or software program usage.
- ⚠️ None — every core task is at least partly within AI's reach. The job won't vanish, but almost all of it changes.
How we measured this — and how fresh it is
AI's theoretical reach data: 2023
From GPTs-are-GPTs (Eloundou et al.), where GPT-4 rated how much of each task an AI tool could meaningfully speed up. This is the most recent open, commercially-usable occupation-level potential dataset — it dates to 2023. Newer multi-model re-runs exist but swing wildly (one 2026 study saw "high-risk" jobs range 2.7%–51.5% by model) and aren't openly licensed, so we show the stable 2023 baseline and pair it with newer real-world data.
Real-world AI use 2026 report
From the Anthropic Economic Index, which observes how real Claude conversations map onto each occupation's tasks. Published in Anthropic's March 2026 labor-market report, based on usage measured in Aug & Nov 2025 (Sonnet 4 / 4.5).
Task list & ratings O*NET 30.3
Tasks come from O*NET 30.3. Each task's "AI can do / speeds up / still on you" tier uses the real task-level exposure scores from GPTs-are-GPTs (E1 / E2 / E0) — not a guess from keywords.
Sources: O*NET 30.3 (CC BY 4.0) · GPTs-are-GPTs (MIT, 2023) · Anthropic Economic Index (CC BY, Aug & Nov 2025). Page compiled June 2026. "O*NET" is a trademark of the U.S. Department of Labor.
This page is for general informational purposes only and is not career, financial, or employment advice. AI exposure reflects research estimates of task overlap, not predictions about any individual's job, employer, or future employment.