GistGarden

Will AI replace Data Scientists?

On paper, AI could touch ~75% of the work in Data Scientists — and unlike most jobs, it's already showing up in the real workday, not just the theory.

The Epicenter Where AI is already part of the workday.

O*NET-SOC 15-2051

How your 35 core tasks split

100% within AI's reach
11 AI can do this now
24 AI speeds this up
0 Still on you
AI could do · GPT-4 study
75%
29-pt gap
AI actually does · 2026 report
46%

Top = what GPT-4 judged AI could speed up. Bottom = how much AI was actually used for these tasks (Anthropic's March 2026 report, usage from Aug & Nov 2025). The gap is the real story.

⚡ The short answer

Back in 2023, GPT-4 judged AI could, in theory, assist with a high share of this job's tasks (~75%). By late 2025, real-world AI use had reached about 46% of its task activity (already common). The gap between that 2023 forecast and today is the real story.

Where this job sits among 738 jobs

Being automatedTicking (can, but unused)Relatively safeQuietly happeningYOU0%50%100%0%40%75% → How much AI could do (theory) → How much AI is actually used (late 2025)

Each dot is one of 738 U.S. jobs. Right = AI can do more of it. Up = AI is actually used more.

Stableconfidence

The signals here line up

Theoretical reach (~75%), real-world use (~46%) and the task-level picture mostly agree — so this read is more reliable than for jobs where the signals contradict each other. Even so, AI-risk estimates shift by model (a 2026 study saw the "high-risk" share swing 2.7%–51.5%), so treat these as directional, not destiny.

See all 35 tasks, ratedBased on real task-level AI scores — click to collapse
AI can already do this11 of 35
  • Maintain library of model documents, templates, or other reusable knowledge assets.
  • Create business intelligence tools or systems, including design of related databases, spreadsheets, or outputs.
  • Maintain or update business intelligence tools, databases, dashboards, systems, or methods.
  • Provide technical support for existing reports, dashboards, or other tools.
  • Create or review technical design documentation to ensure the accurate development of reporting solutions.
  • Conduct or coordinate tests to ensure that intelligence is consistent with defined needs.
  • Design and validate clinical databases, including designing or testing logic checks.
  • Prepare appropriate formatting to data sets as requested.
  • Design forms for receiving, processing, or tracking data.
  • Develop technical specifications for data management programming and communicate needs to information technology staff.
  • Write work instruction manuals, data capture guidelines, or standard operating procedures.
AI speeds this up24 of 35
  • Generate standard or custom reports summarizing business, financial, or economic data for review by executives, managers, clients, and other stakeholders.
  • Synthesize current business intelligence or trend data to support recommendations for action.
  • Manage timely flow of business intelligence information to users.
  • Collect business intelligence data from available industry reports, public information, field reports, or purchased sources.
  • Analyze competitive market strategies through analysis of related product, market, or share trends.
  • Identify or monitor current and potential customers, using business intelligence tools.
  • Disseminate information regarding tools, reports, or metadata enhancements.
  • Identify and analyze industry or geographic trends with business strategy implications.
  • Communicate with customers, competitors, suppliers, professional organizations, or others to stay abreast of industry or business trends.
  • Analyze technology trends to identify markets for future product development or to improve sales of existing products.
  • Process clinical data, including receipt, entry, verification, or filing of information.
  • Generate data queries, based on validation checks or errors and omissions identified during data entry, to resolve identified problems.
  • Develop project-specific data management plans that address areas such as coding, reporting, or transfer of data, database locks, and work flow processes.
  • Monitor work productivity or quality to ensure compliance with standard operating procedures.
  • Prepare data analysis listings and activity, performance, or progress reports.
  • Confer with end users to define or implement clinical system requirements such as data release formats, delivery schedules, and testing protocols.
  • Perform quality control audits to ensure accuracy, completeness, or proper usage of clinical systems and data.
  • Analyze clinical data using appropriate statistical tools.
  • Evaluate processes and technologies, and suggest revisions to increase productivity and efficiency.
  • Track the flow of work forms, including in-house data flow or electronic forms transfer.
  • Supervise the work of data management project staff.
  • Contribute to the compilation, organization, and production of protocols, clinical study reports, regulatory submissions, or other controlled documentation.
  • Read technical literature and participate in continuing education or professional associations to maintain awareness of current database technology and best practices.
  • Train staff on technical procedures or software program usage.
Still on you0 of 35
  • ⚠️ None — every core task is at least partly within AI's reach. The job won't vanish, but almost all of it changes.

My job is in The Epicenter 🌋

AI's already in the room. Guess I'll learn to aim it.

Theoretical estimate · not a prediction · gistgarden.com

How we measured this — and how fresh it is

AI's theoretical reach data: 2023

From GPTs-are-GPTs (Eloundou et al.), where GPT-4 rated how much of each task an AI tool could meaningfully speed up. This is the most recent open, commercially-usable occupation-level potential dataset — it dates to 2023. Newer multi-model re-runs exist but swing wildly (one 2026 study saw "high-risk" jobs range 2.7%–51.5% by model) and aren't openly licensed, so we show the stable 2023 baseline and pair it with newer real-world data.

Real-world AI use 2026 report

From the Anthropic Economic Index, which observes how real Claude conversations map onto each occupation's tasks. Published in Anthropic's March 2026 labor-market report, based on usage measured in Aug & Nov 2025 (Sonnet 4 / 4.5).

Task list & ratings O*NET 30.3

Tasks come from O*NET 30.3. Each task's "AI can do / speeds up / still on you" tier uses the real task-level exposure scores from GPTs-are-GPTs (E1 / E2 / E0) — not a guess from keywords.

Sources: O*NET 30.3 (CC BY 4.0) · GPTs-are-GPTs (MIT, 2023) · Anthropic Economic Index (CC BY, Aug & Nov 2025). Page compiled June 2026. "O*NET" is a trademark of the U.S. Department of Labor.

This page is for general informational purposes only and is not career, financial, or employment advice. AI exposure reflects research estimates of task overlap, not predictions about any individual's job, employer, or future employment.