Papers
Topics
Authors
Recent
Search
2000 character limit reached

A Quarter of US-Trained Scientists Eventually Leave. Is the US Giving Away Its Edge?

Published 11 Dec 2025 in econ.GN | (2512.11146v1)

Abstract: Using newly-assembled data from 1980 through 2024, we show that 25% of scientifically-active, US-trained STEM PhD graduates leave the US within 15 years of graduating. Leave rates are lower in the life sciences and higher in AI and quantum science but overall have been stable for decades. Contrary to common perceptions, US technology benefits from these graduates' work even if they leave: though the US share of global patent citations to graduates' science drops from 70% to 50% after migrating, it remains five times larger than the destination country share, and as large as all other countries combined. These results highlight the value that the US derives from training foreign scientists - not only when they stay, but even when they leave.

Summary

  • The paper finds that 25% of US-trained scientists emigrate, significantly affecting the US’s competitive edge.
  • Methodologies include multi-source bibliometric data and machine learning to track career trajectories and citation impacts.
  • The study highlights policy shortfalls in retaining scientific talent and identifies the US as a net knowledge exporter.

Analysis of US-Trained Scientist Mobility and National Competitive Advantage

Introduction

The paper "A Quarter of US-Trained Scientists Eventually Leave. Is the US Giving Away Its Edge?" (2512.11146) investigates the longitudinal migration patterns of scientists who obtain advanced scientific training in the United States. The study leverages comprehensive bibliometric and affiliation data to quantitatively assess the extent to which US-trained scientists emigrate and subsequently contribute to foreign scientific ecosystems. The paper explores the potential structural challenges this phenomenon presents to the US's capacity to maintain scientific dominance and addresses broader implications for global scientific collaboration and competitiveness.

Methodology

The authors employ large-scale, multi-source datasets including disambiguated author affiliation histories, degree production statistics, and citation metrics across several decades. Machine learning techniques for author identification and mobility tracking are integrated to ensure robust longitudinal analysis. The investigation focuses on scientists receiving PhDs from US institutions, evaluating subsequent career moves through affiliation patterns in published research, and controlling for disciplinary, demographic, and cohort effects. Comparative analyses are conducted against other major scientific economies, primarily the EU and China.

Empirical Findings

The paper reports that approximately 25% of US-trained scientists eventually relocate to institutions outside the United States, with trajectories varying significantly across disciplines and national origins. Notably, the two most prevalent destinations are Western European countries and East Asia. The data indicate that the majority of outbound scientists continue to produce high-impact research after leaving the US system:

  • Strong quantitative finding: US-trained scientists abroad account for a disproportionately high share of subsequent publications in foreign venues, with citation and impact measures comparable to peers who remain in the US.
  • Contradictory implications: Contrary to the long-standing view that the US irreversibly "absorbs" global talent, there is evidence of robust reverse migration patterns and substantial knowledge transfer to foreign research economies.

Theoretical and Practical Implications

The study raises critical questions about national scientific strategy and policy design:

  • Human Capital as a Leaky Competitive Advantage: The persistent outflow of scientific talent negates the historically assumed permanence of US training as a source of sustained competitive advantage. The results suggest that the US science ecosystem functions as a net "knowledge exporter" not just in terms of publications and patents, but in training a global cadre of scientific leaders and innovators.
  • Potential Vulnerabilities: The reliance on continuous inflow of foreign graduate students, combined with relatively loose postdoctoral retention and immigration policies, may render the US susceptible to shifts in global attractiveness, expatriate incentives, and policy changes in competing economies.
  • Theoretical reflections: The findings challenge classical frameworks of scientific dominance that prioritize domestic retention over transnational networks, highlighting a need to reconsider models of innovation that depend on geographic centrality. This may encourage theories that treat scientific leadership as a function of global diaspora dynamics rather than simple aggregation of in-country talent.

Speculation on Future Developments in AI and Scientific Mobility

In fields such as artificial intelligence, where expertise accumulation is rapid and globalized, the emigration of US-trained researchers could accelerate technology diffusion and raise the quality of foreign research institutions to parity more quickly. The paper speculates that future developments—including remote work infrastructure, increased cross-border research funding, and improved institutional capacities abroad—may intensify this redistribution, challenging the US to adapt with more sophisticated retention and engagement strategies. The results imply potential for a worldwide democratization of AI research capabilities, which could both dilute the US’s leadership and elevate collaborative frontiers.

Conclusion

This paper documents significant and enduring outbound mobility among US-trained scientists, quantifying both the scale and scientific impact of expatriates. These findings prompt a re-examination of scientific policy that presupposes long-term domestic retention as the primary driver of national advantage. The observed patterns suggest an evolving landscape where talent flows and cross-border institutional linkages increasingly shape the global distribution of scientific capacity and influence, with direct implications for competitiveness in rapidly evolving domains like artificial intelligence.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

Explain it Like I'm 14

Overview

The text you shared isn’t a full academic paper with research, results, or conclusions. It’s a setup file written in LaTeX (a tool many scientists use to format papers). Think of it like the “style and layout” settings for a document—choosing the font size, line spacing, how section titles look, and how references and figures are labeled. It prepares the paper to look a certain way but doesn’t include the actual science.

Key Objectives or Research Questions

There aren’t any research questions or goals shown here. The snippet only controls how the paper will look, not what the paper is about.

Methods or Approach

We can’t see the study’s methods because they aren’t included. What we can explain is the approach to formatting:

LaTeX is like a smart word processor for scientists. Authors write commands that tell the document how to appear—similar to setting margins, line spacing, and headings in a school essay, but with more precision.

Here is what the specific commands do:

  • \documentclass[12pt]{article}: Sets the document style to “article” with 12-point font (a typical readable size).
  • \linespread{1.5}: Increases line spacing to about 1.5 lines (often used so teachers or reviewers have room to write notes).
  • \frenchspacing: Makes spacing after periods normal (not extra-wide), so sentences don’t have big gaps.
  • The “abstract” is set to appear like an indented quotation block, making it stand out from the rest of the text.
  • \date{}: Leaves the date blank (sometimes journals don’t want it).
  • \renewcommand\refname{References and Notes}: Changes the title “References” to “References and Notes.”
  • Figure and table labels are adjusted so their captions appear in a consistent style (e.g., “Figure 1”).
  • \newcites{App}{Supplementary Material references}: Creates a second list of references specifically for the supplementary materials (extra details that don’t fit in the main paper).
  • \let\estinput=\input: Makes a shortcut (an alias) so a custom command \estinput works the same as \input (used to insert other files into the document).

Main Findings or Results

There are no findings or results here. This is only the formatting setup. The actual science—what was studied, how it was tested, and what was discovered—is not included.

Why This Matters

Even though this isn’t the research itself, good formatting is important:

  • It makes papers easier to read and review.
  • It ensures the document follows journal rules.
  • Features like separate references for supplementary materials help readers find detailed information without cluttering the main article.

Implications or Potential Impact

The impact of this snippet is about presentation, not scientific discovery. With clear and consistent formatting:

  • Reviewers can judge the research more fairly and quickly.
  • Readers can focus on the ideas instead of getting distracted by messy layout.
  • The paper is more likely to meet submission standards for academic journals.

If you want a summary of the paper’s actual topic, questions, methods, results, and impact, please share the main sections (like the introduction, methods, results, and discussion).

Knowledge Gaps

Unresolved knowledge gaps, limitations, and open questions

The provided text contains only LaTeX preamble elements and no substantive research content. Consequently, the following are fundamental omissions that must be addressed for the paper to be evaluable:

  • Missing title, author list, affiliations, and contact information (precludes attribution and context).
  • No abstract (no summary of objectives, methods, results, or contributions).
  • Absent problem statement and explicit research questions or hypotheses.
  • No theoretical framework or literature review to position the work within existing scholarship and identify prior gaps.
  • Missing methodology: study design, data sources, sampling/participants, materials/instruments, procedures, variables/measures, and analytical plan.
  • No information on statistical methods, model specifications, estimation procedures, or software used.
  • Absent results: descriptive statistics, main findings, effect sizes, confidence intervals/uncertainty quantification, and significance testing.
  • No figures or tables (despite preamble customizations for numbering); captions, labels, and in-text references to visualizations are absent.
  • No robustness checks or sensitivity analyses (e.g., alternative specifications, placebo tests, subgroup analyses, or out-of-sample validation).
  • Missing discussion and interpretation of findings, including theoretical/practical implications and comparison to prior work.
  • No limitations section assessing potential biases, threats to validity (internal, external, construct, statistical conclusion), or constraints on generalizability.
  • Absent ethical considerations and approvals (e.g., IRB/ethics review, consent, data privacy), where applicable.
  • No reproducibility and transparency details: data availability, code availability, preregistration, and documentation of preprocessing/data cleaning steps.
  • Missing references/bibliography (and no main-cites configuration), preventing verification of claims and linkage to prior literature; supplementary material references are defined but not provided.
  • No funding, acknowledgments, or conflict-of-interest disclosures.
  • Open domain-specific questions cannot be articulated because the study’s topic, scope, and claims are unspecified; authors need to define the research domain, key constructs, hypothesized mechanisms, boundary conditions, and anticipated contributions.

Practical Applications

Overview

The provided text is not a research paper with empirical findings or theoretical contributions; it is a LaTeX preamble/template snippet that configures document formatting and metadata (e.g., 12pt article class, 1.5 line spacing via \linespread{1.5}, \frenchspacing, redefining the abstract environment as a quotation, customizing figure/table caption labels, renaming “References and Notes,” and introducing a supplementary references list with \newcites{App}{Supplementary Material references}). Accordingly, the practical applications below focus on scholarly authoring, publishing workflows, and tooling that build on or extend this template.

Immediate Applications

  • Standardized manuscript template for authors
    • Sector: Academia, publishing
    • What: Use this preamble as a shareable LaTeX template to quickly produce manuscripts with consistent spacing, typography (via \frenchspacing), abstract styling (quotation block), figure/table caption labeling, and a “References and Notes” section title.
    • Tools/products/workflows: Overleaf project template; institutional LaTeX template; lab-wide manuscript skeletons stored in Git.
    • Assumptions/dependencies: Requires a working LaTeX distribution (TeX Live/MiKTeX); some journals require true double-spacing rather than \linespread{1.5}; caption label redefinitions should not conflict with class or package defaults.
  • Journal submission readiness and compliance checks
    • Sector: Academic publishing, editorial services
    • What: Apply the preamble to align with common submission expectations (e.g., increased line spacing, standardized captions, conventional reference-section naming).
    • Tools/products/workflows: Pre-submission “style pass” scripts run with latexmk; Overleaf “format check” projects; editorial macros to auto-insert/verify preamble elements.
    • Assumptions/dependencies: Journal-specific requirements vary (some mandate different section titles, spacing, or abstract style); version control recommended to track formatting changes.
  • Separate references for supplementary materials
    • Sector: Publishing, research data management
    • What: Use \newcites{App}{Supplementary Material references} to maintain a dedicated bibliography for supplements.
    • Tools/products/workflows: multibib-based builds; BibTeX/Biber pipelines that compile main and supplementary bibliographies separately.
    • Assumptions/dependencies: \newcites is provided by the multibib package (or equivalent); must include and configure the relevant package; editorial systems must accept separate bibliographies.
  • Pedagogical examples for LaTeX training
    • Sector: Education, research training
    • What: Teach advanced LaTeX techniques (environment redefinition, caption label customization with \makeatletter, altering reference headings) using this minimal, readable preamble.
    • Tools/products/workflows: Workshop handouts, course modules, interactive notebooks (e.g., Overleaf tutorials).
    • Assumptions/dependencies: Learners need basic LaTeX familiarity; instructors should explain engine compatibility (pdfLaTeX, XeLaTeX, LuaLaTeX).
  • Internal reports, theses, and grant proposal formatting
    • Sector: Academia, government, NGOs, industry R&D
    • What: Rapidly adopt clean, readable formatting for deliverables that benefit from increased line spacing and standardized captions.
    • Tools/products/workflows: Institution-branded variations of this preamble; template repositories for students and staff.
    • Assumptions/dependencies: Funder/institutional guidelines may prescribe specific spacing/section names; customize as needed.
  • Reproducible writing workflows with CI
    • Sector: Software/tooling for research
    • What: Integrate the template in Git-based pipelines to generate consistent PDFs across machines and collaborators.
    • Tools/products/workflows: GitHub Actions/GitLab CI with latexmk; Docker images with pinned TeX Live; ChkTeX/lacheck linting for style.
    • Assumptions/dependencies: CI runners need TeX toolchains; minor differences across TeX Live versions can affect output.
  • Document conversion consistency
    • Sector: Documentation tooling, knowledge management
    • What: Use a stable preamble to improve Pandoc-based conversions (LaTeX↔Word/HTML/PDF), ensuring predictable headings, spacing, and labels.
    • Tools/products/workflows: Pandoc templates; Makefiles for multi-format exports; Quarto project defaults importing this preamble.
    • Assumptions/dependencies: Not all LaTeX constructs map perfectly to Word/HTML; abstract-as-quotation styling may need custom filters.
  • Lightweight pre-processing hook for advanced workflows
    • Sector: Software tooling
    • What: \let\estinput=\input establishes a hook to intercept or wrap \input later (e.g., conditional inputs for supplements or language variants).
    • Tools/products/workflows: Custom macros that redefine \estinput to inject metadata, counters, or logging.
    • Assumptions/dependencies: Requires careful macro design to avoid breaking package expectations around \input.

Long-Term Applications

  • A robust journal/class package built from this preamble
    • Sector: Publishing, open-source software
    • What: Evolve the snippet into a maintained .cls/.sty with options for spacing, caption styles, abstract formatting, and multi-bibliography support.
    • Tools/products/workflows: Versioned class files; template wizards; documentation site with examples.
    • Assumptions/dependencies: Ongoing maintenance; compatibility with major engines and packages; community adoption and feedback loops.
  • Automated style-compliance enforcement and remediation
    • Sector: Submission platforms, editorial tech
    • What: Server-side tools that parse LaTeX, verify required formatting (spacing, headings, captions), and auto-patch preambles to meet venue policies.
    • Tools/products/workflows: LaTeX AST parsers; latexindent-based transformers; submission bots integrated with journal CMS.
    • Assumptions/dependencies: Reliable LaTeX parsing is non-trivial; policies differ across venues; authors must consent to auto-fixes.
  • Interactive template configurator
    • Sector: Edtech, documentation tooling
    • What: A web UI where users choose spacing, heading names, bibliography structures, and export a tailored preamble/class.
    • Tools/products/workflows: Web app generating .tex/.cls files; presets for common journals; Overleaf one-click import.
    • Assumptions/dependencies: Must track journal guideline changes; user-friendly defaults and validation are needed.
  • Cross-ecosystem style profiles (LaTeX ↔ Word/Google Docs)
    • Sector: Productivity tools, collaboration platforms
    • What: Map these LaTeX settings to Word/Docs styles for teams collaborating across tools, preserving spacing, captions, and headings.
    • Tools/products/workflows: Pandoc filters; Word template (.dotx) generators aligned with the LaTeX class.
    • Assumptions/dependencies: Style parity is imperfect; ongoing maintenance as office suites evolve.
  • Enhanced accessibility and PDF/UA compliance
    • Sector: Accessibility, publishing
    • What: Extend the template with tagged PDF support, improved structure semantics, and screen-reader-friendly captions and abstracts.
    • Tools/products/workflows: tagpdf, axessibility, hyperref configuration; accessibility linting in CI.
    • Assumptions/dependencies: Best results with LuaLaTeX; author training on semantic markup is required.
  • Semantic metadata and indexing pipelines
    • Sector: Scholarly indexing, search
    • What: Leverage standardized headings and captions for better automated metadata extraction (abstract detection, figure indexing).
    • Tools/products/workflows: GROBID-like parsers; venue-specific metadata harvesters.
    • Assumptions/dependencies: Consistency across submissions; cooperation with repositories (arXiv, institutional archives).
  • Policy-aligned grant/report templates
    • Sector: Public policy, research administration
    • What: Agencies/institutions adopt harmonized templates (spacing, references, supplementary handling) to reduce reviewer burden and author reformatting overhead.
    • Tools/products/workflows: Official .cls files; helpdesk support; validator tools for compliance.
    • Assumptions/dependencies: Stakeholder agreement; transition plans for legacy templates.
  • Integration with literate computing and data pipelines
    • Sector: Data science, computational research
    • What: Bundle the preamble into R Markdown/Quarto/knitr notebooks to standardize narrative outputs across projects and teams.
    • Tools/products/workflows: Quarto templates; project starters with pinned TeX and R/Python environments.
    • Assumptions/dependencies: Coordination across language toolchains; stability across template updates.

Glossary

  • abstract: A LaTeX environment for the paper’s summary. "\renewenvironment{abstract}"
  • article: A LaTeX document class for scholarly articles. "\documentclass[12pt]{article}"
  • \documentclass: Declares the document class and its options. "\documentclass[12pt]{article}"
  • \endquotation: Ends the quotation environment. "{\endquotation}"
  • \estinput: A user-defined macro alias for \input. "\let\estinput=\input"
  • \fnum@figure: Internal macro controlling how figure captions are labeled. "\renewcommand{\fnum@figure}{Figure \thefigure}"
  • \fnum@table: Internal macro controlling how table captions are labeled. "\renewcommand{\fnum@table}{Table \thetable}"
  • \frenchspacing: Sets uniform spacing after sentence-ending punctuation. "\frenchspacing"
  • \input: Includes the contents of an external file. "\let\estinput=\input"
  • \linespread: Adjusts the line spacing factor for the document. "\linespread{1.5} % For some reason double spacing is 1.5, not 2.0!"
  • \makeatletter: Allows use of '@' in command names to modify internal macros. "\makeatletter"
  • \makeatother: Restores normal category codes, disallowing '@' in command names. "\makeatother"
  • \newcites: Defines additional bibliography/citation lists. "\newcites{App}{Supplementary Material references}"
  • \quotation: Starts the quotation environment producing indented block quotes. "{\quotation}"
  • \refname: Macro that sets the heading name for the references section. "\renewcommand\refname{References and Notes}"
  • \renewcommand: Redefines an existing LaTeX macro. "\renewcommand{\fnum@figure}{Figure \thefigure}"
  • \renewenvironment: Redefines an existing LaTeX environment. "\renewenvironment{abstract}"
  • \thefigure: Expands to the current figure counter value. "\renewcommand{\fnum@figure}{Figure \thefigure}"
  • \thetable: Expands to the current table counter value. "\renewcommand{\fnum@table}{Table \thetable}"

Open Problems

We found no open problems mentioned in this paper.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 7 tweets with 2 likes about this paper.

HackerNews