Introduction

Artificial Intelligence (AI) has become a disruptive technology with far-reaching economic, social, political and regulatory repercussions. Recent advances in robotics and automation continue to reshape local labor markets and the global employment landscape1,2,3,4,5,6. In healthcare, machine learning algorithms support the global response to pandemic outbreaks7, transforming the medical image analysis field8,9, and making drug discovery faster, cheaper, and more effective10. On the road, autonomous vehicles hold the promise of improving traffic flow, reducing pollution, and preventing traffic accidents that result from human error11,12,13. In decision making, algorithms are poised to address some of the major societal challenges of today, e.g., by reducing gender bias in hiring decisions14,15, dropping crime rates with more informed bail16and enforcement decisions17, and improving the way societies are governed18. AI can even be helpful in the fight against corruption, with several governments and non-governmental organizations (NGOs) implementing AI-based anti-corruption tools that enable citizens to keep their bureaucratic officials in check19. Nevertheless, despite its potential, the perils of AI are too consequential to ignore. Artificially intelligent systems may reduce human error, but they may also exacerbate discrimination against minorities by targeting disadvantaged groups or training on data that reflect systematic and persistent biases20,21,22,23,24,25. Moreover, malicious AI systems may disrupt peace and incite violence by spreading false information26,27,28or increasing the threat of terrorism and autonomous weapons29,30,31.

Notwithstanding its perils, the technologies driven by AI are likely to underpin the security, prosperity, and welfare of the nations that harness them32. Given its potential to shape global competitiveness, the race for world leadership in AI adoption is intensifying globally, with countries developing national AI strategies in an effort to guide and foster its deployment through targeted investments and strategic collaborations33. China’s New Generation AI Development Plan34manifests its commitment towards making AI the driving force behind its industrial and economic transformation by 2025, and making China one of the world’s primary AI innovation centers by 203035. According to the latest report by the U.S.’s National Security Commission on AI, the U.S. could lose its technological predominance—the backbone of its economic and military power—to China, and should mobilize its intellectuals and allies to shift the tides in its favor36. The European Union, on the other hand, is planning to spend billions of euros to build a talent pipeline and fund research as part of its Coordinated Plan on AI37. Although it is not yet clear who the ultimate leader in the AI arena will be, that leader may well become the world’s next superpower38,39.

This study focuses on scientific collaborations between China-based and U.S.-based scientists in the field of AI. The rationale behind this design is twofold. First, the emergence of China as a leading nation in science40,41is changing the global balance of power and challenging the U.S.’s decades-long dominance in scientific production42,43. In 2020, for example, China produced the largest volume of Science and Engineering publications worldwide (23% of global output) followed by the U.S. (16%)44. In general, beyond just AI, the two countries are leading in terms of the amount spent on research and development (R&D), e.g., in 2019 the U.S. was the world’s leader ($656 billion) followed by China ($526 billion), with their expenditures representing 27% and 22% of the global total, together representing nearly half of the world’s R&D that year45. The gap between these two countries and the rest of the world is even more stark when considering AI venture capital funding in 2020, as 82% of the year’s global investment in this sector went to startups in the U.S. ($27.6 billion) and China ($16.9 billion)46. As we will demonstrate here, Chinese and U.S. AI research reside at the forefront of AI research in terms of novelty and impact.

The second reason is the fierce geopolitical tensions between the U.S. and China, which affected scientific collaborations between the two nations. Consider, for example, the policies and investigations launched under the Trump administration, including (i) the China Initiative—a program launched in 2018 by the U.S. Department of Justice to counter Chinese national security threats, with a particular emphasis on intellectual property and technology47, and (ii) the investigation of hundreds of scientists by the National Institutes of Health in 201848, with the majority of investigated cases involving receipt of resources from China49. In a recent study50, Jia et al. found a marked decline in the frequency of U.S.-China collaborations in the life sciences following these investigations. The authors interviewed a number of scientists who had past, ongoing, or planned collaborations with China-based institutions. Those scientists confirmed that, due to the investigations mentioned above, they were reluctant to initiate new or continue existing collaborations with institutions in China. Other recent policies that could potentially limit scientific collaborations between the two nations include the Evaluation of Representative Outcomes (ERO) released by the Chinese government’s Ministry of Science and Technology in 202051, which encourages scholars in China to publish in domestic journals and downplays the importance of international journals52, hindering cross-border collaborations.

These observations motivate the examination of AI papers produced by the U.S. and China, with a particular attention to those resulting from collaboration between the two countries. Here, we analyze a data set of AI papers and scientists to address the following questions: (i) How do the U.S. and China rank globally in terms of AI novelty, productivity, and impact? (ii) Do scientists who move from one country to the other continue to collaborate with coauthors from the origin country? If so, at what rate? (iii) Are researchers from the U.S. and China more impactful when collaborating together?

Results

Role of the U.S. and China in AI research

We start our analysis by exploring the countries that lead global AI research. To this end, we utilize Microsoft Academic Graph (MAG)53, a widely used dataset containing detailed records of over 263 million scientific publications authored by 271 million scientists. Given our focus on AI research, we focus on papers with the AI field classification according to MAG. For each AI paper, we consider the country of the last author’s affiliation as that paper’s country of origin. This is based on the convention that the last author is typically “the head of the lab that hosted most of the research”54. To compare countries in terms of AI research, we focus on four outcome measures: productivity, impact, novelty, and number of scientists. In particular, we measure productivity as the number of AI papers produced by the country. In line with common practices in the scientometrics literature, we adopt the approach of AlShebli et al. (2022)55 to assess impact as the number of citations that AI papers from the country have accumulated within the first two years post-publication. This time frame is widely recognized as an indicator that strongly correlates with the overall citation performance over longer periods.

Next, we examine two types of novelty, which we refer to as context novelty and content novelty. More specifically, context novelty is computed using the z-score measure as proposed by Uzzi et al56. while focusing on AI papers. Intuitively, for any given AI paper, the measure considers all pairs of journals (“contexts”) referenced therein, and for each pair, quantifies the likelihood of them being co-cited in AI papers. If this likelihood in our observed data is more than expected by random chance, it indicates relatively common or “conventional” pairings. On the other hand, if it is less than expected, it indicates relatively atypical or “novel” pairings. As for content novelty, we compute it following Shi and Evans’ approach57. In particular, we model scientific publications as hypergraphs of keywords (“contents”) and estimate the propensity that a pair of keywords would appear in the same paper. The probability that a set of keywords appears in one publication is calculated as the product of two factors: the proximity between a set of keywords and the popularity of those keywords, which are jointly estimated using a hypergraph embedding algorithm with negative sampling.

Finally, to measure the number of AI scientists who reside in any given country, we first identify AI scientists following the approach of AlShebli et al.55. In particular, we use the MAG dataset, and classify each publishing researcher as an AI scientist if they satisfy the following conditions: (i) they authored at least three papers; and (ii) the majority of their papers are classified as AI papers. The number of AI scientists in any given country is taken as the number of those whose affiliation resides in that country. Scientists who happen to have simultaneous affiliations in multiple countries are double-counted, i.e., they contribute towards the number of AI scientists in each of these countries. We analyze these outcome measures between the years 2000 to 2021, resulting in a dataset of 5,399,828 papers and 362,929 scientists.

Figure 1a depicts the total AI productivity of the 20 most productive countries. As can be seen, the U.S. has produced a total of 1,365,452 AI papers (25.23% of global output) while China has produced a total of 957,840 papers (17.70% of global output), demonstrating their global dominance of AI productivity. When looking at the annual productivity of the U.S. and China over time, we find that China caught up with the U.S. by the year 2010, but has fallen slightly behind in years that followed. Figure 1b focuses on the number of AI scientists in each country, showing that China is leading with 105,103 scientists, compared to the 94,363 in the U.S. This is due to China’s substantial growth in AI scientists over the last 5 years, as shown when plotting these numbers over time.

Figure 1c presents the 20 countries with the highest impact in AI research, where impact is quantified based on citations accumulated two years post publication. We find that the U.S. and China lead the pack with a total impact of 7,368,464 and 2,157,122 citations, respectively. Taken together, these citations amount to 46.4% of global impact. Looking at average impact per paper over time, we observe an overall upward trajectory in average impact over time for both countries. To gain deeper insight, we investigate the countries with the largest share of “hits” based on impact, where a hit is taken as a paper that falls among the 1% of most impactful papers published in that year58. That is, for each country, we compute the percentage of its papers that are hits out of all AI hits produced globally, rather than from all AI papers produced by that country. Figure 1d illustrates that the U.S. has the largest share of AI hits (43.9% of global hits), followed by China (10.9%). Looking at each country’s share per annum, we find that China’s share of hits has increased over time, while the U.S. has witnessed a slight decline in its share.

Finally, Figure 1e and Supplementary Figure 1a show that the U.S. and China are also at the forefront in terms of context novelty and content novelty, respectively. When examining context novelty over time (Figure 1e), we find that the average novelty of China-based papers has been comparable to, if not greater than, that of U.S.-based papers over the past two decades. In terms of content novelty (Supplementary Figure 1a), we find that it is slightly increasing over time for both countries, but the gap between the U.S. and China persists over time. Figure 1f and Supplementary Figure 1b also focus on context novelty and content novelty, respectively, but they depict each country’s share of novelty hits (i.e., share of papers that fall among the 1% most novel papers). As can be seen, out of all countries analyzed, the U.S. and China have the largest shares. When examining the shares of these two countries over time, we find that China has caught up with the U.S. in recent years in terms of context novelty, but remains behind in terms of content novelty.

To determine the degree to which the findings of context novelty are driven by the introduction of relatively new publication venues, we repeated the context novelty analysis while excluding any venues that appear for the first time in the MAG dataset after 2015. Note that the novelty score of any paper published after 2015 may change as a result of this exclusion. We compared the scores of these papers before and after the exclusion of such references; see Supplementary Figure 2. The broad trends remain largely unchanged, suggesting that they cannot be explained by the introduction of newer venues.

Cross-border AI scientists’ movement and collaboration

Our exploratory analysis has shown that the U.S. and China are leading in AI research. Next, we focus on AI scientists moving from the U.S. to China and vice versa. That is, we analyze individuals who were research active in one of the two countries, before moving to the other. Note that those who were in China, for example, and moved to the U.S. before publishing any papers are not considered in our analysis, because they obtained the “scientist” status after, not before, the move took place. We rely on affiliations to identify scientists who have moved from one country to another. More specifically, individuals who published AI papers with an affiliation in Country A, and then started publishing AI papers with an affiliation in Country B, are considered to have moved from A to B. Scientists who simultaneously held affiliations in multiple countries are excluded from this analysis. Note that our analysis does not study movements from Chinese institutions to American ones, but rather from China to the U.S., as some institutions in China may not necessarily be Chinese.

Results from this analysis are summarized in Figure 2. In particular, Figure 2a examines the distribution of countries from which AI scientists moved to the U.S. during the past 20 years, showing that most of them come from China. Moreover, the number of AI scientists moving from China to the U.S. has increased rapidly over the past decade. Similarly, as shown in Figure 2b, the country from which AI scientists move the most to China is the U.S., and the number of such movements has increased steadily over the past two decades. Figures 2c to 2e show that China is attracting AI scientists who are more experienced, impactful, and productive than those attracted to the U.S. These figures also show that China-based scientists who move from the U.S. have greater experience, impact, and productivity than those who do not. The same holds for U.S.-based scientists who move from China compared to those that do not.

To better understand the nature of such movements, we infer the race of the scientists moving from China to the U.S. and vice versa, using a name-ethnicity classifier. More specifically, we use NamePrism, which classifies scientists into six different racial groups: Asian / Pacific Islander (API), American Indian / Alaskan Native (AIAN), Black, Hispanic, Two or more races (2PRACE), and White59. Note that NamePrism is widely used in the social sciences to infer the race or ethnicity of given names60,61,62,63,64,65. We find that 98.8% of scientists moving from China to the U.S., and 96.0% of those moving from the U.S. to China have Asian ancestry. This suggests that the predominant flow of movement between these two nations is among individuals of Asian descent.

Next, we examine the relationship between the university ranking and the rate at which scientists move to, or from, that university. We find that 27% of AI scientists moving from the U.S. to China come from a top-100 institution. Similarly, 29% of those moving in the opposite direction come from a top-100 institution. As such, the two nations seem comparable in the rate at which they attract AI scientists from the other nation’s top institutions. However, the nations differ considerably in the rate at which their top institutions recruit talent from the other nation. In particular, 37% of those who move from China to the U.S. are recruited by a top-100 institution, whereas the rate is only 20% of those moving in the opposite direction.

Finally, we compare China-based AI scientists who move from the U.S. to those who do not, in terms of the rate at which they collaborate with U.S.-based coauthors after the movement took place. Likewise, we compare U.S.-based scientists who move from China to those who do not, in terms of the rate at which they collaborate with China-based coauthors. The comparison is carried out using Coarsened Exact Matching while controlling for (i) career age, (ii) productivity in the movement year, and (iii) citations accumulated by the movement year. Each matching experiment compares the treatment group (scientists in country A who moved from country B) to the control group (scientists in A who did not move from B) in terms of the rate at which they collaborate with coauthors from B in the years following the movement. Naturally, one would expect the rate to be higher for the treatment group. Nevertheless, this analysis allows us to quantify the difference in the rate between the two groups.

The two matching experiments and their outcomes are illustrated in Figure 2f; see Supplementary Table 1 for numeric values. As shown in this figure, China-based AI scientists who moved from the U.S. are nearly 20 times more likely to collaborate with U.S.-based coauthors, compared to their counterparts who did not move from the U.S. (54.45% vs. 2.78%). Similarly, the likelihood of U.S.-based AI scientists who moved from China to collaborate with China-based coauthors is nearly 30 times greater, compared to their counterparts who did not move from China (42.21% vs. 1.46%). Furthermore, we examined the relationship between the academic age of the AI scientist and the rate at which they collaborate with the country of origin (i.e., the country that they have moved from). We found a weak correlation (\(r = 0.22\); \(p < 0.001\)) between academic age and collaboration rate, but only when the move is from China to the U.S. (as for the other direction, no significant correlation was found).

Naturally, one would expect the “effect” of the relocation to be strongest in the time immediately after the relocation, and fade thereafter as local collaborations put out international ones with costly communication (e.g., time zone differences between the U.S. and China). To determine whether this is the case, we examined the rate at which scientists collaborate with the country of origin over time, focusing on the 10 years that followed the move. Indeed, as shown in Supplementary Figure 3, the effect fades out for China-based scientists, dropping from 57% (immediately after they relocate to China) to about 36% a decade after the move. Interestingly, however, the opposite is true for U.S-based scientists; the rate at which they collaborate with China-based coauthors actually increases (from 45% to 51%) in the decade that follows their move to the U.S.

Collaborative AI research between the U.S. and China

Our final analysis focuses on papers produced when the two countries collaborate and compares them to those produced when each country works without the other. International collaborations have been shown to provide rich opportunities to enhance research66, as they free scientists from local constraints67and offer diverse views and experiences, thereby providing a competitive advantage68. However, the rivalry between the U.S. and China, especially in the field of AI, raises several questions about the international collaborations between the two nations in this field. For instance, is the number of such collaborations decreasing in recent years due to the aforementioned rivalry? Are such collaborations taking place mainly among higher-ranked universities? And if there is indeed a citation premium for such collaborations, is it greater in the field of AI compared to other fields? and can it be explained by “home citations”?

To explore these questions, for any given paper, if the last author is affiliated with an institution from the U.S., and at least one coauthor is affiliated with an institution from China, we consider this to be a U.S.-based paper produced in collaboration with China. Similarly, if the last author has a China-based affiliation, and at least one coauthor has a U.S.-based affiliation, we consider this to be a China-based paper produced in collaboration with the U.S. This approach is similar to the one used by AlShebli et al.55, except that they focus on cities, while our focus is on countries. Given our reliance on the order of authorship to infer lead author, it is not possible to detect cases where there are multiple leads. Moreover, since we focus on papers either led by China or led by the U.S., any paper involving a China-based coauthor and a U.S.-based coauthor is only considered a collaboration between the two countries if one of the two coauthors is a last author.

To be more precise, we compare U.S.-based papers produced in collaboration with China, denoted by \(( US , China )\), to those produced without such collaboration, denoted by \(( US , \lnot China )\). We also compare China-based papers produced in collaboration with the U.S., \(( China , US )\), to those produced without such collaboration, \(( China , \lnot US )\). More specifically, Figure 3a depicts the number of such papers over time on a log scale. As can be seen, collaborations between the U.S. and China in the field of AI were relatively rare before 2010 (fewer than 1000 papers per year), and started increasing in the second decade of the millennium. Despite this increase, collaborations between the two countries continue to represent only a small fraction of their overall AI productivity.

Figure 3b compares the number of authors on these papers. As shown in this figure, teams that involve collaborations between the two countries are, on average, larger than those that do not. Figure 3c compares these papers in terms of the percentage of last-author affiliations that fall among the top 100 most impactful institutions in the field of AI. As can be seen, last authors of China-based papers are more likely to be affiliated with a top-100 institution when collaborating with the U.S.

Figures 3a to 3c imply that, when comparing the impact of papers that involve a collaboration between the U.S. and China to those that do not, one needs to control for publication year, team size, and last-author affiliation. To this end, we use Coarsened Exact Matching (CEM)69. The matching process is illustrated in the left-hand side of Figure 3d, emphasizing that the impact of papers in \(( US , China )\) is compared to the impact of those in \(( US , \lnot China )\) that have the same publication year, team size, and last-author affiliation. Likewise, the impact of papers in \(( China , US )\) is compared to the impact of those in \(( China , \lnot US )\) while controlling for the above confounders. The right-hand side of Figure 3d depicts the relative difference in impact between the papers and their matched ones over time; see Supplementary Tables 2 and 3 for the numeric values. As can be seen, for the entirety of the period considered in our analysis, China-based papers that involve U.S.-based collaborators have been more impactful than those that do not. As for U.S.-based papers, the first decade in our analysis shows no difference in impact when involving China-based scientists. However, the last five years have witnessed a significant increase in impact associated with collaborators from China. Similarly, when examining the rate at which the two countries produce impact-based AI hits, we find that each of them achieves a notably higher rate when collaborating with the other; see Supplementary Figure 4.

As a robustness check, we perform a within-subject analysis by controlling for the last author. That is, we compare papers that involve U.S.-China collaborations to those produced by one country without the other, but the comparison is now performed among papers that have the same last-author. This naturally yields a smaller dataset compared to our original one, since there are many scientists who do not meet this requirement. To account for the smaller dataset, we relaxed our matching criteria by allowing for up to two years difference in publication date, and binning the sizes of teams that involve five or more members. The results of this analysis are depicted in Supplementary Figure 5. As can be seen, our main findings persist, i.e., both countries yield more impactful AI research when collaborating with each other, especially in recent years.

Our next robustness check involves expanding the definition of “AI papers” by including those classified as Computer Vision, Pattern Recognition, Machine Learning, or Natural Language Processing (NLP). Due to the relatively small number of NLP papers in the MAG dataset that include U.S.-China collaborations, we opted to merge NLP with Machine Learning. For each of the resulting subfields, we replicated Figure 3d, examining the percentage increase in impact when the U.S. and China collaborate together. Additionally, we replicated the figure, but this time combining all subfields together with all the papers from our original dataset. The results of these analyses confirm our main finding, i.e., the two nations are more impactful when collaborating together; see Supplementary Figure 6.

We performed yet another robustness check, where we explore an alternative way of identifying collaborations between the two countries. Instead of the lead author being in one country and a coauthor being in the other (regardless of the coauthor’s position on the paper), we now require the coauthor to be the first author on the paper. This yields a stricter criterion than the one used earlier, as it effectively excludes all collaborations where the coauthor is not the first author. This suggests that a scientist from one country has “done the most work”, and a scientist from the other country has “provided the most guidance”. The result of this analysis is presented in Supplementary Figure 7. As can be seen, the same broad trends are observed, i.e., China and the U.S. tend to produce more impactful AI research when collaborating together.

Our citation-based analysis focused on two years post publication in order to include recent cohorts of papers. Including such papers is particularly relevant in our context, because AI is a fast-moving field, and the number of AI papers generated in both the U.S. and China have rapidly increased over time. Thus, by limiting the number of years analyzed to just two, we can include the most recent developments in the field. There have been debate regarding the use of short-term impact as a predictor of long-term impact, however; see, e.g70.,. This controversy considered only papers published in Physical Review; the extent to which this holds in the field of AI remains unknown. Against this background, we computed the correlation between the number of citations received within two years, and those received within five years; we found a strong correlation between the two (Pearson’s r = 0.85). Additionally, as a robustness check, we repeated the same analysis of Figure 3d, while focusing on five years instead of two; see Supplementary Figure 8. Again, the same broad trends are observed, i.e., papers from China that involve U.S.-based collaborators receive more citations than those that do not. Similarly, in recent years, papers from the U.S. that involve China-based collaborators receive more citations than those that do not.

To determine whether the observed trend (i.e., the citation premium of U.S.-China collaborations in recent years) is specific to AI, or is a reflection of a general U.S.-China collaboration, we performed the same Coarsened Exact Matching analysis, but for seven additional fields of Computer Science, namely Algorithms, Computer Networks, Computer Security, Databases, Operating Systems, Programming Languages, and World Wide Web. The results of this analysis are depicted in Supplementary Figure 9. As can be seen in this figure, the citation premium associated with U.S.-China collaborations is greater in AI than any of the other Computer Science fields considered in our analysis. This finding suggests that such collaborations matter more in AI than in other fields.

One possible explanation behind these trends could be the prevalence of “home citations”, i.e., the citations received from papers produced by the same country. If scholars consider how citations are used to rank scholars and institutions, and if their access to government resources depends on raising their international profiles, then they may have an incentive to “game the system” by deliberately adopting practices that raise their measured impact. In particular, scholars may follow a conscious and deliberate practice of citing one another in an effort to improve their measured academic output and that of their home country institutions. To explore this possibility, we repeat the same analysis while focusing solely on the citations made by non-U.S. papers to U.S.-based work, as well as those made by non-China papers to China-based work. In this context, we use two alternative ways of identifying the non-U.S. and non-China papers. The first focuses on the last author (e.g., the paper is non-U.S. if the last author does not have a U.S.-based affiliation), while the second considers all authors (e.g., the paper is non-U.S. if none of the affiliations therein are U.S.-based). The results of this analysis are depicted in Supplementary Figure 10. As shown in this figure, the same broad trends are observed, suggesting that they cannot be entirely explained by home citations.

Up to this point in our analysis of U.S.-China collaborations, we focused on citation count. Next, we focus on an alternative measure—the publication rate in top AI conferences. To this end, we use CORE, a website providing conference rankings that are widely used by computer scientists71. According to CORE, “conference rankings are determined by a mix of indicators, including citation rates, paper submission and acceptance rates, and the visibility and research track record of the key people hosting the conference and managing its technical program”. The highest rank is “A*”, and only 7% of computer science conferences receive this highly prestigious ranking71. Importantly, these conferences are double-blind, meaning that reviewers cannot see the authors’ names or affiliations, unlike journal submissions. Since CORE does not specify which of the A* conferences is AI-related, we identified the 10 most highly ranked AI venues according to Google Metrics, and found five that are A* conferences (the other five venues were journals); these conferences are: AAAI, ICLR, ICML, IJCAI, and NeurIPS. When analyzing the papers published in these conferences, we focus on the same four types of papers used earlier, i.e., \(( US , China )\), \(( US , \lnot China )\), \(( China , US )\), and \(( China , \lnot US )\). For each type, we compute the percentage of papers produced by that type that get published in each of the top five conferences; see Figure 4a. As shown in this figure, \(( US , China )\) and \(( China , US )\) publish in the top five A* conferences at a relatively higher rate than \(( US , \lnot China )\) and \(( China , \lnot US )\), respectively. When considering these rates over time, we find that the publication premium of U.S.-China collaborations has become particularly pronounced in recent years (Figure 4b). Finally, when considering the conferences in isolation, we find that the publication premium is greater in each of the five Computer Science conferences that are considered in our analysis (Figure 4c). These results further support our finding that China and the U.S. produce more impactful AI research when collaborating together.

Discussion

Our study contributes to a growing body of work that focuses on AI bibliometrics. For example, Frank et al72. examined research that interacts with AI using the MAG dataset to identify disciplines frequently citing or cited by AI papers. Similarly, Tran et al73. examined AI dynamics in medicine, analyzing diseases most frequently studied most and least in AI using the Web of Science. Martínez-Plumed et al74. identified research communities most likely to advance the state of the art in AI using Papers With Code—a repository of AI benchmarks along with their associated papers. In related research, Tang et al75. gained insight into how the pace of AI innovations has changed over the past years by analyzing preprints on arXiv.org. Klinger et al76. identified the AI subfields most involved in the development of Explainable AI—an area of research that focuses on making AI-based decisions more understandable to humans using bibliometric data from Scopus. Finally, Stathoulopoulos and Mateos-Garcia probed gender differences in the authorship of AI papers77. Despite the number and variety of these analyses, U.S.-China collaborations in the field of AI have not been closely examined to date. Our study fills this gap in the literature, by examining the migration of AI scientists from one country to the other, and by analyzing the impact of AI papers resulting from collaborations between the two countries.

Our study also contributes to another line of research that analyzes the mobility of scientists. In particular, some scholars have modeled this phenomenon78,79,80and identified factors that influence scientists’ migration decisions81,82,83,84, while others have focused on the impact of migration on scientists’ careers85,86,87,88or countries’ research activities89,90,91,92. Some of this work has focused on the mobility of scientists from specific disciplines, e.g., physics78,88,93, economics87, genomics94, chemical engineering95, life sciences84, stem cell research96, biology, chemistry, material science, and environmental science86. The mobility of AI scientists has only been considered very recently by AlShebli et al55., who constructed networks of inter-city AI citations, collaborations, and migrations. Using these networks, the authors quantified the role that different cities play in bridging East and West, and showed that Beijing’s role surpasses that of all other cities combined, becoming a global hub for AI researchers. Moreover, the authors tracked the AI center of mass—the average geographic location of AI research activities—over the past three decades, and found that it has been drifting steadily towards Asia, with Beijing’s gravitational pull increasing each year, suggesting an eastward shift in the tides of AI research. Our study complements this literature, showing that the majority of AI scientists moving to the U.S. come from China, and vice versa. Moreover, those who move continue to collaborate with scientists in the origin country. Our findings imply that the movement of AI scientists between the U.S. and China fosters fruitful collaborations that can be advantageous to both countries.

Studies examining international collaborations in general (rather than in the field of AI) suggest that the impact of international collaborations varies across countries97. Such collaborations are associated with greater impact for UK-based scientists98as well as Norway-based scientists99. Conversely, most highly cited papers from the U.S. are produced by domestic, rather than international, collaborations100. Similarly, Kwiek101 analyzed papers from six fields of research—engineering and technologies, agricultural sciences, humanities, natural sciences, medical sciences and social sciences—and found that, for both the U.S. and China, domestic collaborations (i.e., those involving U.S.-based only, or China-base only, coauthors) have a greater impact than their international counterparts. In contrast, our study shows that when it comes to AI research, papers involving collaborations between the U.S. and China (whether U.S.- or China-led) are more impactful than those led by one of the two countries and not involving the other.

We focused on the U.S. and China in our analyses given the geopolitical tension between the two, and that they are the most productive countries in terms of AI research. Nevertheless, the synergy we found between the U.S. and China is not unique; similar synergies also exist when these nations collaborate with other countries. We examined this possibility by examining their collaborations with the most productive countries in AI research. Results from this analysis are depicted in Supplementary Figure 11. This reveals that the U.S. produces more impactful AI research when collaborating with Australia, the UK, and Canada. Similarly, China produces more impactful AI research when collaborating with India, South Korea, Australia, the UK, Canada, and France. The most impactful collaborations with the U.S. are with China, but the converse is different. In particular, China’s collaborations with the U.S. are less impactful than those with India, South Korea, Australia, the UK, and Canada. This exploratory analysis raises the possibility that the U.S. may have more to lose than China from policies discouraging collaborations between the two nations, as China would continue to have more powerful collaborators, whereas the U.S. would lose its most powerful one. Further research, however, is needed to examine this possibility in greater depth.

Our study is not without limitations. First, although we rely on a widely used dataset (namely Microsoft Academic graph), this dataset is historically biased toward English-speaking cultures, and lacks papers written in Mandarin or Cantonese, thereby underrepresenting the research output of China. Second, since our dataset ends in 2021, it does not capture the latest developments in the field of AI such as, e.g., the recent rise of large language models (LLMs), which may have dramatically shifted the fields of AI and NLP. As such, it remains to be seen whether the U.S.-China race to innovate in the space of LLMs exhibits similar trends to the ones reported in our study. Third, when quantifying impact, we only consider citations that the paper has accumulated in the two years (Figure 3d, as per55) or five years (Supplementary Figure 8, as per60) post publication. As such, our analysis does not highlight “sleeping beauties”102that start accumulating papers at a later stage (e.g., a decade after publication, as per103). Fourth, when comparing the U.S. and China in terms of the number of AI scientists, the comparison is done in absolute rather than relative terms, i.e., we do not account for the substantial difference in population size between the two nations. Fifth, our approach of inferring scientists’ Asian ancestry cannot capture the qualitative nuance of scientists migrating from the U.S. to China or vise versa, e.g., those who do not wish to identify with their prior ancestry..

Future extensions could look beyond English-centric data sources, by examining bibliometric datasets that include papers written in Mandarin or Cantonese104,105, thereby providing a more comprehensive view of U.S.-China research collaboration. Additionally, future research could examine the influence of broader social dynamics and terminological variations on migration trends and research outputs, focusing on the history of AI winters and how it influenced both the migration of AI researchers as well as the rebranding of AI terminology to fit different funding schemes106. Future work may also enrich our analysis by contextualizing the long-standing history of AI development and collaboration between China and the U.S., including the military and defense implications of AI, which underscore the dual-use nature of such technologies, complicating the narrative of academic collaboration107.

Analysis of AI’s potential impact on U.S.-China relations is heavily influenced by technonationalism, which emphasizes interstate competition over technological assets108. For instance, the National Security Commission on AI warns that the U.S. “must win the AI competition that is intensifying strategic competition with China” and that “China’s plans, resources, and progress should concern all Americans36. Such portrayal of China as a threat to the U.S. in the AI arena may hinder scientific collaborations between the two nations. Indeed, these collaborations can be viewed from a national perspective, where the emphasis is on increasing economic competitiveness, ensuring national security, and signaling national prestige. After all, the line between basic and commercial research is thinning, and this is challenging conventional norms about international collaboration and the nature of scientific ownership109. Nevertheless, despite potential spillovers that might work against national interests, such collaborations can also be viewed from a perspective that transcends political agendas—a global perspective where the emphasis is on the advancement of knowledge for the benefit of all.

Fig. 1
figure 1

The U.S. and China leading in impact, novelty, productivity, and number of AI scientists. (a), Main plot: The 20 countries producing the most AI papers. Inset: The number of AI papers produced by the leading two countries (U.S. and China) over time. (b), Main plot: The 20 countries with the largest number of AI scientists. Inset: The number of AI scientists in the leading two countries (U.S. and China) over time. c, Main plot: The 20 countries that garner the highest impact in AI research. Inset: The average impact of AI papers in the leading two countries (U.S. and China) over time. (d), Main plot: The 20 countries with the greatest share of hits based on impact. Inset: The share of hit AI papers based on impact in the leading two countries (U.S. and China) over time. (e) and (f), The same as (c) and (d), respectively, but for context novelty instead of impact.

Fig. 2
figure 2

The interplay between AI scientists’ migration and cross-border collaboration. (a), Main plot: The 20 countries from which the largest number of AI scientists migrate to the U.S. Inset: For the top five countries, the number of AI scientists that migrate to the U.S. over time. (b), The same as (a) but for China instead of the U.S. (c-e), For AI scientists migrating from the U.S. to China, and vice versa, the distributions of career age, impact, and productivity. (f), Out of all AI scientists that are based in country A, comparing those who migrated from country B to those who did not, in terms of the percentage of their papers that involve coauthors from B; the comparison is done using Coarsened Exact Matching, and only considers the years after the migration, while controlling for career age, impact, and productivity. Error bars represent bootstrapped 95% confidence intervals; P values are from t-tests; * \(p<.05\); *** \(p<.001\).

Fig. 3
figure 3

The impact of U.S.-China AI collaborations. Analyzing four types of AI papers: (i) U.S.-based papers produced in collaboration with China, \(( US , China )\); (ii) U.S.-based papers produced without China, \(( US , \lnot China )\); (iii) China-based papers produced in collaboration with the U.S., \(( China , US )\); (iv) China-based papers produced without the U.S., \(( China , \lnot US )\). (a), Annual number of papers per type (log scale). (b), Distribution of team size per type. (c), For each type, the percentage of papers of which the last author’s affiliation is among the 100 most impactful institutions in AI. (d), The infographic illustrates the Coarsened Exact Matching (CEM) experiment, while the plot depicts the percentage increase in impact of \(( US , China )\) compared to \(( US , \lnot China )\), as well as the percentage increase in impact of \(( China , US )\) compared to \(( China , \lnot US )\), over time. Error bars represent bootstrapped 95% confidence intervals; P values are calculated using t-tests (a, b, and d) and two-sided Fisher’s exact test (c); * \(p<.05\); ** \(p<.01\); *** \(p<.001\).

Fig. 4
figure 4

Publication venue analysis. Comparing four types of AI papers: (i) U.S.-based papers produced in collaboration with China, \(( US , China )\); (ii) U.S.-based papers produced without China, \(( US , \lnot China )\); (iii) China-based papers produced in collaboration with the U.S., \(( China , US )\); (iv) China-based papers produced without the U.S., \(( China , \lnot US )\). (a), For each type, the percentage of papers published in the top five AI conferences. (b), Similar to (a) but over time. (c), Similar to (a) but disaggregated across the top five AI conferences.