Small and large teams are fundamentally different. Small teams ask questions and disrupt existing theories. Large teams answer questions and stabilize established paradigms.
ONE of the most universal trends in science and technology today is the growth of large teams in all areas, as solitary researchers and small teams diminish in prevalence. Increases in team size have been attributed to the specialization of scientific activities, improvements in communication technology, or the complexity of modern problems that require interdisciplinary solution. This shift in team sizes raises the question of whether and how the character of the science and technology produced by large differs from that of small teams. Here we analyse more than 65 million papers, patents and software products that span the period 1954–2014, and demonstrate that across this period smaller teams have tended to disrupt science and technology with new ideas and opportunities, whereas larger teams have tended to develop existing ones. Work from larger teams builds on more-recent and popular developments, and attention to their work comes immediately. By contrast, contributions by smaller teams search more deeply into the past, are viewed as disruptive to science and technology and succeed further into the future—if at all. Observed differences between small and large teams are magnified for higher-impact work, with small teams known for disruptive and large teams for developing work. Differences in topic and research design account for a small part of the relationship between team size and disruption; most of the effect occurs at the level of the individual, as people move between smaller and larger teams. These results demonstrate that both small and large teams are essential to a flourishing ecology of science and technology and suggest that to achieve this science policies should aim to support a diversity of team sizes.
The Web of Science (WOS) database contains more than 44 million articles published between and 611 million citations among them.
The US Patent and Trademark Office (USPTO) database contains 5 million patents and 65 million citations added by patent applicants.
The GitHub database contains 16 million software projects and 9 million forks to them added by software developers.
We open data to serve the Science of Science. Please feel free to use:
D-score of 19,495,814 MAG papers (1830-2021) published on Harvard Dataverse
And cite:
Lingfei Wu; Dashun Wang; James Evans, 2021, "Replication Data for: Large teams develop and small teams disrupt science and technology", https://doi.org/10.7910/DVN/JPWNNK, Harvard Dataverse, V1
Wu, L., Wang, D., & Evans, J. A. (2019). Large teams develop and small teams disrupt science and technology. Nature, 566(7744), 378-382.
See the figure below for replication results.
Featured in Nature Behavioural and Social Sciences Community
The Formation of Our Small Team
Spring is ideal for relaxation in Santa Fe, New Mexico. But in 2015, uncertainty in my academic life distracted me from enjoying the landscape. Visiting my friend, a post-doc researcher at Santa Fe Institute (SFI), I was hoping to find some big ideas worth pursuing and passionate colleagues to pursue them with. One afternoon, James Evans, a sociologist from the University of Chicago, was giving a talk on the collective production of knowledge. I planned to stay for ten minutes. Three hours later, we were in the same room, talking about Immanuel Kant's theory of categories, the Church–Turing thesis, and how natural and artificial languages shape how society thinks and what society knows. The sun went down and I worried that it might be our last conversation. I ventured whether there was an opportunity to work together. James thought for three seconds, and replied: “Do you know of Dashun Wang?” He mentioned that they were applying for a grant and could use a collaborator. Dashun interviewed me online and introduced a research project they were conceiving.
Disruption: Idea Eclipse in the Expanding Universe of Knowledge
James and Dashun are interested in shooting stars. When we met at Dashun’s hotel room at Phoenix, Arizona to prepare for a satellite session at the Conference on Complex Systems (CCS’15), “Quantifying Science”, James mentioned the idea "small team jump" and Dashun immediately echoed with an example, which was interdisciplinary, innovative research by his colleagues on the controllability of complex networks (Liu et al., 2011). We all agreed that small teams are typically creative and able to follow fast breaks in the space of ideas, especially when their members come from radically different backgrounds. I think that was a definitive moment at which the main idea of our paper was anchored.
The journey accelerated after I moved to Chicago and joined Knowledge Lab as a postdoc. James pointed our attention to a new measure proposed by Funk and Owen-Smith (2016), which operationalized the concept of disruption in Christensen’s influential book The Innovator's Dilemma (Christensen, 1997). As James used the words “eclipse” and “overshadow” a lot in our discussions, these words start to haunt me. At first, they were too soft to hold on to. The eureka moment came in a quiet afternoon, when I was sitting on the shore of Michigan lake, staring at the ripples on lake surface. All of sudden, I realized that the word “eclipse” used by James went beyond metaphor – it literally quantifies how a new idea’s shadow may block pre-existing ideas, as hand drawn in the shown picture (see the figure in the last session).
If we imagine articles as stars shinning in the expanding universe of human knowledge (Larsen and von Ins, 2010), an article disrupts by being more visible than its references to an observer. The fact that the focal paper overshadows the past and redirects all future attention to itself is evidence that the focal paper is perceived and received as a novel contribution by its field. More interestingly, if small teams are more disruptive, as we confirmed very quickly in the early stage of our project, this metaphor of the expanding universe of knowledge immediately predicts that small-team stars (articles) must targeted older and less visible stars (references), a pattern we had never seen discussed before. Over the ensuing months, we observed this pattern universally across three datasets from very different domains: research articles, patents, software. Despite my best efforts, I could not break the finding.
It Takes a Small team to Think Deep and A Long Time to Grow Young
As we summarize in our paper, the main driver of difference in disruption between large and small teams appears to be how each treat the history of their field. Small teams search deeply into the past and try to remember forgotten ideas, whereas large teams herd for hot spots and forget less popular ideas. As we shown in the paper, those remembering will be remembered (i.e., citations to small teams last longer, if at all) and those forgetting will be forgotten (citations to large teams occur almost immediately). These propensities are likely associated with different risk preferences—small teams have more to gain and less to lose from taking risks. Large teams may also interfere with individual member insights.
Skyping each week, James, Dashun and I would attack each other’s worst ideas and echo our best ones. As we discussed, unpacking findings and responding to reviewers, we built confidence in our original ideas, but pivoted to new insights that went well far beyond this article. As we engaged over the years, it highlighted to me the power of small teams to move quickly and sustain rapid cycles of searching, formalizing, and testing ideas. As we continue to move forward along this journey, I personally feel that innovation is not about scaling up team size, but rather persistence, loneliness, and a sustained but flexible focus that is nimble enough to follow a fast break.
Disruption as idea eclipse in the expanding universe of knowledge
The visibility (popularity) of articles (stars) decays exponentially over time, as the size of knowledge universe increases exponentially. An average observer (or an average future citation, represented by the opened eye) determines the disruption of a focal paper by comparing its visibility against the visibility of its references, rescaled by considering the temporal decay of visibility. See Extended Data Figure 5-6 from our paper.
Four main versions of text: the first submission to Science (2017-08), the second submission to Science, the first submission to Nature (2018-04), and the final version at Nature (2019-02 Feb).
Citation tree visualization that illustrates the visual influence of focal papers, drawing on past work and passing ideas onto future work. "Roots" are references and citations to them, with depth scaled to their publication date; "branches" on the tree are citing articles, with height scaled to publication date and length scaled to the number of future citations. Branches curve downward (brown) if citing articles also cite the focal paper’s references, and upward (green) if they ignore them. Disruption (D), citations (n), published year (Y) and team size (m) of papers are shown in the bottom left corner of each tree.
Made with Mobirise css web themes