There are far more scientists in today’s world, and they’re publishing research papers at a much faster pace. However, all of this growth hasn’t translated to more scientific progress. As University of Chicago Professor James Evans argues, scientists are overloaded by the flood of research papers they have to read, which is causing them to cite the same few papers over and over again. This dilemma is leaving newly published papers with less of a chance to disrupt existing work.
Evans directs the Knowledge Lab at UChicago, which leverages machine learning and is trying to reimagine the scientific process by providing better pathways for new scientific ideas to be shared with others. He explains how we can get back to advancing science again.
(Episode published March 31, 2022)
- New ideas are struggling to emerge from the sea of science—Axios
- Data science predicts which failures will ultimately lead to success—UChicago News
- Politically polarized teams produce better work, analysis of Wikipedia finds—UChicago News
- Bigger teams aren’t always better in science and tech—UChicago News
- Computer programming languages can impact science and thought—UChicago News
Paul Rand: How can nonprofits drive social change in Chicago and beyond? Join business and nonprofit leaders on April 29th for the On Board conference from Chicago Booth’s Rustandy Center for Social Sector Innovation. Gain insights from Obama Foundation CEO Valerie Jarrett on strengthening community engagement and building partnerships. Connect with experts in the social sector and discover how Chicago organizations are creating cultures of innovation and tackling pressing issues. Register at chicagobooth.edu/onboard.
Paul Rand: If you like Big Brains, there’s another science podcast you should check out. PNAS Science Sessions, from the Proceedings of the National Academy of Sciences, features short, in-depth conversations with the world’s top scientific researchers. In less time than it takes to drink a cup of coffee, you can learn about advances such as aquaculture techniques for coral reef restoration. Listen and subscribe to Science Sessions to hear more about their coral reef series on iTunes, Spotify, Google Play, Stitcher, or wherever you get your podcasts.
Paul Rand: Climate change, viral pandemics, global poverty, the solutions to our greatest challenges call for scientific innovation.
James Evans: The stakes are high. The vast majority of productivity improvements come from improvements in science and technology.
Paul Rand: But when it comes to progress, science has a problem with a confusing paradox at its center. Despite a dramatic increase in scientists and scientific papers, new ideas, innovative discoveries, and disruptive progress are actually slowing down.
James Evans: We look at all fields, and in all these fields, as the number of papers goes up, as the growth of the fields goes up, the number of new ideas that enter the canon exponentially declines.
Paul Rand: That’s James Evans, a professor of computational social science and the director of The Knowledge Lab at the University of Chicago, where they use massive data sets, I’m talking millions and millions of scientific papers, findings, patents, discoveries, combined with machine learning techniques to study the science of how science does science.
James Evans: We believe that things will get better if I invest enough money with the expectation that there will be some magical discovery that takes place in science, technology, or entrepreneurship that it’s going to allow us to do better. This is one of the reasons why I am deeply interested in kind of understanding this in a very nuts and bolts way, so that we can think about how to engineer this at a societal level.
Paul Rand: Think of science, the system of science, as a brain. Each field of science is in a different region, and each scientist, a different cell producing and communicating across its surface. Evans studies how this brain as a whole thinks, and how to make it think differently. His lab has shown that this system has gotten so efficient that it’s losing its ability to innovate. In fact, it’s stalled to such a degree they can actually model what this brain will think before it does.
James Evans: One of the things that’s interesting is in these models, we can predict with some reasonable accuracy, not only what’s going to be discovered, but who’s going to discover it.
Paul Rand: At The Knowledge Lab, they’ve mapped out our system of science over and over again, almost like taking MRIs on its brain. And Evans says they’ve discovered ways to reorganize it such that it will generate a greater number of new ideas. And with our global crises worsening by the day, the stakes of scientific progress have never been higher.
James Evans: I think that we have a handle on some things that measurably look very promising for science, but they’re hard things. None of them are cheap things.
Paul Rand: From the University of Chicago Podcast Network, this is Big Brains, a podcast about the pioneering research and the pivotal breakthroughs that are reshaping our world. On this episode, the science of science. I’m your host, Paul Rand. Being in charge of something called The Knowledge Lab sounds like a pretty amazing responsibility to be in charge of knowledge. How did you find yourself in such a position?
James Evans: Well, we were studying how knowledge was made, where ideas came from. We were exploring how one could make ideas better, which is to say to serve our purposes more. And we’re at a new knowledge intensive and data intensive age, where we can look at knowledge in completely different ways than we could before, which require entirely new embedding and other kinds of encoding possibilities, both generatively and analytically.
Paul Rand: Imagine you wanted to understand the underlying dynamics that shape the roughly two million papers that are published each year. What are the invisible forces pushing those studies to be done, rather than others? How would you do it? No single person has time to read all of those papers, much less analyze them, drawing connections and forming correlations to understand the mechanics of knowledge creation. But now with machine learning and algorithms that can hold all that data at once, that’s suddenly possible.
James Evans: Right, so we can both understand how it is that knowledge is being produced, but we can also produce or generate knowledge based on these models in new ways. And so that model just felt like we needed a place that brings together people from the full space of imaginaries and techniques across the scientific and technical world to study and analyze and improve the way in which we invest the $100 billion that we invest in knowledge every year. And this is where kind of the idea of Knowledge Lab came from, was that if you look at places, intelligent institutions like the NIH, or The National Science Foundation, they’re investing billions of dollars in science every year. And you were to ask someone at any of those institutions, “Well, what do you fund?” They couldn’t provide an answer to that question because there’s no portfolio. There’s a bunch of individual program officers who were responding to a bunch of individual scientists who do their thing, typically scientists who had productivity in the past, who were basically defining the direction for the future. And it’s not a way to build integrated strategy for success.
Paul Rand: If what we want out of science is new ideas and innovations, we may be building the wrong system. In a recent study, his team designed a model using 1.8 billion citations from about 90 million academic papers. And when they combed through the data, they uncovered a few answers to one of science’s most concerning questions: Why, in a time of maximum scientific output, are progress and innovation slowing down? The first answer to how this increase has slowed progress is obvious once you hear it, but not intuitive beforehand.
James Evans: As fields become enormously flooded with ideas and papers, it becomes much more difficult for ideas to disseminate or filter in because they have this limited attention span.
Paul Rand: With millions of papers every year, it’s like information overload. Fields and communities end up focusing on the ideas that merely expand on well established research, while more innovative and disruptive ideas get lost in the shuffle.
James Evans: Just to speak to one another, they have to agree on enough things. And if they’re all growing, then we’re collectively agreeing on fewer things as being foundational.
Paul Rand: Another explanation is both simple and troubling. Our current fields of science are just simply running out of things to discover.
James Evans: The same way that Wikipedia, the first year of Wikipedia, you could go to Africa, and there’s no Africa, and you can kind of click the button and type in. There were so many open questions. And so now we’re in a different place. It’s hard to find a page that’s not covered in Wikipedia. There are diminishing marginal returns to fields and theories, which is to say a field, a discipline, is a collection of problems and methods. And each discipline is a hypothesis. These problems will best be solved by these methods. Once the low hanging fruits have been plucked, then we have to climb higher on those trees of knowledge, or we have to transit to other trees, other combinations of problems and methods that weren’t basically conceived of, formalized, taught, et cetera, inside our current disciplines.
Paul Rand: And while possible, Evans thinks there’s a more systemic problem going on, the professionalization of science.
James Evans: In this new age, there is, I would say, a deviation between the incentives for science as a whole to maximize discovery and the incentives for scientists as persons to increase their productivity, their visibility, their ability to be promoted, their ability to survive in the scientific world.
Paul Rand: As scientist, researcher, and professor have become viable careers, employed positions in those fields have become more competitive.
James Evans: Which is to say that individual scientists benefit their careers by staying close to the pack, by maximizing productivity, by making incremental advances. They maximize the likelihood that they’re going to be cited, which is the currency, one of the major currencies of the scientific realm. And all those things are critical for promotion and advance and survival in science. And so the result if all those individuals are engaging in that way, rather than we’ve got these millions of researchers that could cover this enormous space, they’re actually clumped together in very narrow pockets around the space because that’s the best thing for them to do.
James Evans: Scientists, to demonstrate success, try to hedge their bets. How do they hedge their bets? They do what movie production studios do. I mean, if a movie production studio is trying to decide whether they should invest in the next Slumdog Millionaire, just an independent script, could be amazing, or Transformers Nine, they’re going to go with Transformers Nine. They’re going to get the same ... They’re going to get Transformers Eight receipts minus epsilon. They’re assured to get them. In the same way, big teams bet on yesterday’s ideas. They follow up on yesterday’s science papers.
James Evans: They hedge their bets, and so sometimes this requires us to be not behaving like the pack animals that we are, which is to say clustering around the last biggest, best win. It means that sometimes we need to explore across this landscape, and that’s hard. That’s hard. It’s hard in science to survive if you explore the landscape, for example, without tenure. And once you have tenure, it’s very difficult to do anything other than what you’ve done. If scientists are following one investigation leads to another, their history predicts their future investigations, their future interests, regardless, in this case, of whether or not those things, those paths not taken, are scientifically or technologically efficacious or promising.
Paul Rand: This problem has gotten so extreme that Evans’ team can even build algorithms and AIs that can predict scientific discoveries before they even happen.
James Evans: We were looking at what materials have valuable energy related properties, medical properties, et cetera. So we’re predicting drugs that would be valuable against disease related genes. We’re predicting their thermoelectric, ferroelectric, other kinds of energy related properties. If we basically build kind of a knowledge manifold of all the prior discoveries that are made, we can do a great job of predicting what materials will be discovered to have these properties in the future. And one of the things that was interesting is we looked at this for all these different disease and drug and other categories. And by incorporating the people, by basically incorporating where the people were sitting across this space, we could dramatically improve by more than 100% our prediction of what would be discovered in the next period.
Paul Rand: They even looked at this in the context of discoveries around COVID.
James Evans: In the context of COVID, so we were predicting basically: What were those medicines and vaccines that would either prevent or be associated with COVID, useful COVID therapies? This is an area where it became a problem a couple of years ago, and then there was an enormous flood of people coming in to try to solve this problem. And it turns out by incorporating the distribution of scientists, at a prior period even before we begin this wave of discovery, we can improve our prediction of the things that will be discovered by 400%.
James Evans: But when we incorporate people into those predictions of COVID, where a ton of people who moved into the space, I thought we would have less purchase because new people would bring their different expertise, and they’d be coming at this problem from all different angles, but exactly the opposite is the case because everybody came in so quickly. Everybody relied on the few pathways that were available at the very beginning, with the very ... Everything was completely path dependent. And that path dependency was locked in like a crystal. So very few investigations went outside what people already had supposed and had been investigating at the very beginning of the crisis.
Paul Rand: There’s another aspect to this that ties into something we’ve talked about a lot on this show, the replication crisis.
James Evans: Yeah. So one of the things that we’ve been looking at is in addition to how fast can science explore the space of opportunities, and to what degree can it explore and discover new things, also how robust is the knowledge that it produces itself beyond the kind of fragile context of your particular experiment, and this has been in the context of I would say really a 15 to 20 year concern about the degree to which scientific findings in top scientific venues replicate, and the discovery that many of them don’t replicate at the level that we expected that they might.
Paul Rand: And the replication crisis has unveiled to us the knowledge we’re producing isn’t as robust as we would hope. Many papers that for decades were thought to be foundational are turning out to not replicate. By modeling hundreds of thousands of papers, Evans’ team can start to identify what conditions lead to more replicable results.
James Evans: And one of the things that we find in a series of projects, I mean, we’ve looked at this in the context of randomized clinical trials in medicine, we’ve looked at this in the context of gene interactions, hundreds of thousands of papers looking at perturbagens or chemicals and disease-related genes. And we model all these aspects: team size, the likelihood that the groups are working together, that they’re independent, that they’re using different approaches or methods. And we find time and time again, deeply replicated, that when you bet all your chips on one set of teams, or a community that’s dense, where the authors and the centers are all interspersed, it dramatically decreases the likelihood that those findings will replicate outside the fragile cluster of that team. And you can see just a linear and massive increase when you add independent communities to the study of this phenomenon. It just linearly increases the likelihood that they’re going to replicate in the future and that they’re going to be able to spin off insights into things like the clinic.
Paul Rand: So teams that are separated from those dense clusters of professionalized researchers focused on that long standing canon of that field actually make discoveries that are more replicable and more robust, which is really what we hope for from our system of scientific discovery, more innovation that’s robust and replicable. And Evans has some ideas of how to reorganize our system to get us there, one of which is really out of the box, literally. Those ideas after the break.
Paul Rand: Have you ever wondered what goes on inside a black hole, or why time only moves in one direction, or what is really so weird about quantum mechanics? Well, you should listen to Why This Universe? On this podcast, you’ll hear about the strangest and most interesting ideas in physics broken down by physicists Dan Hooper and Shalma Wegsman. If you want to learn about our universe from the quantum to the cosmic, you won’t want to miss Why This Universe? Part of the University of Chicago Podcast Network.
Paul Rand: There’s an underlying theme you may have noticed to all of Evans’ work.
James Evans: I think one is diversity.
Paul Rand: And when Evans says diversity, he doesn’t just mean diversity of identity, although that is part of it, but diversity in every sense of the word. All of the Knowledge Lab’s models point to diversity being a major driver of innovation.
James Evans: We have a paper that will be coming out very soon, but it’s on the arXiv. We look at surprising discoveries. So we basically build a model of expectation that kind of predicts new configurations and things that will show up in next year’s papers and patents. And then we identify those that are the least likely, the most improbable to show up under that most probable model. And then we find that these actually are surprising to the people. People, scientists who annotate these papers say, “This is really controversial. This is really a surprising discovery.”
James Evans: The question is: Where do these come from? The majority of these things come from, or the most predictive ones come from where a group of people from one discipline or background travels over to another space, and publishes something in a journal, solving a problem that no one like them has looked at before. No one in that space has seen someone like them before. So these are low probability bets. I mean, these are the things that succeed where the vast majority of such hypotheses fail.
Paul Rand: Physicists think differently than biologists, who think differently than chemists. And usually, each wants to invest their time and energy building a knowledge and a reputation in their respective fields. But it’s when they take a bet, bringing their knowledge to a completely foreign space, creating a diversity of expertise, that we actually get better results.
James Evans: This is a bet that this very distant method is going to help solve this problem. And I would say this is an opportunity. This is something that we didn’t have even 100 years ago because of the density of science and the diversity of fields. Now we’re in a place where when you can’t solve your problem, typically it ends up becoming solved not by looking harder at nature, but by basically transferring insights and theories that came from a completely disconnected other place in the scientific enterprise. So this is, I think of it as another aspect of diversity.
Paul Rand: There’s another project that Evans did at The Knowledge Lab to investigate the power of diversity to generate more robust results. But it took them outside of the sciences and into the realm of politics.
James Evans: We’re not just interested in science and technology, but also civic knowledge. I mean, what do people believe about the world? How does that impact their actions that they take in the context of public health crises, and even following and creating laws that are useful and conventions?
Paul Rand: Evans wanted to know if diversity of political opinion would lead to better knowledge creation. So to find answers, they turned to Wikipedia.
James Evans: So one of the things that we explored in the context of Wikipedia was whether diversity, in this case political diversity, the different exposures to knowledge that different political positions would have, would make stronger, more robust pages that were perceived as higher quality by independent evaluators.
Paul Rand: Basically, are Wikipedia pages that were written by both right-wing and left-wing people more accurate than pages written by just one side or the other?
James Evans: And that was exactly the case. And we showed that when that’s controllable, the conversations between people with these diverse knowledge exposures do end up producing more even handed, thorough, deep, clear knowledge products in this context.
Paul Rand: In a world in which politics seem to be ripping our country apart, the results of this paper beg the question. Could harnessing the power of diversity underlying political polarization actually be useful?
James Evans: One of the critical things now is for us not just to think about these issues in the context of science and technological advance, but also in the context of civic knowledge more broadly. I mean, our governments are inventions. And we’re all engaged in the project and the collective action of kind of supporting and breaking down and kind of building anew these kinds of institutions. And so I really feel like for understanding knowledge more broadly, we need to think of these things in similar ways and really try to explore and support I would say the science of democracy, the science of these kinds of social inventions, which are so critical for continuing support and sustaining of technological and scientific advance. These are even more critical. Right? If they’re not present, they’re the things that fund other classes of discovery and they operate in similar ways. And so it’s critical to understand how all these things fit together. That’s what’s going to support the long-term ability to bet on science and technology.
Paul Rand: Evans wants to take the power of diversity beyond just independent versus concentrated teams, or taking bets on field-mixing studies, way beyond. He says if we really want to supercharge our ability to make new discoveries, we may need to create whole new intelligences, intelligences that are so diverse, they don’t even think like us.
James Evans: I think there are a couple of frames of AI. I would say the classic framing of artificial intelligence in 1955, when it was the basis of a conference at Dartmouth, the idea was that we were going to try to create intelligences in kind of the Alan Turing model, following the imitation game idea. We’re going to try to create intelligences that can think creatively like humans, that can solve problems like humans solve problems. We’re going to create artificial humanoid and human intelligences. And I think we’re in a different place today. For, I would say, competing, including competing against nature, or for creating something that’s fundamentally complementary to human intelligence and capacity, we might want to do something that may be close to the opposite of that, which is to say to understand how it is that humans individually and collectively think, and then build alien intelligences, alternative or complementary intelligences that can maximize our capacities by doing things that we can’t do.
Paul Rand: Give me an example of what an opposite of human intelligence could operate like and how it would work.
James Evans: I have one example that we’ve recently engaged in that works. So I mentioned how we were improving our predictions. So one of the things we thought at that point, so if we just follow where scientists are going to go in the next period, we can dramatically improve our prediction of future scientific discoveries. But it becomes a little less thrilling when you realize that you’re predicting that because you’re predicting the activity that inertial scientists are going to make in the next five or 10 years. You’re basically scooping scientists.
James Evans: It’s neither ethical, in some ways, nor an efficient use of computational intelligence. So we thought, “Well, what if we actually tried to avoid scientists in this space?” So if we actually tried to identify potential discoveries that are equally scientifically promising, but are not likely to be discovered just because the distribution of people are not sitting there to make those inferences. So it’s kind of like it’s ... Imagine the world is as it is, except we bet on different disciplines. And so if we build a set of predictions from that, and then we took these first principles simulations of these properties, which are used in the sciences, in material science and genetic medicine to predict the efficacy, for example, of these things, and we look at the discoveries that are made by these alien intelligences, they end up being much more likely even than discoveries that are made and published in the scientific literature.
Paul Rand: Much more likely to be successful?
James Evans: Much more likely to be successful, as gauged by these first principle simulations of these properties, which I would say is a conservative standard, one that’s used by scientists in these fields. So I think even based on existing theories, these things that are discovered by aliens that are very unlikely to be discovered by any particular person because there’s no person who sits there to make the inference.
Paul Rand: I see.
James Evans: Are more likely, and the reason they’re more likely is because of, again, diminishing marginal returns to theory. These are things that if they would be discovered, the things that we discover through this method are not discoverable for years and sometimes decades after because the whole system of science has to reconfigure and educate people who sit in those boundaries for anyone to be able to discover that possibility. So we’re certainly exploring and promoting this, and I think we’re also interested in the idea of looking at fundamental human limitations of capacity.
Paul Rand: Out of all of these concerns, there’s one that Evans says is the most fundamental to holding us back from creating a system focused on innovation, just as it’s fundamental to each of our individual brains, the brain of our system of science suffers from it too, the fear of failure.
James Evans: We need to invest in failure. So I think this is a deeply challenging issue, is that so for places like The National Science Foundation, The National Institutes of Health, they don’t want some senator or congressperson digging up grants, coming up with seemingly ridiculous and ultimately failed approaches that got funding from the federal government. But we know from our research that these failures are tightly and highly correlated with dramatic success.
Paul Rand: But that failure is antithetical to the professionalization of science. Failure means jeopardizing your tenure track or making your research institution look bad, or having to justify a failed project to funders.
James Evans: Those are people’s careers. Those are institutions’ reputations. Right? Those are big funding institutions’ habits, and so breaking these down is a challenge. One of the things that we see is that institutions defend themselves. Once you have a success, that in some ways decreases the likelihood of another dramatic success because it has the potential to destroy all the capacity that was in the prior thing. That was this idea of creative destruction. So this is the challenge, is to build basically a kind of a techno-scientific environment where we’re surfing that boundary between order and chaos, where we’re basically undertaking and willing to undertake future potential failures, with not only the willingness, but the expectation that successes, dramatic successes, are going to deconstruct the value of prior successes.
Paul Rand: So if you were going to get to, you have to get to the funders more than anyone else to get them to ask for something different than what they’re currently funding and what they’re currently being pitched. Is that the crux of this?
James Evans: I think that the funders are a critical place to be. And the reason is exactly as you suggest. I mean, right now, if the incentive is for people to survive to do really incremental and conservative science, that’s what we’re going to produce at a large scale, unless the funders, who are really driving in many areas, not all areas, but in many areas of science, advance, then they’re going to have to make commitments. And those commitments are going to have to be, rather than just funding the marginal discovery in this big institute, we’re going to have to fund another approach that’s independent of that. We’re going to have to fund people that don’t come from that network of highly previously successful persons. All these bets are going to entail failure. And so I think really part of it’s a public education campaign, not just the funders, but the society. Society has to understand that if we want to fund innovation, we have to fund failures. These things go hand in hand. Failures are not a sign of failure as an institution. I think that’s probably one of the biggest challenges.
Matt Hodapp: Big Brains is a production of the University of Chicago Podcast Network. If you like what you heard, please leave us a rating and a review. The show is hosted by Paul M. Rand and produced by me, Matt Hodapp and Lea Ceasrine. Thanks for listening.