糖心破解版

Skip to main content
Markkula Center for Applied Ethics Homepage

Pride, Prejudice, and Predictions about People

ink bottle with quill

ink bottle with quill

On Avoiding Pitfalls

Irina Raicu

Irina Raicu is the director of the Internet Ethics program (@IEthics) at the Markkula Center for Applied Ethics. Views are her own.

It is a truth universally acknowledged鈥攐r at least a belief shared by many artificial intelligence and machine learning researchers鈥攖hat, given a vast database and sophisticated modeling, an algorithm will be able to predict the behavior of individual human beings.

Jane Austen might seem like the wrong authority to turn to in order to dispute this. But she does have some relevant insights on the topic.

You might remember that in her novel Pride and Prejudice Austen features a heroine named Elizabeth who has a lot of interactions with a character named Mr. Darcy, whom she eventually marries. Initially, though, over the course of several meetings, and from various stories that other characters tell her, Elizabeth collects information about Darcy that leads her to dislike him鈥攁nd to believe that he dislikes her in return. As a result, she is stunned when he eventually, suddenly, proposes to her. That was not the behavior that she would have predicted, from him.

Of course, her prediction was based on a limited amount of information about him鈥攕ome of it collected and delivered to her by motivated (biased) sources. The novel is thus a cautionary tale relevant to the unwarranted pride and prejudice embedded in the belief that AI can effectively anticipate individual people鈥檚 behavior.

In the Fall of 2020, two Princeton professors co-taught As they explained in a very useful , 鈥渞esearchers and companies have made many optimistic claims about the ability to predict phenomena ranging from crimes to earthquakes using data-driven, statistical methods. These claims are widely believed by the public and policy makers.鈥 Looking at the accuracy of current predictive algorithms, however, the professors (Arvind Narayanan and Matt Salganik) ask whether there are 鈥減ractical limits to predictions that will remain with us for the foreseeable future,鈥 and explain the importance of determining this: 鈥淚f we are entering a world where the future is predictable, we need to start preparing for the consequences, both good and bad. If, on the other hand, commercial claims are overhyped, we need the knowledge to push back effectively.鈥

In their essay, the professors offer several hypotheses about the limits of prediction. In regard to predictions about human beings鈥 actions, one of them seems particularly applicable: they call it 鈥淪hocks.鈥 鈥淟ife trajectories,鈥 they write, are 鈥渟ometimes upended by the kinds of inputs that seem likely to remain unmeasurable for the foreseeable future: a lottery jackpot; an accident; a crime of passion committed in the heat of the moment; a college admission for which one just made the cut. What is unclear is how common these are in the typical life course and to what extent they limit predictability.鈥 The range of 鈥渟hocks鈥 that substantially alter human behavior but are likely to remain unmeasurable inputs is much broader, though, and some are much more common: the birth of a child; emigration; illness; heartbreak; travel to different countries, etc.

In Pride and Prejudice, for example, we find out that Darcy鈥檚 behavior is greatly changed in part by the shock of Elizabeth鈥檚 stunned and angry refusal of his first marriage proposal. His second goes much better.

Of course, Elizabeth had not correctly anticipated the first one; that might be due in part to another factor that the professors list as a separate hypothesis for limits to predictions: In regard to human beings, this seems more of an axiom. As Narayanan and Salganik note, 鈥渞elevant attributes are often unavailable for prediction鈥; as an example, they add that 鈥渁s long as people鈥檚 thoughts remain inaccessible to predictive algorithms, that will impose limits to the predictability of some types of events.鈥 In Pride and Prejudice, Mr. Darcy is often accused of being inscrutable鈥攁nd in Jane Austen鈥檚 world many kinds of 鈥渋nputs鈥 were not to be said or done. Even today, though, many people are hard to read, and the variety of human responses to different situations and contexts makes it likely that we will all be at least sometimes misread. Unobserved or unobservable or incorrectly interpreted inputs are likely to limit predictions about human behavior forever.

The fact that Elizabeth bases her prediction of Darcy鈥檚 actions on an insufficient amount of information would fall under what the Princeton professors call and her acceptance of information about Darcy from some people who have reason to dislike him and misrepresent him points to the broader issue of bias in datasets, which are themselves, as researcher Solon Barocas has pointed out, 鈥.鈥 In the context of AI predictive models, these issues, too, might lead to inaccurate predictions about particular people or groups.

Ironically, in Pride and Prejudice, Elizabeth herself initially fails to heed her own insight. Early in the book, when someone argues that country society offers limited opportunities for 鈥渟tudiers of character鈥 because it is so 鈥渃onfined and unvarying,鈥 Elizabeth answers, 鈥淏ut people themselves alter so much, that there is something new to be observed in them for ever.鈥

People change; learn; impact each other. Technologists might call that 鈥渄rift.鈥

Of course, the proponents of AI/ML predictions claim specifically that vast datasets and carefully designed algorithms are an improvement over any Elizabeth鈥檚 analysis and will lead to predictions that are much more accurate than those made by human beings. As noted above, however, predictive models face many of the same limitations that human predictors do (in their essay, the Princeton professors list many more than the ones mentioned here). Unfortunately, in the case of algorithms, the limitations are often more hidden.

As Narayanan and Salganik stress, much of the current critique of predictive algorithms 鈥渉as rarely contested the predictive analytics industry鈥檚 claim that machine learning methods are delivering great improvements in accuracy compared to human experts and traditional statistics. Questioning that assumption changes the debate completely.鈥 (Note the reference to experts; part of the issue in the first part of Pride and Prejudice is that Elizabeth, though very smart and insightful, is very young and inexperienced, too. She, too, changes as she learns.)

We need to change, completely, the debate about the use of various AI tools to accurately predict individual human behavior. When it comes to anticipating the impact of predictive algorithms on society, write the Princeton professors, 鈥淸a] pitfall that鈥檚 just as common as failing to anticipate advances is to overreact by assuming that a breakthrough is just around the corner.鈥 Another pitfall is to claim, hubristically, that it鈥檚 already occurred.

Photo: by (cropped) is licensed under

Feb 23, 2021
--