Article: Hazards of Confidence


LWPD
Director of Archives

L-FSCE/Open Source Architect

Posts: 6,605


Post by LWPD on Oct 30, 2011 8:16:28 GMT -5

"When a compelling impression of a particular event clashes with general knowledge, the impression commonly prevails. And this goes for you, too. The confidence you will experience in your future judgments will not be diminished by what you just read, even if you believe every word."

Below is an excellent article on a common cognitive fallacy, the illusion of validity. WYSIATI (what you see is all there is) is necessary to help people make sense of the world around them, but it fails as a predictive indicator because it doesn't allow for the myriad factors that aren't known about the future and/or aren't part of a given person's 'story'. The human mind is a powerful tool, and awareness of its built-in defense mechanisms when making intuitive judgments can prove helpful.

Courtesy of NY Times
Don’t Blink! The Hazards of Confidence
By Daniel Kahneman

Many decades ago I spent what seemed like a great deal of time under a scorching sun, watching groups of sweaty soldiers as they solved a problem. I was doing my national service in the Israeli Army at the time. I had completed an undergraduate degree in psychology, and after a year as an infantry officer, I was assigned to the army’s Psychology Branch, where one of my occasional duties was to help evaluate candidates for officer training. We used methods that were developed by the British Army in World War II. One test, called the leaderless group challenge, was conducted on an obstacle field. Eight candidates, strangers to one another, with all insignia of rank removed and only numbered tags to identify them, were instructed to lift a long log from the ground and haul it to a wall about six feet high. There, they were told that the entire group had to get to the other side of the wall without the log touching either the ground or the wall, and without anyone touching the wall. If any of these things happened, they were to acknowledge it and start again.

A common solution was for several men to reach the other side by crawling along the log as the other men held it up at an angle, like a giant fishing rod. Then one man would climb onto another’s shoulder and tip the log to the far side. The last two men would then have to jump up at the log, now suspended from the other side by those who had made it over, shinny their way along its length and then leap down safely once they crossed the wall. Failure was common at this point, which required starting over.

As a colleague and I monitored the exercise, we made note of who took charge, who tried to lead but was rebuffed, how much each soldier contributed to the group effort. We saw who seemed to be stubborn, submissive, arrogant, patient, hot-tempered, persistent or a quitter. We sometimes saw competitive spite when someone whose idea had been rejected by the group no longer worked very hard. And we saw reactions to crisis: who berated a comrade whose mistake caused the whole group to fail, who stepped forward to lead when the exhausted team had to start over. Under the stress of the event, we felt, each man’s true nature revealed itself in sharp relief.

After watching the candidates go through several such tests, we had to summarize our impressions of the soldiers’ leadership abilities with a grade and determine who would be eligible for officer training. We spent some time discussing each case and reviewing our impressions. The task was not difficult, because we had already seen each of these soldiers’ leadership skills. Some of the men looked like strong leaders, others seemed like wimps or arrogant fools, others mediocre but not hopeless. Quite a few appeared to be so weak that we ruled them out as officer candidates. When our multiple observations of each candidate converged on a coherent picture, we were completely confident in our evaluations and believed that what we saw pointed directly to the future. The soldier who took over when the group was in trouble and led the team over the wall was a leader at that moment. The obvious best guess about how he would do in training, or in combat, was that he would be as effective as he had been at the wall. Any other prediction seemed inconsistent with what we saw.

Because our impressions of how well each soldier performed were generally coherent and clear, our formal predictions were just as definite. We rarely experienced doubt or conflicting impressions. We were quite willing to declare: “This one will never make it,” “That fellow is rather mediocre, but should do O.K.” or “He will be a star.” We felt no need to question our forecasts, moderate them or equivocate. If challenged, however, we were fully prepared to admit, “But of course anything could happen.”

We were willing to make that admission because, as it turned out, despite our certainty about the potential of individual candidates, our forecasts were largely useless. The evidence was overwhelming. Every few months we had a feedback session in which we could compare our evaluations of future cadets with the judgments of their commanders at the officer-training school. The story was always the same: our ability to predict performance at the school was negligible. Our forecasts were better than blind guesses, but not by much.

We were downcast for a while after receiving the discouraging news. But this was the army. Useful or not, there was a routine to be followed, and there were orders to be obeyed. Another batch of candidates would arrive the next day. We took them to the obstacle field, we faced them with the wall, they lifted the log and within a few minutes we saw their true natures revealed, as clearly as ever. The dismal truth about the quality of our predictions had no effect whatsoever on how we evaluated new candidates and very little effect on the confidence we had in our judgments and predictions.

I thought that what was happening to us was remarkable. The statistical evidence of our failure should have shaken our confidence in our judgments of particular candidates, but it did not. It should also have caused us to moderate our predictions, but it did not. We knew as a general fact that our predictions were little better than random guesses, but we continued to feel and act as if each particular prediction was valid. I was reminded of visual illusions, which remain compelling even when you know that what you see is false. I was so struck by the analogy that I coined a term for our experience: the illusion of validity.

I had discovered my first cognitive fallacy.

Decades later, I can see many of the central themes of my thinking about judgment in that old experience. One of these themes is that people who face a difficult question often answer an easier one instead, without realizing it. We were required to predict a soldier’s performance in officer training and in combat, but we did so by evaluating his behavior over one hour in an artificial situation. This was a perfect instance of a general rule that I call WYSIATI, “What you see is all there is.” We had made up a story from the little we knew but had no way to allow for what we did not know about the individual’s future, which was almost everything that would actually matter. When you know as little as we did, you should not make extreme predictions like “He will be a star.” The stars we saw on the obstacle field were most likely accidental flickers, in which a coincidence of random events — like who was near the wall — largely determined who became a leader. Other events — some of them also random — would determine later success in training and combat.

You may be surprised by our failure: it is natural to expect the same leadership ability to manifest itself in various situations. But the exaggerated expectation of consistency is a common error. We are prone to think that the world is more regular and predictable than it really is, because our memory automatically and continuously maintains a story about what is going on, and because the rules of memory tend to make that story as coherent as possible and to suppress alternatives. Fast thinking is not prone to doubt.

The confidence we experience as we make a judgment is not a reasoned evaluation of the probability that it is right. Confidence is a feeling, one determined mostly by the coherence of the story and by the ease with which it comes to mind, even when the evidence for the story is sparse and unreliable. The bias toward coherence favors overconfidence. An individual who expresses high confidence probably has a good story, which may or may not be true.

I coined the term “illusion of validity” because the confidence we had in judgments about individual soldiers was not affected by a statistical fact we knew to be true — that our predictions were unrelated to the truth. This is not an isolated observation. When a compelling impression of a particular event clashes with general knowledge, the impression commonly prevails. And this goes for you, too. The confidence you will experience in your future judgments will not be diminished by what you just read, even if you believe every word.

I first visited a Wall Street firm in 1984. I was there with my longtime collaborator Amos Tversky, who died in 1996, and our friend Richard Thaler, now a guru of behavioral economics. Our host, a senior investment manager, had invited us to discuss the role of judgment biases in investing. I knew so little about finance at the time that I had no idea what to ask him, but I remember one exchange. “When you sell a stock,” I asked him, “who buys it?” He answered with a wave in the vague direction of the window, indicating that he expected the buyer to be someone else very much like him. That was odd: because most buyers and sellers know that they have the same information as one another, what made one person buy and the other sell? Buyers think the price is too low and likely to rise; sellers think the price is high and likely to drop. The puzzle is why buyers and sellers alike think that the current price is wrong.

Most people in the investment business have read Burton Malkiel’s wonderful book “A Random Walk Down Wall Street.” Malkiel’s central idea is that a stock’s price incorporates all the available knowledge about the value of the company and the best predictions about the future of the stock. If some people believe that the price of a stock will be higher tomorrow, they will buy more of it today. This, in turn, will cause its price to rise. If all assets in a market are correctly priced, no one can expect either to gain or to lose by trading.

We now know, however, that the theory is not quite right. Many individual investors lose consistently by trading, an achievement that a dart-throwing chimp could not match. The first demonstration of this startling conclusion was put forward by Terry Odean, a former student of mine who is now a finance professor at the University of California, Berkeley.

Odean analyzed the trading records of 10,000 brokerage accounts of individual investors over a seven-year period, allowing him to identify all instances in which an investor sold one stock and soon afterward bought another stock. By these actions the investor revealed that he (most of the investors were men) had a definite idea about the future of two stocks: he expected the stock that he bought to do better than the one he sold.

To determine whether those appraisals were well founded, Odean compared the returns of the two stocks over the following year. The results were unequivocally bad. On average, the shares investors sold did better than those they bought, by a very substantial margin: 3.3 percentage points per year, in addition to the significant costs of executing the trades. Some individuals did much better, others did much worse, but the large majority of individual investors would have done better by taking a nap rather than by acting on their ideas. In a paper titled “Trading Is Hazardous to Your Wealth,” Odean and his colleague Brad Barber showed that, on average, the most active traders had the poorest results, while those who traded the least earned the highest returns. In another paper, “Boys Will Be Boys,” they reported that men act on their useless ideas significantly more often than women do, and that as a result women achieve better investment results than men.

Of course, there is always someone on the other side of a transaction; in general, it’s a financial institution or professional investor, ready to take advantage of the mistakes that individual traders make. Further research by Barber and Odean has shed light on these mistakes. Individual investors like to lock in their gains; they sell “winners,” stocks whose prices have gone up, and they hang on to their losers. Unfortunately for them, in the short run going forward recent winners tend to do better than recent losers, so individuals sell the wrong stocks. They also buy the wrong stocks. Individual investors predictably flock to stocks in companies that are in the news. Professional investors are more selective in responding to news. These findings provide some justification for the label of “smart money” that finance professionals apply to themselves.

Although professionals are able to extract a considerable amount of wealth from amateurs, few stock pickers, if any, have the skill needed to beat the market consistently, year after year. The diagnostic for the existence of any skill is the consistency of individual differences in achievement. The logic is simple: if individual differences in any one year are due entirely to luck, the ranking of investors and funds will vary erratically and the year-to-year correlation will be zero. Where there is skill, however, the rankings will be more stable. The persistence of individual differences is the measure by which we confirm the existence of skill among golfers, orthodontists or speedy toll collectors on the turnpike.

Mutual funds are run by highly experienced and hard-working professionals who buy and sell stocks to achieve the best possible results for their clients. Nevertheless, the evidence from more than 50 years of research is conclusive: for a large majority of fund managers, the selection of stocks is more like rolling dice than like playing poker. At least two out of every three mutual funds underperform the overall market in any given year.

More important, the year-to-year correlation among the outcomes of mutual funds is very small, barely different from zero. The funds that were successful in any given year were mostly lucky; they had a good roll of the dice. There is general agreement among researchers that this is true for nearly all stock pickers, whether they know it or not — and most do not. The subjective experience of traders is that they are making sensible, educated guesses in a situation of great uncertainty. In highly efficient markets, however, educated guesses are not more accurate than blind guesses.

Some years after my introduction to the world of finance, I had an unusual opportunity to examine the illusion of skill up close. I was invited to speak to a group of investment advisers in a firm that provided financial advice and other services to very wealthy clients. I asked for some data to prepare my presentation and was granted a small treasure: a spreadsheet summarizing the investment outcomes of some 25 anonymous wealth advisers, for eight consecutive years. The advisers’ scores for each year were the main determinant of their year-end bonuses. It was a simple matter to rank the advisers by their performance and to answer a question: Did the same advisers consistently achieve better returns for their clients year after year? Did some advisers consistently display more skill than others?

To find the answer, I computed the correlations between the rankings of advisers in different years, comparing Year 1 with Year 2, Year 1 with Year 3 and so on up through Year 7 with Year 8. That yielded 28 correlations, one for each pair of years. While I was prepared to find little year-to-year consistency, I was still surprised to find that the average of the 28 correlations was .01. In other words, zero. The stability that would indicate differences in skill was not to be found. The results resembled what you would expect from a dice-rolling contest, not a game of skill.
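The pairwise comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the adviser data are simulated rather than taken from the article, and the function names and the `skill_sd` parameter are hypothetical. With 8 years there are 8 × 7 / 2 = 28 pairs of years, and when returns are pure luck the average rank correlation should land near zero.

```python
import itertools
import random
import statistics


def ranks(values):
    """Return the rank (1 = lowest) of each value; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def correlation(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def average_rank_correlation(returns_by_year):
    """Average the rank correlation over every pair of years."""
    yearly_ranks = [ranks(year) for year in returns_by_year]
    pairs = list(itertools.combinations(range(len(yearly_ranks)), 2))
    values = [correlation(yearly_ranks[a], yearly_ranks[b]) for a, b in pairs]
    return len(pairs), statistics.fmean(values)


def simulate_returns(n_advisers=25, n_years=8, skill_sd=0.0, seed=0):
    """Hypothetical data: a fixed per-adviser skill plus fresh luck each year."""
    rng = random.Random(seed)
    skill = [rng.gauss(0, skill_sd) for _ in range(n_advisers)]
    return [[s + rng.gauss(0, 1) for s in skill] for _ in range(n_years)]


# Luck only (skill_sd=0) versus a world where skill differences dominate.
n_pairs, luck_only = average_rank_correlation(simulate_returns())
_, with_skill = average_rank_correlation(simulate_returns(skill_sd=2.0))
print(n_pairs, round(luck_only, 2), round(with_skill, 2))
```

Raising `skill_sd` makes the rankings stable from year to year, which is the persistence the article says was missing from the real spreadsheet.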

No one in the firm seemed to be aware of the nature of the game that its stock pickers were playing. The advisers themselves felt they were competent professionals performing a task that was difficult but not impossible, and their superiors agreed. On the evening before the seminar, Richard Thaler and I had dinner with some of the top executives of the firm, the people who decide on the size of bonuses. We asked them to guess the year-to-year correlation in the rankings of individual advisers. They thought they knew what was coming and smiled as they said, “not very high” or “performance certainly fluctuates.” It quickly became clear, however, that no one expected the average correlation to be zero.

What we told the directors of the firm was that, at least when it came to building portfolios, the firm was rewarding luck as if it were skill. This should have been shocking news to them, but it was not. There was no sign that they disbelieved us. How could they? After all, we had analyzed their own results, and they were certainly sophisticated enough to appreciate their implications, which we politely refrained from spelling out. We all went on calmly with our dinner, and I am quite sure that both our findings and their implications were quickly swept under the rug and that life in the firm went on just as before. The illusion of skill is not only an individual aberration; it is deeply ingrained in the culture of the industry. Facts that challenge such basic assumptions — and thereby threaten people’s livelihood and self-esteem — are simply not absorbed. The mind does not digest them. This is particularly true of statistical studies of performance, which provide general facts that people will ignore if they conflict with their personal experience.

The next morning, we reported the findings to the advisers, and their response was equally bland. Their personal experience of exercising careful professional judgment on complex problems was far more compelling to them than an obscure statistical result. When we were done, one executive I dined with the previous evening drove me to the airport. He told me, with a trace of defensiveness, “I have done very well for the firm, and no one can take that away from me.” I smiled and said nothing. But I thought, privately: Well, I took it away from you this morning. If your success was due mostly to chance, how much credit are you entitled to take for it?

We often interact with professionals who exercise their judgment with evident confidence, sometimes priding themselves on the power of their intuition. In a world rife with illusions of validity and skill, can we trust them? How do we distinguish the justified confidence of experts from the sincere overconfidence of professionals who do not know they are out of their depth? We can believe an expert who admits uncertainty but cannot take expressions of high confidence at face value. As I first learned on the obstacle field, people come up with coherent stories and confident predictions even when they know little or nothing. Overconfidence arises because people are often blind to their own blindness.

True intuitive expertise is learned from prolonged experience with good feedback on mistakes. You are probably an expert in guessing your spouse’s mood from one word on the telephone; chess players find a strong move in a single glance at a complex position; and true legends of instant diagnoses are common among physicians. To know whether you can trust a particular intuitive judgment, there are two questions you should ask: Is the environment in which the judgment is made sufficiently regular to enable predictions from the available evidence? The answer is yes for diagnosticians, no for stock pickers. Do the professionals have an adequate opportunity to learn the cues and the regularities? The answer here depends on the professionals’ experience and on the quality and speed with which they discover their mistakes. Anesthesiologists have a better chance to develop intuitions than radiologists do. Many of the professionals we encounter easily pass both tests, and their off-the-cuff judgments deserve to be taken seriously. In general, however, you should not take assertive and confident people at their own evaluation unless you have independent reason to believe that they know what they are talking about. Unfortunately, this advice is difficult to follow: overconfident professionals sincerely believe they have expertise, act as experts and look like experts. You will have to struggle to remind yourself that they may be in the grip of an illusion.

L-FSCE Video Library

GWF 2098 Booked On Death Ground: 1 Year Finished In 8 Weeks!

Honest Effort: The Trials of Lincoln


Post by LWPD on Dec 11, 2011 17:12:19 GMT -5

Heuristics allow people to make decisions at a faster pace, but taking shortcuts can often come at the expense of reaching accurate conclusions. Being aware of the role cognitive biases can play in the thinking process is the best defense against suffering too deeply from their possible consequences. Vanity Fair recently profiled the work of Nobel Prize winner Daniel Kahneman and looked at his most recent book Thinking, Fast and Slow. Highly recommended for anyone interested in theories as to how the human mind works.

Video: Thinking Fast and Slow - Daniel Kahneman in conversation with Professor Lord Richard Layard:

Courtesy of Vanity Fair
The Quiz Daniel Kahneman Wants You to Fail
By Jaime Lalinde

In the December 2011 issue of Vanity Fair, Michael Lewis profiles Nobel Prize–winning psychologist Daniel Kahneman, who pioneered research into “heuristics,” or the shortcuts humans use when making decisions. Below, take our quiz to see how your own mind works. Plainly put, a “heuristic” is a tool we use to simplify the decision-making process. For example, if you’re driving in the United Kingdom for the first time and don’t know the traffic laws, heuristics might help you correctly assume that a green light means go and a red light means stop. By applying what you already know about driving in America, you won’t have to waste hours reading up on England’s traffic laws. However, that same heuristic could prove harmful if you start driving in the right-hand lane, against traffic.

Research psychologist Daniel Kahneman—Nobel Prize winner, and the subject of Michael Lewis’s article in this month’s issue, “The King of Human Error”—spent a great part of his life’s work discovering and cataloging the heuristics people use. Specifically, he concentrated on the situations where they lead us astray. By nature, heuristics are both useful and inaccurate; our minds have developed them to deal with a wide-ranging set of problems.

In Kahneman’s forthcoming book, Thinking, Fast and Slow, he separates the thinking process into two types: System 1, in which efficiency comes at the cost of accuracy, and System 2, which requires a lot of focus and can sometimes prevent System 1 from making mistakes. When you’re asked what “2 + 2” equals, System 1 takes over, but when you’re asked what “17 × 24” equals, System 2 takes the reins. The questions in this quiz are designed to trigger System 1, which relies heavily on intuition to provide us with answers that we perceive to be correct. Whenever you find yourself “going with your gut,” that’s System 1, often standing in the way of rational thought. It’s no wonder that the word “heuristic” shares its root with the word “eureka.” Go ahead and take this quiz, based (loosely) on Kahneman’s four decades of research; follow your gut and see just how wrong you are.
________________________________

5 Question Quiz

1. A town has two hospitals: one large and one small. Assuming there is an equal number of boys and girls born every year in the United States, which hospital is more likely to have close to 50 percent girls and 50 percent boys born on any given day?

A. The larger
B. The smaller
C. About the same (say, within 5 percent of each other)

_______________________________

The knee-jerk reaction is to select answer C; we expect things to follow a proven pattern regardless of size. But size matters. A small sample size (i.e., the small hospital) will often contain extreme proportions, while a large sample size (i.e., the large hospital) will more likely reflect real-world distributions. The heuristic shown here can be used to understand some forms of prejudice—if you haven’t been exposed to a large number of people from a certain group, you’re more likely to have incorrect assumptions about them. When you do not account for the size of a sample, Kahneman and his colleague Amos Tversky say, you have used the “representativeness heuristic.”
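The sample-size effect can be checked with an exact binomial calculation. The sketch below is only illustrative: the quiz gives no birth counts, so the figures of 15 and 45 births per day are assumptions, as are the function name and the reading of "close to 50 percent" as within 5 percentage points. Each birth is treated as an independent coin flip.

```python
from fractions import Fraction
from math import comb


def prob_near_even(n_births, tolerance=Fraction(1, 20)):
    """Exact probability that the share of girls is within `tolerance` of 1/2,
    assuming each birth is independently a girl with probability 1/2."""
    half = Fraction(1, 2)
    favorable = sum(
        comb(n_births, girls)
        for girls in range(n_births + 1)
        if abs(Fraction(girls, n_births) - half) <= tolerance
    )
    return favorable / 2 ** n_births


# Hypothetical daily birth counts for the small and large hospital.
for n in (15, 45, 450):
    print(n, round(prob_near_even(n), 3))
```

The probability of landing near an even split rises with the number of births, which is why the larger hospital is the better bet.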
_______________________________

2. A team of psychologists performed personality tests on 100 professionals, of which 30 were engineers and 70 were lawyers. Brief descriptions were written for each subject. The following is a sample of one of the resulting descriptions:

Jack is a 45-year-old man. He is married and has four children. He is generally conservative, careful, and ambitious. He shows no interest in political and social issues and spends most of his free time on his many hobbies, which include home carpentry, sailing, and mathematics.

What is the probability that Jack is one of the 30 engineers?

A. 10–40 percent
B. 40–60 percent
C. 60–80 percent
D. 80–100 percent
_______________________________

If you answered anything but A (the correct response being precisely 30 percent), you have fallen victim to the representativeness heuristic again, despite having just read about it. When Kahneman and Tversky performed this experiment, they found that a large percentage of participants overestimated the likelihood that Jack was an engineer, even though mathematically, there was only a 30-in-100 chance of that being true. This proclivity for attaching ourselves to rich details, especially ones that we believe are typical of a certain kind of person (i.e., all engineers must spend every weekend doing math puzzles), is yet another shortcoming of the hyper-efficient System 1.
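The 30 percent figure follows from Bayes' rule when the description is treated as carrying no diagnostic weight. The sketch below uses the odds form of the rule; the `likelihood_ratio` parameter (how much more likely the description is for an engineer than for a lawyer) is an assumption introduced here for illustration and does not come from the quiz.

```python
def posterior(prior, likelihood_ratio):
    """Bayes' rule in odds form: posterior odds = prior odds * likelihood ratio."""
    prior_odds = prior / (1 - prior)
    posterior_odds = prior_odds * likelihood_ratio
    return posterior_odds / (1 + posterior_odds)


base_rate = 30 / 100  # 30 engineers among the 100 professionals

# A ratio of 1 means the description fits engineers and lawyers equally well,
# so the estimate stays at the base rate.
uninformative = posterior(base_rate, likelihood_ratio=1)

# Hypothetical: a description judged three times as likely for an engineer.
suggestive = posterior(base_rate, likelihood_ratio=3)

print(round(uninformative, 4), round(suggestive, 4))
```

Even a description that does favor engineers has to be weighed against the base rate, which is the step intuition tends to skip.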

_______________________________

3a. How many dates did you have last month?

A. 1–3
B. 3–5
C. 0

3b. On a scale of 1 to 5, how happy are you these days (5 being the happiest)?

A. 1
B. 2
C. 3
D. 4
E. 5

_______________________________

Regardless of how you answered, it is likely that your answer to question (a) is positively correlated with your answer to question (b); that is, you rated your happiness higher if you had more dates and lower if you had fewer dates. However, when the order of these questions was reversed, as was done by two German researchers, people’s happiness became untethered from their dating life. This experiment demonstrates the brain’s deferral to System 1, the faster and easier of the two processes. When faced with an objective question (in this case, How many dates did you have last month?), followed by a subjective one (How happy are you these days?), people often simply carry over their answer for the first to the second. This heuristic is called substitution.
_______________________________

4. Imagine that you decided to see a play and you paid $10 for the admission price of one ticket. As you enter the theater, you discover that you have lost the ticket. The theater keeps no record of ticket purchasers, so the ticket cannot be recovered. Would you pay $10 for another ticket to the play?

A. Yes
B. No

_______________________________

If you answered no, as most people do, consider the following question:

Imagine that you decide to see a play and you will pay $10 for the admission price of one ticket at the door. As you enter the theater, you discover that you have lost a $10 bill. Would you still pay $10 for a ticket to the play?

If you answered yes to this analogous scenario (as both result in the net loss of $10), it’s likely you fell victim to what Kahneman and Tversky call the “framing effect”: being swayed by the way in which questions are worded rather than responding just to their substance. When Kahneman and Tversky performed this experiment in 1981, they found that 46 percent of participants would pay for another ticket, while 88 percent of participants would purchase the ticket in the analogous example mentioned above. The framing effect is also used to explain the influence of positive and negative information on our decisions—for example, why consumers prefer to buy ground beef labeled 80 percent lean rather than 20 percent fat.

_______________________________

5a. Choose between getting $900 for sure or a 90 percent chance of getting $1,000.

A. Getting $900
B. 90 percent chance of getting $1,000

5b. Choose between losing $900 for sure or a 90 percent chance of losing $1,000.

A. Losing $900
B. 90 percent chance of losing $1,000

_______________________________

The results of this simple problem set, for which most participants answer A and then B, were used to develop the thesis that would make Kahneman and Tversky famous: prospect theory. In a 1979 paper, they documented a peculiar behavioral tendency: when people faced a gain, they became risk averse; when they faced a loss, they became risk seeking. As a result of their discovery, Kahneman and Tversky debunked Bernoulli’s utility theory, a cornerstone of economic thought since the 18th century. (Bernoulli first proposed that a person’s willingness to gamble a certain amount of money was a product of how that amount related to his overall wealth; that is, $1 million means more to a millionaire than it does to a billionaire.)
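A short calculation shows why the typical answers are revealing: in both questions the sure thing and the gamble have the same expected value, so a preference either way reflects attitude toward risk rather than arithmetic. The sketch below also applies a simplified prospect-theory value function. The exponent 0.88 and loss-aversion factor 2.25 are the median estimates from Tversky and Kahneman's 1992 follow-up paper, not figures from this article, and probability weighting is left out, so treat it as an illustration rather than the full theory.

```python
def expected_value(outcomes):
    """Probability-weighted average of (probability, amount) pairs."""
    return sum(p * x for p, x in outcomes)


def value(x, alpha=0.88, loss_aversion=2.25):
    """Simplified value function: concave for gains, steeper and convex for losses."""
    if x >= 0:
        return x ** alpha
    return -loss_aversion * (-x) ** alpha


def prospect_value(outcomes):
    """Like expected_value, but on subjective values (no probability weighting)."""
    return sum(p * value(x) for p, x in outcomes)


sure_gain, gamble_gain = [(1.0, 900)], [(0.9, 1000), (0.1, 0)]
sure_loss, gamble_loss = [(1.0, -900)], [(0.9, -1000), (0.1, 0)]

for name, option in [("sure gain", sure_gain), ("gamble gain", gamble_gain),
                     ("sure loss", sure_loss), ("gamble loss", gamble_loss)]:
    print(name, round(expected_value(option), 1), round(prospect_value(option), 1))
```

Under these assumed parameters the sure gain scores higher than the gamble, while the gamble scores higher than the sure loss, matching the A-then-B pattern most participants show.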
_______________________________

Along with playing a large role in Kahneman’s being awarded the Nobel Prize in 2002, the theory also spawned a new academic pursuit, the field of behavioral economics. Prospect theory, Michael Lewis writes, explains “why people are less likely to sell their houses and their stock portfolios in falling markets; why, most sensationally, professional golfers become better putters when they’re trying to save par (avoid losing a stroke) than when they’re trying to make a birdie (and gain a stroke).”
