The underinformative formulation of conditional probability

Laura Macchi; Maria Bagassi

doi:10.1017/S0140525X07001823

The underinformative formulation of conditional probability

Published online by Cambridge University Press: 29 October 2007

Laura Macchi and

Maria Bagassi

Show author details

Laura Macchi: Affiliation:
Department of Psychology, University of Milano-Bicocca, 20126 Milan, Italy. laura.macchi@unimib.itmaria.bagassi@unimib.it
Maria Bagassi: Affiliation:
Department of Psychology, University of Milano-Bicocca, 20126 Milan, Italy. laura.macchi@unimib.itmaria.bagassi@unimib.it

Article contents

Abstract
Footnotes
References

Rights & Permissions

Abstract

The formulation of the conditional probability in classical tasks does not guarantee the effective transmission of the independence of the hit rate from the base rate. In these kinds of tasks, data are all available, but subjects are able to understand them in the specific meanings proper to a specialized language only if these are adequately transmitted. From this perspective, the partitive formulation should not be considered a facilitation, but rather, a way of effectively transmitting the conditional probability.

Consider the following two phrases:

1 The death-rate among men is twice that for women.
2 In the deaths registered last month there were twice as many men as women.

Are these two different ways of saying the same or are these different events? In fact, they are different events. (Lindley 1985, p. 44)

Type: Open Peer Commentary
Information: Behavioral and Brain Sciences , Volume 30 , Issue 3 , June 2007 , pp. 274 - 275

DOI: https://doi.org/10.1017/S0140525X07001823 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2007

The two phrases presented by Lindley describe two completely different probabilities which are connected to the same pair of events (P[Death/Men] and P[Men/Death]). In the first phrase, the probability is about the rate of mortality, given the gender; in the second one, the probability is about the rate of gender, given the mortality. Whereas the second phrase implies that P(M/D) is equal to 2/3, the first does not.Footnote ¹ The confusion between these two notions is a very common phenomenon, and has great implications for reasoning and decision making. According to Dawes (Reference Dawes1988, p.80), “words are poor vehicles for discussing inverse probabilities.” A main question is about the nature of this confusion and, consequently, the understanding and the use of the conditional probability.

We propose a pragmatic explanation of the phenomenon of the confusion between inverse probabilities and of the base-rate fallacy. In particular, concerning the kinds of problems considered in the literature on the base-rate fallacy, a pragmatic analysis of the texts/problems allowed us to identify, as responsible for the fallacy, the ambiguous formulation of a likelihood, instead of an intrinsic difficulty to reason in Bayesian terms (Macchi Reference Macchi1995; Reference Macchi2000; Macchi & Mosconi Reference Macchi and Mosconi1998).

Let us consider, for example, three kinds of problems formulated as follows:

“If a woman has breast cancer, the probability is 80% that she will get a positive mammography.” (Medical Diagnosis problem)
“The percentage of deaths by suicide is three times higher among single individuals than among married individuals.” (Suicide problem)
“The witness made correct identifications in 80% of the cases and erred in 20% of the cases.” (Cab problem)

This sort of formulation does not seem to express the intrinsic nature of a conditional probability, which conditions an event (A) to the occurrence of another event (B). Nor does this sort of formulation represent the cases in which, given the occurrence of B, A also occurs. In other words, it transmits just a generic association of events: A & B. For statistically naive subjects, this kind of formulation is not informative even if all the data were literally spelled out. The distinction between sentence and utterance is at the core of Grice's communication theory, according to which phrases imply and mean more than what they literally say (Grice Reference Grice, Cole and Morgan1975). What is implied is the outcome of an inferential process, in which what is said is interpreted in the light of the intentions attributed to the speaker and of the context (unavoidably elicited and determined by any communications).

Common language is ambiguous in itself. The understanding of what a speaker means requires a disambiguation process, centered on the attribution of intention. Differently, specialized languages (e.g., logical and statistical ones) presuppose an univocal, unambiguous interpretation (the utterance corresponds to the sentence).

The formulation of these kinds of problems uses common language. Data are all available, but subjects are able to understand them in the specific meanings proper to a specialized language, only if they are adequately transmitted. Then, the particular interpretation of the data, required for a correct solution, needs a “particularized,” marked formulation (see Grice Reference Grice, Cole and Morgan1975; Levinson Reference Levinson and Goody1995; Reference Stanovich and West2000).

In the sentences from Lindley (Reference Lindley1985) quoted earlier, the effective transmission of the independence of the hit rate from the base rate does not seem guaranteed. This is a crucial assumption for proper Bayesian analysis (Birnbaum Reference Birnbaum1983), because the a posteriori probability P(H/D) is calculated on the basis of the base rate and is therefore dependent upon it. If the hit rate depended on the base rate, it would already include it and, if this were the case, we would already have the a posteriori probability and it would be unnecessary to consider the base rate itself. This is what often underlies the base-rate fallacy, which consists of a failure to consider the base rate on account of the privileging of hit-rate information. In our view, the confusion sometimes generated between the hit rate and a posteriori probability is due to an unmarked formulation of conditional probability, or, in other words, to the absence of a partitive formulation.

If this is true, the partitive formulation should not be considered a facilitation, as Barbey & Sloman (B&S) argue, but a way of transmitting a particular information, able to translate a specialized language into a common, natural language, differently from the “step by step” question form used to compute the Bayesian ratio (adopted by Girotto & Gonzalez Reference Girotto and Gonzalez2001). An example of partitive formulation of likelihood in the diagnoses problem is: “80 per cent of women who have breast cancer will get a positive mammography” (Macchi Reference Macchi, Hardman and Macchi2003).

The low performance with the Medical Diagnosis (MD) problem has usually been considered as evidence of the activation of System 1, which operates associatively (fast, automatic, effortless). Vice versa, the high performance with this kind of problem is ascribed to System 2, which is able to process rule-based inferences. However, in a recent study (Macchi et al. Reference Macchi, Bagassi and Ciociola2007), statistically sophisticated subjects who solved the MD problem (66% of 35 subjects) were not able to solve the computationally less complex Linda problem (only 14% of the subjects did not commit the conjunction fallacy). So, the conceptual distinction between two reasoning systems, which explains the biases recurring in System 1 and the normative performance as the result of the activation of System 2, gives rise to some doubts.

According to us, the ability of statistically sophisticated subjects to grasp the informativeness of the data and the aim of the task in Bayesian tasks is a pragmatic ability. Also, when subjects don't give the logical-normative solution to the Linda problem, they are again considering the informativeness of the data, which, in this instance, hinders the intent of the experimenter (concerning the inclusion-class rule), because of a misleading contextualization of the task.

We could further speculate that, instead of having an ability for decontextualizing the task (Stanovich & West 2000), those gifted subjects who give the normative solution to the Linda problem (14%) would have a high ability to understand which context the experimenter intended, thereby revealing an interactional intelligence.

Footnotes

1. Except when the number of men and women is the same.

References

Birnbaum, M. H. (1983) Base rates in Bayesian inference: Signal detection analysis of the cab problem. American Journal of Psychology 96:85–94.CrossRef Google Scholar

Dawes, R. M. (1988) Rational choice in an uncertain world. Harcourt Brace Jovanovich.Google Scholar

Girotto, V. & Gonzalez, M. (2001) Solving probabilistic and statistical problems: A matter of information structure and question form. Cognition 78:247–76.CrossRef Google Scholar PubMed

Grice, H. P. (1975) Logic and conversation. In: Syntax and semantics: Vol. 3. Speech acts, ed Cole, P. & Morgan, J. L..Academic Press.Google Scholar

Levinson, S. C. (1995) Interactional biases in human thinking. In: Social intelligence and interaction, ed Goody, E..Cambridge University Press.Google Scholar

Larkin, J. H. & Simon, H. A. (2000) Presumptive meanings. MIT Press.Google Scholar

Lindley, D. V. (1985) Making decisions. Wiley.Google Scholar

Macchi, L. (1995) Pragmatic aspects of the base rate fallacy. The Quarterly Journal of Experimental Psychology 48 A(1):188–207.CrossRef Google Scholar

Macchi, L. (2000) Partitive formulation of information in probabilistic problems: Beyond heuristics and frequency format explanations. Organizational Behavior and Human Decision Processes 82:217–36.CrossRef Google Scholar PubMed

Macchi, L. (2003) The partitive conditional probability. In: Thinking: Psychological perspectives on reasoning, judgment and decision making, ed Hardman, D. & Macchi, L., pp. 165–87. Wiley.CrossRef Google Scholar

Macchi, L., Bagassi, M. & Ciociola, P. B. (2007) The pragmatic approach versus the dual-process theories on probabilistic reasoning. (Unpublished manuscript.)Google Scholar

Macchi, L. & Mosconi, G. (1998) Computational features vs frequentist phrasing in the base-rate fallacy. Swiss Journal of Psychology 57:79–85.Google Scholar

Stanovich, K. E.West, R. F. (2000) Individual differences in reasoning: Implications for the rationality debate. Behavioral and Brain Sciences 23:645–726.CrossRef Google Scholar PubMed

Article contents

The underinformative formulation of conditional probability

Abstract

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests