Parallel attentive processing and pre-attentive guidance

Hermann J. Müller; Heinrich René Liesefeld; Rani Moran; Marius Usher

doi:10.1017/S0140525X16000194

Published online by Cambridge University Press: 24 May 2017

Hermann J. Müller: Affiliation:
Department of Psychology, Ludwig-Maximilians-Universität München, Munich D-80802, Germany; hmueller@psy.lmu.de; http://www.psy.lmu.de/exp/people/prof/mueller/
Department of Psychological Sciences, Birkbeck College, University of London, London WC1E 7HX, United Kingdom; http://www.bbk.ac.uk/psychology/our-staff/academic/hermann-muller
Heinrich René Liesefeld: Affiliation:
Department of Psychology, Ludwig-Maximilians-Universität München, Munich D-80802, Germany; heinrich.liesefeld@psy.lmu.de; http://www.psy.lmu.de/exp/people/ma/liesefeld_hr/
Rani Moran: Affiliation:
Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London WC1B 5EH, United Kingdom; rani.moran@gmail.com; https://iris.ucl.ac.uk/iris/browse/profile?upi=RMORA40
Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, United Kingdom
School of Psychological Sciences and Sagol School of Neuroscience, Tel Aviv University, Ramat Aviv, Tel-Aviv 69978, Israel
Marius Usher: Affiliation:
School of Psychological Sciences and Sagol School of Neuroscience, Tel Aviv University, Ramat Aviv, Tel-Aviv 69978, Israel; marius@post.tau.ac.il; https://en-social-sciences.tau.ac.il/profile/marius

Abstract

This commentary focuses on two related, open questions in Hulleman & Olivers' (H&O's) proposal: (1) the nature of the parallel attentive process that determines target presence within, and thus presumably the size of, the functional visual field, and (2) how the pre-attentive guidance mechanism must be conceived to also account for search performance in tasks that afford no reliable target-based guidance.

Type: Open Peer Commentary
Information: Behavioral and Brain Sciences, Volume 40, 2017, e149

DOI: https://doi.org/10.1017/S0140525X16000194
Copyright: Copyright © Cambridge University Press 2017

Hulleman & Olivers (H&O) make an interesting case for an approach that takes eye fixations, rather than individual items, as its central unit. Within the fixational “functional field of view” (FFV), items are processed in parallel. The size of the FFV is adjusted according to search (target discrimination) difficulty, which determines the number of fixations and thus reaction times (RTs). Although the arguments of H&O and of earlier authors (e.g., Zelinsky 2008) that eye movements and the FFV play a role in realistic visual search are persuasive, H&O's model leaves (1) the attentional process that detects targets and (2) the pre-attentive process that guides fixations underspecified. Here, we discuss (1) in relation to Humphreys and Müller's (1993) “Search via Recursive Rejection” (SERR) model (discussed by H&O in sect. 3.2), which, arguably, anticipated some of the ideas advocated by H&O, and (2) the need for a pre-attentive search-guidance mechanism in both SERR and H&O's model.

1. Like H&O's model, SERR deploys a sequence of parallel search steps to decide whether a target is present in the display. Whereas H&O are silent about the process that determines whether the target is present in each FFV region (a process their model treats as error-free), SERR, a connectionist implementation of Duncan and Humphreys' (1989) “Similarity Theory,” posits an error-prone mechanism. In SERR, the items (the target and the distractors) within some FFV of spatially parallel processing compete to activate their (higher-level) template representations. When the FFV contains multiple distractors of the same complex feature description, they are likely to win the competition over the single target, whereupon they are top-down suppressed “as a group.” This process operates recursively until either (1) the target activates its template, triggering a target-present (TP) decision; or (2) all items are “removed” from the FFV, leading to a target-absent (TA) decision. These dynamics are influenced by target–distractor similarity: The more similar the target is to (some of) the distractors, the more likely it is to be rejected along with a distractor group, yielding increasing miss rates. To bring the rate of target misses down to acceptable levels (matching those exhibited by humans), SERR must make several rechecking “runs” at the items in the FFV, until the target is either detected or consistently not found. Importantly, SERR produces miss rates that accelerate positively with the number of items in the FFV (especially with multiple distractor groups), in which case the rechecking strategy can become prohibitively expensive. As discussed by Humphreys & Müller (1993, p. 105), “A solution is to limit SERR's functional field so that there is a balance between the first-pass miss rate and the time cost incurred by rechecking”; this provides an explicit, error-based “rule” for adjusting the FFV size.
The adjusted FFV would then have to be deployed serially across the display (whether this involves covert or overt attention shifts). This resembles some of H&O's central ideas concerning discriminability-dependent FFV adjustments, which would be reflected in the number of attention shifts necessary to perform the task. As an aside, H&O are not quite right in stating that “the…empirical work [associated with SERR] focused on relatively shallow search slopes” (sect. 3.2, para. 3): Müller et al. (1994) present simulations of human slopes (with slope estimates derived from simulated mean RTs and RT distributions) ranging, for example, in their Experiment 1, from about 30 to well over 200 ms/item.
2. Given a need for overt or covert attention shifts, efficient search would require an element of pre-attentive “guidance” for the FFV to be directed to (only) the most “promising” regions of the display. In principle, guidance can be provided by a combination of bottom-up and top-down mechanisms, for example, through the computation of local feature-contrast signals and their summation, across dimensions, on some search-guiding “overall-saliency” or “priority” map of the field. Note that this map is generally conceived as a pre-attentive representation, even though it is subject to top-down (feature- and dimension- as well as memory-based) biasing. Notions of guidance are at the heart of models from the Guided-Search (GS) family, including our “Competitive GS” model (e.g., Liesefeld et al. 2016; Moran et al. 2013; 2016), and are well supported empirically. Although feature-contrast computations themselves are not necessarily “item-based” (see, e.g., Itti & Koch 2001), much of what is known about their workings stems from item-based search experiments! Arguably, then, as acknowledged by H&O (in sect. 6.6), their model (and SERR!) would need to incorporate some notion of “guidance” to fully account for human search performance, which would bring it closer into line with “traditional,” two-stage models of visual search like GS.
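The recursive-rejection dynamics described in point 1 can be made concrete with a toy simulation. The following is only a minimal sketch under simplifying assumptions and is not the SERR implementation: competition is reduced to a weighted random draw in which each item contributes one unit of support to its template, `similarity` is an assumed probability that the target is rejected together with a winning distractor group, and all function names are hypothetical. It is not expected to reproduce SERR's quantitative behaviour; it only illustrates how first-pass misses arise and how rechecking trades misses against time.

```python
import random


def single_pass(group_sizes, target_present, similarity, rng):
    """Run one toy recursive-rejection pass over the items in one FFV.

    group_sizes: positive sizes of the homogeneous distractor groups.
    similarity:  ASSUMED probability that the target is rejected together
                 with a winning distractor group (0 = never, 1 = always).
    Returns True for a target-present decision, False for target-absent.
    """
    groups = list(group_sizes)
    target_in_field = target_present
    while target_in_field or groups:
        # Competition (simplifying assumption): each item adds one unit of
        # support, so large groups tend to beat the single target.
        weights = groups + ([1] if target_in_field else [])
        winner = rng.choices(range(len(weights)), weights=weights)[0]
        if target_in_field and winner == len(groups):
            return True  # target activated its template: TP decision
        # A distractor group won and is suppressed "as a group".
        groups.pop(winner)
        if target_in_field and rng.random() < similarity:
            target_in_field = False  # target wrongly rejected with the group
    return False  # field emptied without a detection: TA decision


def search_field(group_sizes, target_present, similarity, runs, rng):
    """Recheck the field up to `runs` times; respond TP on the first detection.

    Returns (decision, number_of_passes_used).
    """
    for run in range(1, runs + 1):
        if single_pass(group_sizes, target_present, similarity, rng):
            return True, run
    return False, runs


def miss_rate(group_sizes, similarity, runs, trials=5000, seed=1):
    """Estimate the proportion of target-present trials that end in a miss."""
    rng = random.Random(seed)
    misses = sum(
        not search_field(group_sizes, True, similarity, runs, rng)[0]
        for _ in range(trials)
    )
    return misses / trials


if __name__ == "__main__":
    # Two distractor groups of equal size; vary the number of items per group.
    for per_group in (1, 2, 4, 8):
        sizes = [per_group, per_group]
        print(per_group * 2,
              miss_rate(sizes, similarity=0.3, runs=1),
              miss_rate(sizes, similarity=0.3, runs=3))
```

In this sketch, shrinking `group_sizes` (a smaller functional field) lowers the first-pass miss rate, whereas raising `runs` lowers misses at the cost of additional passes, which is the trade-off that the FFV-adjustment “rule” quoted above is meant to balance.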

Note that H&O “buy in” guidance from models such as Zelinsky's (2008) “Target Acquisition Model” (TAM) or Pomplun et al.'s (2003) “Area Activation Model.” In these types of model, guidance is exclusively top-down: target- (template- or feature-) based. In fact, Zelinsky (2008) finds it “arguable whether a model that combines both top-down [target-template-based] and bottom-up [saliency] signals would be more successful than TAM in describing human behavior, at least in tasks in which the top-down target information [is] highly reliable” (p. 825). Such models, however, fail to address what determines target detection in search for (feature or feature conjunction) singleton targets, where there is no (reliable) target template to guide the search top-down (Müller et al. 1995; Weidner & Müller 2013). For example, is target “pop-out” based on a parallel attentive process operating over the whole display or on a pre-attentive, salience-based process? One interesting possibility is that, on TP trials, detection decisions are triggered directly by the salience map (consistent with studies showing pop-out detection with no or minimal target identity processing; e.g., Müller et al. 2004; Töllner et al. 2012b), with some process of parallel distractor rejection taking place on TA trials (e.g., Müller et al. 2007). On more difficult search trials, the pre-attentive guidance mechanism could direct the attentive process to sample an area that surrounds the location of the highest salience. Here, models such as H&O's may indeed add to the traditional item-based models.
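The idea of a priority map that sums dimension-specific feature-contrast signals and then directs the FFV to the region around its peak can be sketched as follows. This is an illustrative toy under stated assumptions, not the Competitive GS model or any published saliency model: local contrast is taken to be the mean absolute difference from the immediately neighbouring cells, top-down biasing is reduced to one weight per dimension, the FFV is a square of cells around the peak, and all names (`feature_contrast`, `priority_map`, `next_fixation`, `make_grid`) are hypothetical.

```python
def feature_contrast(feature_map):
    """Local contrast per cell: mean absolute difference from its neighbours.

    ASSUMPTION: a 3x3 neighbourhood on a grid with at least two cells.
    """
    rows, cols = len(feature_map), len(feature_map[0])
    contrast = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            diffs = [
                abs(feature_map[r][c] - feature_map[rr][cc])
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
                if (rr, cc) != (r, c)
            ]
            contrast[r][c] = sum(diffs) / len(diffs)
    return contrast


def priority_map(dimension_maps, weights):
    """Weighted sum of the contrast maps across feature dimensions.

    `weights` stands in for top-down (dimension-based) biasing.
    """
    contrasts = {dim: feature_contrast(fmap) for dim, fmap in dimension_maps.items()}
    first = next(iter(contrasts.values()))
    rows, cols = len(first), len(first[0])
    return [
        [sum(weights[dim] * contrasts[dim][r][c] for dim in contrasts)
         for c in range(cols)]
        for r in range(rows)
    ]


def next_fixation(priority, radius):
    """Return the peak location and the cells of a square FFV around it."""
    rows, cols = len(priority), len(priority[0])
    peak = max(
        ((r, c) for r in range(rows) for c in range(cols)),
        key=lambda rc: priority[rc[0]][rc[1]],
    )
    field = [
        (r, c)
        for r in range(max(0, peak[0] - radius), min(rows, peak[0] + radius + 1))
        for c in range(max(0, peak[1] - radius), min(cols, peak[1] + radius + 1))
    ]
    return peak, field


def make_grid(rows, cols, singleton, value=1.0):
    """Homogeneous grid (all zeros) with one deviating cell at `singleton`."""
    grid = [[0.0] * cols for _ in range(rows)]
    grid[singleton[0]][singleton[1]] = value
    return grid


if __name__ == "__main__":
    maps = {
        "color": make_grid(5, 5, (2, 3)),
        "orientation": make_grid(5, 5, (3, 1)),
    }
    # Weighting the color dimension more strongly favours the color singleton.
    print(next_fixation(priority_map(maps, {"color": 1.0, "orientation": 0.5}), 1))
```

Swapping the two weights moves the peak, and hence the sampled area, to the orientation singleton, which illustrates how a map computed pre-attentively can still be biased top-down. A larger `radius` would correspond to a larger FFV on easier searches.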

References

Duncan, J. & Humphreys, G. W. (1989) Visual search and stimulus similarity. Psychological Review 96:433–58. doi: 10.1037/0033-295X.96.3.433.

Humphreys, G. W. & Müller, H. J. (1993) SEarch via Recursive Rejection (SERR): A connectionist model of visual search. Cognitive Psychology 25:43–110. doi: 10.1006/cogp.1993.1002.

Itti, L. & Koch, C. (2001) Computational modelling of visual attention. Nature Reviews Neuroscience 2(3):194–203.

Liesefeld, H. R., Moran, R., Usher, M., Müller, H. J. & Zehetleitner, M. (2016) Search efficiency as a function of target saliency: The transition from inefficient to efficient search and beyond. Journal of Experimental Psychology: Human Perception and Performance 42(6):821–36. doi: 10.1037/xhp0000156.

Moran, R., Zehetleitner, M., Liesefeld, H. R., Müller, H. J. & Usher, M. (2016) Serial vs. parallel models of attention in visual search: Accounting for benchmark RT-distributions. Psychonomic Bulletin and Review 23:1300–15. doi: 10.3758/s13423-015-0978-1.

Moran, R., Zehetleitner, M., Müller, H. J. & Usher, M. (2013) Competitive guided search: Meeting the challenge of benchmark RT-distributions. Journal of Vision 13(8):24. doi: 10.1167/13.8.24.

Müller, H. J., Heller, D. & Ziegler, J. (1995) Visual search for singleton feature targets within and across feature dimensions. Perception and Psychophysics 57:1–17. doi: 10.3758/BF03211845.

Müller, H. J., Humphreys, G. W. & Donnelly, N. (1994) SEarch via Recursive Rejection (SERR): Visual search for single and dual form-conjunction targets. Journal of Experimental Psychology: Human Perception and Performance 20:235–58. doi: 10.1037/0096-1523.20.2.235.

Müller, H. J., Krummenacher, J. & Heller, D. (2004) Dimension-specific inter-trial facilitation in visual search for pop-out targets: Evidence for a top-down modulable visual short-term memory effect. Visual Cognition 11:577–602. doi: 10.1080/13506280344000419.

Müller, H. J., von Mühlenen, A. & Geyer, T. (2007) Top-down inhibition of distractors in parallel visual search. Perception and Psychophysics 69:1373–88. doi: 10.3758/BF03192953.

Pomplun, M., Reingold, E. M. & Shen, J. Y. (2003) Area activation: A computational model of saccadic selectivity in visual search. Cognitive Science 27:299–312. doi: 10.1016/S0364-0213(03)00003-X.

Töllner, T., Rangelov, D. & Müller, H. J. (2012b) How the speed of motor-response decisions, but not focal-attentional selection, differs as a function of task set and target prevalence. Proceedings of the National Academy of Sciences of the United States of America 109:E1990–99. doi: 10.1073/pnas.1206382109.

Weidner, R. & Müller, H. J. (2013) Dimensional weighting in cross-dimensional singleton conjunction search. Journal of Vision 13(3):25. doi: 10.1167/13.3.25.

Zelinsky, G. J. (2008) A theory of eye movements during target acquisition. Psychological Review 115:787–835. doi: 10.1037/a0013118.