nature.com

Three-dimensional object recognition is viewpoint dependent - Nature Neuroscience

️Gauthier, Isabel
️Sat Aug 01 1998

References

Marr, D. Vision: A Computational Investigation into the Human Representation and Processing of Visual Information (Freeman, San Francisco, 1982).
Google Scholar
Biederman, I. Psychol. Rev. 94, 115–147 ( 1987).
Article Google Scholar
Biederman, I. & Gerhardstein, P. C. J. Exp. Psychol. Hum. Percept. Perform. 19, 1162–1182 ( 1993).
Article CAS Google Scholar
Hayward, W. G. & Tarr, M. J. J. Exp. Psychol. Hum. Percept. Perform. 23, 1511–1521 ( 1997).
Article CAS Google Scholar
Jolicoeur, P. Mem. Cognit. 13, 289–303 ( 1985).
Article CAS Google Scholar
Bülthoff, H. H. & Edelman, S. Proc. Natl. Acad. Sci. USA 89, 60–64 (1992).
Article Google Scholar
Humphrey, G. K. & Khan, S. C. Can. J. Psychol. 46, 170–190 (1992).
Article CAS Google Scholar
Tarr, M. J. Psychonomic Bull. Rev. 2, 55–82 ( 1995).
Article CAS Google Scholar
Tarr, M. J., Bülthoff, H. H., Zabinski, M. & Blanz, V. Psychol. Sci. 8, 282–289 (1997).
Article Google Scholar
Perrett, D. I. et al. Proc. R. Soc. Lond. B 223, 293–317 (1985).
Article CAS Google Scholar
Logothetis, N. K., Pauls, J. & Poggio, T. Curr. Biol. 5, 552– 563 (1995).
Article CAS Google Scholar
Poggio, T. & Edelman, S. Nature 343, 263–266 (1990).
Article CAS Google Scholar
Loftus, G. R. & Masson, M. E. J. Psychonomic Bull. Rev. 1, 476–490 (1994).
Article CAS Google Scholar

Acknowledgements

This research was supported by an Air Force Office of Scientific Research Grant. We thank Jay Servidea and Jaymz Rosoff for their assistance in running the psychophysical studies.

Author information

Authors and Affiliations

Department of Cognitive and Linguistic Sciences, Brown University, Box 1978, Providence, 02912, Rhode Island, USA
Michael J. Tarr
Department of Psychology, University of Massachusetts, 100 Morrissey Boulevard, Boston, 02125-3393, Massachusetts, USA
Pepper Williams
Department of Psychology, University of Wollongong, Northfields Avenue, Wollongong, 2552, NSW , Australia
William G. Hayward
Department of Psychology, Yale University, PO Box 208205, New Haven, 06520-8205 , Connecticut, USA
Isabel Gauthier

Authors

Michael J. Tarr
You can also search for this author in PubMed Google Scholar
Pepper Williams
You can also search for this author in PubMed Google Scholar
William G. Hayward
You can also search for this author in PubMed Google Scholar
Isabel Gauthier
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michael J. Tarr.

Supplementary information

220 Yale University undergraduates participated in exchange for course credit or cash payment; numbers of participants in each individual experiment are given in Fig. 2. Line drawings of three viewpoints of ten single geons were scanned into a Macintosh computer from Biederman and Gerhardstein's³ Fig. 12 for use in experiments 1a and 2a. Shaded images of the same 10 geons (Fig. 1; available at ftp://www.cog.brown.edu/pub/tarrlab/stim/geons8.sit.hqx), matching the views used by Biederman and Gerhardstein as closely as possible, were created and rendered using CAD software for use in all other experiments. All stimuli subtended approximately 70 by 70 of visual angle when viewed by participants approximately 60 cm from the computer screen. All experiments were performed on Macintosh computers using RSVP software (http://psych.umb.edu/rsvp).

Each sequential matching trial (experiments 1a-e) consisted of the following sequence of events: blank screen for 1000 ms, fixation cross for 500 ms, object image for 200 ms, mask (consisting of random combinations of features from the line drawings or shaded images) for 750 ms, second object image for 100 ms, mask for 500 ms. The trial timed out 1500 ms later if no response was given. In experiments 1a, b and d, a two-key procedure was used, in which the participant pressed the 'V' key on the computer keyboard if the two images were of the same geon (even if shown in a different viewpoint), or the 'M' key if the two images were of different geons. In experiments 1c and e, a go/no-go procedure was used, in which the participant pressed the space bar if the two objects were the same or allowed the trial to time out otherwise. Participants in experiments 1d and e were informed after each trial of their response time and the accuracy of their response (as shown in Fig. 2, this feedback lowered overall response times, but had little impact on viewpoint effects). For 'same' trials, each of the three views of each geon was presented three times as the first object in a trial and three times as the second object, producing three trials in which the two views were identical, four trials in which they differed by 45°, and two trials in which they differed by 90°. In these and subsequent experiments, trials were presented in a different random order for each participant.

Match-to-sample experiments (2a-c) each consisted of 10 blocks of trials. Each block included an initial presentation of a target object for 20 s, followed by 18 test trials in the following sequence: blank screen for 250 ms, fixation cross for 500 ms, object for 150 ms, mask for 500 ms, time out 1500 ms later if no response was given. Participants memorized the initial target, then pressed the space bar if the test object on subsequent trials matched the target object (even if the viewpoint varied), or let the trial time out if the test and target objects did not match. Participants in experiment 2c received feedback of the same sort as experiments 1d and e. The target object was always shown in the 0° viewpoint. Every test block included three same trials in each of the three viewpoints (0°, 45°, and 90°) of the target object and 9 'different' trials (one for each of the non-target objects).

In experiment 3, participants learned labels (given in Fig. 1) for the 10 geons, then performed trials in which they verbally named test images as quickly as possible. Participants first studied a sheet of paper showing the 10 objects with their names, then performed 20 trials in which each object was shown twice, along with its name, on the computer screen. Four practice blocks of five trials with each object followed, in which participants saw objects without their names and spoke the names. Objects were always shown in the 0° viewpoint during practice trials. Participants then performed two test blocks of six trials with each object, distributed equally between the 0°, 45°, and 90° viewpoints. All practice and test trials consisted of a 500 ms blank screen, 500 ms fixation cross and an object that stayed on the screen until the participant responded or until 2500 ms had elapsed. Response times were recorded via the voice trigger, but accuracy was not recorded. (The experimenter observed the first few participants, and found that accuracy was almost always perfect.)

Rights and permissions

About this article

Cite this article

Tarr, M., Williams, P., Hayward, W. et al. Three-dimensional object recognition is viewpoint dependent. Nat Neurosci 1, 275–277 (1998). https://doi.org/10.1038/1089

Download citation

Received: 04 May 1998
Accepted: 04 June 1998
Issue Date: August 1998
DOI: https://doi.org/10.1038/1089