uphigh,downlow,tooslow - skemman.is
TRANSCRIPT
Lokaverkefni til BS-prófsí sálfræði
Up high, down low, too slowEffects of nature on directed attention differ between high and low spatial
frequencies
Brynjólfur Gauti Jónsson
Málfríður Guðný Kolbeinsdóttir
Júní 2018
Up high, down low, too slowEffects of nature on directed attention differ between high and low spatial frequencies
Brynjólfur Gauti Jónsson
Málfríður Guðný Kolbeinsdóttir
Lokaverkefni til BS-gráðu í sálfræði
Leiðbeinandi: Heiða María Sigurðardóttir
Sálfræðideild
Heilbrigðisvísindasvið Háskóla Íslands
Júní 2018
Ritgerð þessi er lokaverkefni til BS gráðu í sálfræði og er óheimilt að afrita ritgerðina á
nokkurn hátt nema með leyfi rétthafa.
©Brynjólfur Gauti Jónsson og Málfríður Guðný Kolbeinsdóttir, 2018
Prentun:
Reykjavík, Ísland 2018
Research has shown that viewing images of natural environments can promote attentional
restoration significantly better than urban environments, but results have been mixed. Other
studies have also found that visual complexity can have a mediating effect on one’s prefer-
ence for environments. The objective of this study was to investigate whether the restorative
effects of natural environments on directed attention are dependent on visual information con-
tained in high and low spatial frequency components. 60 university students and young adults
participated in this study. Participants performed three rounds of the Sustained Attention to
Response Test (SART) between which they were shown images according to their experi-
mental condition. The images portrayed natural or urban environments and were filtered to
contain either only high spatial frequencies or only low spatial frequencies. When looking
only at the high spatial frequency components of images, watching images of natural envi-
ronments led to worse performance on the SART than urban ones, a trend that was reversed
for lower spatial frequencies. Our results indicate that the restorative effect of natural envi-
ronments on directed attention might be confounded by greater visual complexities of natural
environments conveyed via high spatial frequencies.
3
Acknowledgements
This thesis was written under the guidance of Heiða M. Sigurðardóttir. We want to thank her
for her invaluable assistance and guidance. We would also like to thank Sabine Süsstrunk
for giving us permission to use École Polytechnique Fédérale de Lausanne’s RGB-NIR
Scene Dataset.
4
Contents
Abstract 3
Introduction 6
Method 13
Participants . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Stimulus Materials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Sustained Attention to Response Task (SART) . . . . . . . . . . . . . . . 15
Image Categorization Task . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Results 19
Descriptive Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Attentional restoration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Categorization of images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Discussion 22
References 25
Appendix 32
Removed Image . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Spatial Frequency Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Low Pass Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
High Pass Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
FFT Shift . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Inverse FFT Shift . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
5
As well as being an important part of daily life, constant attention is required in many lines of
work (e.g. airport radar technicians, security guards) and a small slip can have dire conse-
quences. Of particular importance to someone working under such pressure is their capacity
for directed attention: Inhibiting unimportant stimuli in order to focus on a particular task
(Theeuwes, Atchley, & Kramer, 2000), a mechanism that is susceptible to fatigue (Kaplan,
1995; Kaplan & Berman, 2010). Finding factors that help one recover from such fatigue can
be an important goal.
Previous research has shown that natural environments and interactions within
them are one such factor (Berman, Jonides, & Kaplan, 2008; Faber Taylor & Kuo, 2009;
Lee, Williams, Sargent, Williams, & Johnson, 2015; Lin, Tsai, Sullivan, Chang, & Chang,
2014), and the mere viewing of images which depict these environments can have restorative
benefits when compared to urban environments (Berto, 2005; Gamble, Howard Jr, & Howard,
2014). One theory set forth to explain this phenomenon is Attention Restoration Theory
(ART) (Kaplan, 1995; Kaplan & Berman, 2010).
The framework of ART is based on a distinction between voluntary and involun-
tary attention originally made by William James in 1892 (Kaplan, 1995). Voluntary atten-
tion is reminiscent of directed attention, a key component of top-down executive function-
ing, while involuntary attention shares similarities with more automatic bottom-up attention
(Theeuwes et al., 2000). To avoid confusion with James’s definitions of attention, ART uses
the terms directed attention and fascination, of which there are two types, soft and hard fasci-
nation (Kaplan & Berman, 2010). ART claims that directed attention and fascination are two
separate cognitive functions that work in tandem. Directed attention is utilized in day-to-day
tasks that require conscious effort, such as doing homework assignments, and is susceptible
to fatigue as one has to pay attention to a particular stimulus while actively inhibiting other
competing stimuli (James, 1983). Soft fascination is a gentle bottom-up attention that does
not interfere with other thoughts such as a calm meadow or a pleasant painting, whereas hard
fascination encompasses perceptions that preclude any other thoughts such as athletic or vio-
lent events (Kaplan & Berman, 2010). According to ART, activating soft fascination allows
fatigued supplies of directed attention to be renewed (Kaplan, 1995). A mainstay of ART is
that natural environments often incite this sense of fascination, giving directed attention time
to rest (Kaplan, 1995; Kaplan & Berman, 2010). But what is it about nature that gives it this
restorative property, even when it is only experienced through images?
6
According to ART, so-called restorative environments provide an opportunity for
soft fascination, or reflection, thus motivating involuntary attention and helping people re-
cover from directed attentional fatigue (Kaplan, 1995). Restorative environments are defined
as those environments that possess four distinct qualities. First, they must provide an escape
from fatiguing daily activities (sense of being away). Second, there must be elements that
effortlessly catch attention (soft fascination). Third, individuals must feel immersed in the
environment (sense of extent). Fourth and finally, the environment must be compatible with
a person’s current goals (compatibility). Natural environments often have all of these com-
ponents, and exposure to them could allow a person to recover from directed attention fatigue
(Kaplan, 1995; Kaplan & Berman, 2010).
Berto (2005) used the sustained attention to response task (SART; Robertson,
Manly, Andrade, Baddeley, & Yiend, 1997) to measure participants’ ability to sustain di-
rected attention before and after viewing three different sets of images. Conditions involved
supposed restorative environments (trees, lakes etc.), non-restorative environments (build-
ings, cars etc.), or geometric figures. Only participants who viewed restorative environments
showed significant improvements on the SART test (Berto, 2005).
A study by Berman (2008) included two experiments. The former had participants
perform the digit span backward task (DSB), an attentionally demanding task that depends
on directed-attention abilities (Wechsler, Coalson, & Raiford, 1997) before and after a walk
in an environment that was either urban or natural. Only participant who walked in the natu-
ral environment showed improvements on the DSB task. In another experiment, participants
merely watched images of either natural or urban environments on a computer screen between
tasks, consisting of the DSB as well as the attentional network test (ANT), a tool used to test
three networks of attention (alerting, orienting and executive control) (Fan, McCandliss,
Sommer, Raz, & Posner, 2002). Results were comparable, and interestingly, only the execu-
tive portions of the ANT task were improved, which is consistent with the theoretical effects
of nature on directed attention (Berman et al., 2008). A study by Gamble (2014), showed
the same improvements in the executive portions of the ANT, this time also in older adults
(Gamble et al., 2014).
In Faber (2009), the attentional performance of children diagnosed with ADHD
significantly improved after walking in a park compared to downtown and neighborhood
7
walks (Faber Taylor & Kuo, 2009). Restorative effects have also been seen when natural envi-
ronments are partly integrated into urban environments and can impact attentional restoration
even without participants being aware of it. This is seen in studies by Lin (2014), where partic-
ipants watching images of streetscapes only improved on the DSB task when the streetscapes
contained trees, even if participants were not aware of them (brief flashes of trees), and in
Lee (2015) where university students taking a 40 second break on a green-roof (a building
with a garden on its roof) made significantly fewer errors on the SART test than students
who took a break on a bare concrete roof (Lee et al., 2015; Lin et al., 2014).
Thus it would seem that there is evidence of a mechanism that restores attention
and interacts preferably with natural environments, but independent systematic reviews by
Ohly et al. (2016) and Pullin et al. (2010) found mixed results. In their review, Pullin et
al. (2010) analyzed effect sizes from 5 different studies and found that effect sizes were non-
significant after adjusting for pre-test differences (Bowler, Buyung-Ali, Knight, & Pullin,
2010). Echoing these results, Ohly et al. (2016) pooled together results from 24 articles
finding significant effect sizes for some measures of attention but many others showed non-
significant effects (Ohly et al., 2016). In light of these results we might ask whether some
visual factors other than the semantic type of environment (nature vs. urban) are having
confounding effects on attentional restoration. To try to answer this question we look to
visual aesthetics and the study of visual complexity.
The study of visual aesthetics goes all the way back to the ancient Greeks and
their use of the golden ratio in architecture (Livio, 2002). Gustav Fechner (1886), a pioneer
of experimental aesthetics, suggested a statistical study of the subject, and among other things
had a special fondness for the golden ratio (Phillips, Norman, & Beers, 2010). In his book
Aesthetic Measure (1933) George Birkhoff suggested that the pleasure derived fromwatching
an object could be calculated via the function M = OC , where M is its aesthetic score, O is
a measure of the object’s order or harmony and C is its complexity or number of chaotic
elements. According to this measure, humans would prefer objects with greater harmony and
less complexity, which is supported by research showing that being exposed to more complex
or disorderly environments can have negative effects (Chae & Zhu, 2013; Heintzelman, Trent,
& King, 2013; Kotabe, 2014).
Recent research has found that natural environments are consistently rated as being
8
more disorderly than urban environments (Kotabe, Kardan, & Berman, 2017). If natural
environments are more disorderly than urban, we might expect preference for them to be
lower. Contrary to this, Kaplan et al. (1972) showed that while the visual complexity of
an environment is on its own a poor predictor of preference, visual complexity is predictive
of preference when this relationship is separately assessed within each environmental type
(natural/urban) (Kaplan & Wendt, 1972).
A more modern approach to visual complexity is fractal geometry and the frac-
tal dimension, which is a measure of the number of elements located in a space (Machado
et al., 2015). Van Den Berg (2016) utilized a method coined by Benoit Mandelbrot in 1981
in which images of natural and urban environments were cut into smaller pieces and mag-
nified to their original size. If a magnified image retained many different visible elements
its fractal complexity was considered to be higher (Mandelbrot, 2007). The purpose of this
study was to quantify any mediating effect of fractal complexity on preference for natural
environments over urban. Participants viewed the original images as well as two different
magnifications (400% and 1600%) and then answered statements intended to measure the
images’ complexity and restorative qualities. For unmagnified images there was no difference
in rated complexity between environmental types, but for magnified images natural environ-
ments were rated as significantly more complex. Natural images were consistently rated as
beingmore restorative, and this difference was significantly greater for magnified images. Re-
searcher also found that the effects of environmental type on perceived restorativeness greatly
decreased after correcting for the images’ complexity score, indicating a mediating effect of
complexity on the relationship between environment and restorativeness (Van den Berg, Joye,
& Koole, 2016).
In a recently published study, Kotabe et al. (2017) performed several experiments
to test what they call the nature-disorder paradox. To explain this paradox, they tested
three hypotheses, but found evidence supporting only one, the nature-trumps-disorder hy-
pothesis. According to this hypothesis, preference for nature simply overpowers our aversion
towards disorder. In their experiments, participants scored natural and urban environment
images on three scales: preference, naturalness, and disorder. In each experiment, natural-
ness was significantly and highly positively correlated with disorder as well as with pref-
erence. The correlation between disorder and preference was either low or insignificant, but
after adjusting preference scores by regressing them onto naturalness they found that disorder
9
was highly negatively correlated with this residual preference (Kotabe et al., 2017). Linear
models of preference regressed upon naturalness, disorder and their interaction showed the
same positive effects of nature and negative effects of disorder as well as negligible or non-
significant interaction effects. In support of their hypothesis the effect size of naturalness on
preference was significantly larger than that of disorder, though this difference was reduced
when adjusting for low-level visual features of the images implying an asymmetric role of
visual features in participants’ naturalness and disorder scoring. The researchers confirmed
these findings for a nature-trumps-disorder effect in a second experiment that was an exact
conceptual replication. When participants were unable to form semantic interpretations of
the environments, due to a random shuffling of the images’ pixels or rapid and horizontally
inverted presentations (50ms), this trumping effect of nature of disorder disappeared. Inter-
estingly, when participants were shown merely nouns conveying a range of natural and urban
semantics (e.g. office, park, lake), the results from the first two experiments resurfaced and
showed remarkably similar correlations and linear models. What’s more, the effect sizes of
naturalness and disorder for the noun experiment were almost identical to the effect sizes
from the first experments calculated after adjusting for low-lever visual features, indicating
that scene semantics explained a large portion of this nature-trumps-disorder effect (Kotabe
et al., 2017).
Among other important visual features of scenes affecting our aesthetic prefer-
ences are lighting (Masuda & Nascimento, 2013) and colour (Palmer & Schloss, 2010). In
their study Kardan et al. (2015) sought to unearth the connections between these visual fea-
tures and the perceived naturalness of environments as well as our preference for them. To
do this, the researchers had participants rate scene images by perceived naturalness as well as
preference and tracked their reaction times (how long it took for them to give a rating). Re-
searchers then regressed these naturalness scores onto images’ visual features, leaving them
with two kinds of naturalness scores, which they called low-level naturalness (the variance
captured by their model) and high-level or semantic naturalness (the non-modelled residuals
of the model). They then used these scores to predict participant’s preference of these environ-
ments. In support of this two-fold naming, the researchers found that low-level naturalness
better predicted preference when the reaction times were shortest (bottom-up judgement), but
semantic naturalness performed best when reaction times were longer (top-down judgement)
(Kardan et al., 2015). These two different effects are in agreement with their later research
10
(Kotabe et al., 2017). It is thus well documented that people thus have an aesthetical pref-
erence for natural environments over urban ones, and some underlying low-level as well as
high-level factors have been researched, but how does viewing pictures of nature help restore
our directed attention?
While ART is an influential theory, another competing theory, the psycho-
evolutionary theory (PET) by Ulrich (1983), also aims to explain differences in preference
and restorative properties of natural and urban environments (Ulrich, 1983). Like in Kaplan’s
ART, restorative effects are attributed to bottom-up processes that are activated by certain
types of environments. What separates these theories is that whereas Kaplan (1995) attributes
the restorative elements of nature to a replenishment of directed attention, Ulrich (1983)
views them as a product of early affective responses to the environment, what he calls an
initial affective response (like/dislike), that can then be modified by an individual’s cognitive
appraisal of the environment (Kaplan, 1995; Ulrich, 1983). Nonetheless, Valtchanov and
Ellard (2015) proposed that these theories could be two sides of the same coin, as the
mechanisms behind both are based on bottom-up processes that set in motion a restorative
process due to exposure to certain types of environments (Valtchanov & Ellard, 2015).
Based on functional neuroimaging research by Yue, Vessel and Biederman (2006), they
suggested a link between visual pathways and this potential reward system activated by what
Kaplan (1995) calls soft fascination but Ulrich (1983) named an initial affective response
(Biederman & Vessel, 2006; Valtchanov & Ellard, 2015; Yue, Vessel, & Biederman, 2006).
Yue, Vessel & Biederman (2006) found that viewing images of environments that
were rated as aesthetically pleasing led to greater neural activation in the ventral striatum, a
critical component in the brain’s reward system (Yager, Garcia, Wunsch, & Ferguson, 2015),
and parahippocampal cortex (PHC), a brain area involved in scene recognition (Epstein &
Ward, 2009; Park, Brady, Greene, & Oliva, 2011), than when viewing non-pleasing images,
and theorized that this was caused by cortical µ-opioid receptors in the ventral visual path-
ways (Biederman & Vessel, 2006; Yue et al., 2006). These receptors might mediate reward
processing and serve as a gateway to drug addiction in humans (Contet, Kieffer, & Befort,
2004; Nummenmaa et al., 2018), are sparse in the early stages of the visual pathway (V1 and
V2), but increase in density as one moves towards the PHC, an area with a high density of
µ-opioid receptors (Yue et al., 2006).
11
One area in the posterior part of the PHC, the parahippocampal place area (PPA),
has been shown to react more strongly to images of spatial scenes than of singular objects (Ep-
stein &Kanwisher, 1998). In particular the PPA responds strongly to the spatial boundary of a
scene, but does not differentiate between the objects one would expect to find inside the scene
(Park et al., 2011). It is widely acknowledged that our visual system extracts different infor-
mation from different spatial frequencies, low spatial frequencies (LSF), conveyed by faster
magnocellular pathways, carrying coarse information about a scene (e.g. spatial layout and
structure), while information from high spatial frequencies (HSF), conveyed more slowly by
parvocellular pathways, provide more fine-grained information (e.g. edges and outlines of
objects located within a scene) (Bar, 2004; Hegdé, 2008; Schyns & Oliva, 1994). In a series
of studies, Schyns & Oliva (see Oliva, 2005 for a review) found evidence for coarse-to-fine
sequencing, meaning that LSF information is processed faster and utilized in a top-downman-
ner to help process HSF information [Schyns & Oliva (1994); Oliva & Schyns (1997); Oliva
(2005)).
While neuroimaging studies have found that the PPA is differentially sensitive to
the spacial frequencies of a scene (see Kauffmann, Ramanoël, & Peyrin, 2014 for a review),
there are mixed results as to whether LSF or HSF cause greater activation (Peyrin, Baciu,
Segebarth, & Marendaz, 2004; Rajimehr, Devaney, Bilenko, Young, & Tootell, 2011; Zeid-
man, Mullally, Schwarzkopf, & Maguire, 2012). Meanwhile, Musel et al. (2014) showed
that the PPA shows greater activation to coarse-to-fine stimuli presentations, meaning that
stimuli were presented in rapid sequence ranging from primarily LSF to HSF (@ Schyns &
Oliva, 1994), than the other way around (Musel et al., 2014). Thus, it would seem that the
PPA responds to both LSF and HSF components of scenes but might process them differently.
In sum, research has seen positive effects of exposure to natural environments on
attentional restoration (Berman et al., 2008; Berto, 2005; Faber Taylor & Kuo, 2009; Gamble
et al., 2014; Lee et al., 2015; Lin et al., 2014) but not after pooling effect sizes (Ohly et
al., 2016) or adjusting them for pre-test differences (Bowler et al., 2010). In spite of these
mixed results there seems to be a trend where overall natural environments have a positive
effect but this might be confounded by other factors. Some research into these factors has
been done in the nature-trumps-disorder effect (Kotabe et al., 2017), the mediating effects
of fractal complexity on preference (Van den Berg et al., 2016), and distinctions between
low-level and high-level naturalness (Kardan et al., 2015). Building on this research and
12
the fact that HSF and LSF components are integral to visual perception as well as conveying
distinct information, the authors of this paper designed and performed an experiment to test for
differences in the effects of natural and urban environments on attentional restoration, based
on the spatial frequencies representing them. Higher frequencies show finer details of a scene
(i.e. edges and lines) without carrying with them its broad context. Based on the nature-
disorder-paradox and the fact that natural scenes are consistently rated as more disorderly
than urban ones, we might expect the positive effects of nature on attentional restoration to be
negatively confounded by this edge-based representation. If this effect of disorder is related
to a scene’s fractal complexity, the coarse information carried in LSF might show a weaker
negative confounding than in HSF.
Method
Participants
60 (M=32) participants with a mean age of 28 (σ=6) took part in the experiment. Subjects
were selected with a convenience sample and assigned double blindly into a 2x2 mixed facto-
rial experiment via quasi-random assignment where researchers had no knowledge or choice
over which participants were assigned to which blocks. More precisely, a list of random
block assignments was generated using the blockrand package for R according to which par-
ticipants were automatically assigned to one of four experimental conditions (Snow, 2013).
All participants saw both natural and urban environments but the order in which they were
presented was randomized to balance any ordering effect. This leaves two levels of spatial
filter, high pass or low pass, and two ordering levels, nature images first or urban images first.
Tests revealed no significant differences between participants in different ordering conditions
so they were pooled into two groups of 30 participants each: High pass filter (mean age=29,
sd.age = 7, n.males = 20), or low pass filter (mean age=27, sd.age =5, n.males = 12).
Participants in this study were volunteers and received no reward for participation.
13
Stimulus Materials
Images were downloaded from École Polytechnique Fédérale de Lausanne’s RGB-NIR Scene
Dataset (Brown & Süsstrunk, 2011). The original image set contains 477 images in 9 cate-
gories: country, field, forest, indoor, mountain, old building, street, urban, water. Out of these
477 images, 100 images depicting urban environments (indoor, old building, street, urban)
and another 100 depicting natural environments (country, field, forest, mountain, water)
were chosen as stimuli. Care was taken so that natural images did not include people and had
no discernible manmade structures, and urban images had no or limited natural features.
The dataset was preprocessed using the EBImage package for R (Pau, Fuchs, Skl-
yar, Boutros, & Huber, 2010). First the images were resized to 1280x800 pixel sizes and their
luminance histograms equalized followed by conversion to grayscale using CIE 1931 lumi-
nance weights (Fairman, Brill, & Hemmendinger, 1997). Research has shown that color is a
vital feature for scene categorization, especially for lower spatial frequencies (Oliva, 2005),
but images were grayscaled for this study since the colour for HSF images can be completely
different from LSF images which could create undue biases between groups. The grayscaled
images were then converted to the frequency domain using a fast Fourier transform and fil-
tered by calculating their Hadamard product with equally sized 2D radial kernels having peaks
at the middle of their respective arrays, and containing numbers between 0 and 1. The radial
kernels can be seen in figure 1 as well as example pre-processed stimuli from the RGB-NIR
Scene Dataset (Brown& Süsstrunk, 2011). The high pass filter was calculated with a standard
deviation of s = 0.02, while the low-pass filter was calculated by creating a matrix of ones
and subtracting from it a kernel with standard deviation s = 0.1. R scripts used for image
filtering can be found in the appendix, and contains snippets of code posted to StackOverflow
by rayryeng (rayryeng, 2016). The final stimulus set consisted of 400 images (100 LSF ur-
ban images, 100 LSF nature images, 100 HSF urban images and 100 HSF nature images)
in 1280x800 pixel size that were stretched out to fill a 1920x1080 computer screen.
14
High Pass Low Pass None
Ker
nels
Urb
anN
atur
al
Filter type
Env
iron
men
t
0
1/2
1
Kernel V
alues
Figure 1: Example stimuli and radial kernels. Images are from École Polytechnique Fédéralede Lausanne’s RGB-NIR Scene Dataset (Brown and Süsstrunk, 2011).
Measures
Sustained Attention to Response Task (SART)
SART (Sustained Attention to Response Task), as its name implies, measures a participant’s
ability to sustain directed attention, and is predictive of everyday lapses of attention in partic-
ipants with brain injury as well as control groups (Robertson et al., 1997). SART is a continu-
ous performance paradigm where participants must press a key whenever a non-target stimu-
lus appears (go trials) but withhold presses when a target appears (no-go trials). Even though
the test is short and easy to understand it offers little chance for practice effects, although par-
ticipants might form a response pattern where they oscillate between lower response times
with higher error rates and higher response times with lower error rates (Peebles & Bothell,
2004). In order to do well on the test, participants must stay vigilant to keep themselves from
developing an automatic response and pressing when the no-go target appears. This coupled
with the temporal rhythm of the test and random placement of no-go targets makes the test
very cognitively fatiguing.
The application of SART used in this experiment was based on a python version by
Cary Stothart, but adapted for the current study’s needs (Stothart, 2017). The stimuli used for
15
the SART were digits ranging from 1 to 9, all shown equally often in five different font sizes,
with the digit 3 being the target. This means that go trials (8 out of 9 trials) far outweighed
no-go trials (1 out of 9 trials). Each SART block consisted of of 270 trials, thereof 27 no-go
trials and 216 go trials, where nine digits in five different font sizes were shown six times each.
The font sizes used were 1.20, 1.80, 2.35, 2.50, 3.00 corresponding to height in centimeters
on the screen. For each block a list of all 270 trials was prepared and then randomly shuffled.
Each digit was presented for 250 ms followed by a 900 ms mask consisting of an X enclosed
in a ring with radius 1.50 cm. Participants thus had a window of 1150ms to respond positively
by pressing the space-bar after which the response was recorded as a negative and the next
trial began.
D-prime D-prime, or d’, calculates participants’ sensitivity to stimuli, a higher d’ indicating
greater sensitivity. Put more formally it quantifies the distance between the means of the
noise distribution and the signal distribution in units of standard deviations. It is calculated
via the formula: d′ = Φ−1(H) − Φ−1(F), where Φ is a function that projects z-scores to
their respective probabilities H is the hit rate and F is the false-alarm rate of a participant.
For the purpose of calculations this study assumes thatH and F are normally distributed with
equal standard deviations (Stanislaw & Todorov, 1999). Concerning this study, a hit (true
positive) is defined as a digit other than 3 being shown and the participant pressing the space-
bar. Similarly a false alarm (false positive) occurs when the participant presses the space-bar
while the digit 3 is shown.
C criterion The C criterion, or c, measures response bias in participants, a higher c indicat-
ing a bias towards responding positively (pressing the space-bar more often). It is defined
as the distance between the criterion and the intersection point of the signal and noise distribu-
tions, often called the neutral point. It is calculated as: c = −Φ−1(H)+Φ−1(F)2 , or the negative
average of the quantiles corresponding to the false alarm rates’ and hit rates’ z-scores (Stanis-
law & Todorov, 1999).
Reaction Time In this study reaction time is measured in milliseconds and is simply the
time between a stimulus appearing on the screen and the participant pressing the specified
keyboard button. Evaluating reaction times alongwith signal detectionmeasures has been rec-
16
ommended for the SART since it has demonstrated a sizeable speed-accuracy trade-off [Hel-
ton, Weil, Middlemiss, & Sawers (2010); Manly, Robertson, Galloway, & Hawkins (1999);
Robertson et al. (1997); Shalgi, O’Connell, Deouell, & Robertson (2007)).
Image Categorization Task
Since the filtering of the images could cause a bias in participants’ ability to semantically
categorize the environments shown, all participants performed an image categorization task.
In this task each participant was shown 100 images sampled without replacement from the
full set of 400 images and had to make a two alternative forced choice as to whether an natural
or man-made environment was depicted. This means that participants saw both high pass and
low pass filtered images regardless of their experimental condition. Each image was shown
for one second after which participants had an unlimited amount of time to answer, pressing
the n button for nature and m button for man-made.
Procedure
The study was approved by the National Bioethics Committee and the Icelandic Data Protec-
tion Authority. Participants met the researchers at the University of Iceland in a laboratory
provided by the Department of Psychology. After hearing a short description of the study and
beingmade aware of any possible complications participants whowished to partake gave their
written consent.
The experiment took place in a dark, closed room and was programmed in Python
using the PsychoPy software package (Peirce, 2007) and executed on a 27’ ASUS ROG
PG279 monitor, with refresh rate 60 Hz and resolution 1920x1080. A researcher first read
a prewritten text to introduce the experiment after which participants were seated approxi-
mately 60 cm from the computer screen and started the experiment. Further instructions were
shown on the computer screen. Participants were asked to give equal weight to responding
quickly and responding correctly.
Participants were randomly assigned a slot with precomputed quasi-random block
assignment. Half of the participants were shown high spatial frequency (HSF) images be-
tween SART tests and the other half were shown low spatial frequency (LSF) images. All
17
participants were shown both natural and urban scenes but the order in which they were shown
was randomized to balance out any ordering effect. Participants went through a SART prac-
tice block consisting of 18 trials (16 go and 2 no-go). A press on the spacebar key was
required for go trials. Upon completion of the practice block, the researcher answered any
remaining questions before leaving the participant alone in the closed room. Participants had
unlimited time to read preliminary instructions but afterwards had only preprogrammed fixed
times to watch images and read short messages.
Each participant completed three blocks of the SART. Following the first and sec-
ond SART blocks, a series of randomly shuffled images were shown for 3 s each. Half of
the participants saw 100 urban images after the first SART block and 100 natural images
after the second block. The remaining participants viewed natural images first and then ur-
ban images. Images shown to a participant had all been filtered to contain either only HSF
or LSF information, depending on the participants’ subgroup assignment. After completing
three blocks of the SART and going through both image conditions, participants could stretch
their arms before starting the final part of the study. A random sample of 100 images from
the full 400 images (both environments, both spatial frequency conditions) was chosen and
participants were told to categorize them into natural or urban environments. Each image
was shown for 1 s and participants made a two-alternative forced-choice between natural and
urban environments. Figure 1 shows the experimental layout for each of the four groups.
SART 1
High Pass
NatureSART 2
High Pass
UrbanSART 3
SceneCategorization
SART 1
High Pass
UrbanSART 2
High Pass
NatureSART 3
SceneCategorization
SART 1Low Pass
Nature
SART 2Low Pass
Urban
SART 3Scene
Categorization
SART 1Low Pass
Urban
SART 2Low Pass
Nature
SART 3Scene
Categorization
Group 4
Group 3
Group 2
Group 1
Figure 2: Experimental layout. All participants performed three rounds of SART andwatchedtwo types of images before performing the categorization task.
18
Results
Descriptive Statistics
Statistical analyses were performed in the RStudio IDE, version 1.1.447 (RStudio Team,
2016), for R, version 3.5.0 (R Core Team, 2018). First, speed-accuracy trade-off was esti-
mated by correlating total errors and mean response times for each participant over all non-
practice SART blocks.A sizeable negative correlation between errors and reaction times was
seen, r = −.81, 95% CI [−.88, −.71], t(58) = −10.66, p < .001. This indicates a strong
speed-accuracy trade-off which agrees with previous research (Robertson et al., 1997).
Following this, participants’ post-test scores were adjusted for pre-test differences,
pre-test scores being scores from the first SART block performed before any images were
shown, and post-test scores being scores from the second and third SART blocks. Adjust-
ments were performed by subtracting each participant’s pre-test score from each of their post-
test scores, transforming the three dependent variables, d’, c and RT (reaction time), into the
new variables,∆d’,∆c and∆RT. Table 1 shows descriptive statistics for these variables. Pos-
itive means for ∆d’ in all groups indicates a general improvement on the SART test, though
high standard deviations indicate a number of scores below zero as well. For∆d’, semblance
of a pattern can be seen where nature had a more positive effect for the LSF group but ur-
ban environments had a more positive effect for the HSF group. Interestingly, the means are
almost identical although mirrored across spatial frequency conditions.
Table 1: Means for dependent variables with standard deviations in parentheses
Δd’ Δc ΔRT
Filter Nature Urban Nature Urban Nature Urban
Low Pass 0.32 (0.85) 0.2 (0.95) 0.05 (0.31) 0.02 (0.31) -0.02 (0.06) -0.02 (0.06)High Pass 0.21 (0.76) 0.38 (0.73) -0.03 (0.22) 0.06 (0.32) 0 (0.06) 0.01 (0.05)Note:Δ stands for participants’ scores subtracting pre-condition scores
Attentional restoration
A 2 (spatial frequency; between subjects) x 2 (environment; within subjects) repeated mea-
sures ANOVA was performed to test for differences in attentional restoration. Independent
19
variables were the filter type (high pass/low pass) and environment type (natural/urban) of
images as well as their interaction. Dependent variables were ∆d’, ∆c and ∆RT. After the
ANOVA tests, effects were quantified by contrasting the estimated marginal means (EMM)
for environments within each filter and testing the null hypothesis that this contrast was equal
to zero (H0 : M = ∆nature − ∆urban = 0 and H1 : M = ∆nature − ∆urban ̸= 0)
The effects of environmental type and spatial frequency on ∆d’ showed no signif-
icant main effects of filter type F(1, 58) = 0.03, p = 0.867, or environment F(1, 58) = 0.15,
p = 0.701. Interaction effects were significant, F(1, 58) = 4.08, p = 0.048. Pairwise con-
trasts for the LSF group showed no significant difference in effects (M = 0.118, SE = 0.102,
p = 0.253). Interestingly, for the HSF group the effect was greater as well as reversed but
non-significant (M = -0.174, SE = 0.102, p = 0.094), indicating the possibility that urban
environments had a greater positive effect on signal sensitivity.
The effects of environmental type and spatial frequency on ∆c showed non-
significant main effects of filter type, F(1, 58) = 0.08, p = 0.782, and environment F(1,
58) = 1.01, p = 0.319, but nearly significant interaction effects, F(1,58) = 3.50, p = 0.066.
Pairwise contrasts showed for the LSF group no significant effect difference (M = 0.026,
SE = 0.042, p = 0.543). Again, the difference in effects was greater and reversed for the
HSF group, but now also significant (M = -0.085, SE = 0.042, p = 0.046). This implies
that participants were significantly more biased towards responding positively (pressing the
space-bar) after watching images of natural environments than urban.
For ∆RT, there were non-significant main effects of filter type, F(1, 58) = 2.19,
p = .144, and environment, F(1, 58) = 0.73, p = .395, as well as non-significant interaction
effects F(1, 58) = 0.40, p = .530.Since all effects were far from significant no contrasting
tests were performed.
In sum, tests showed no significant main effects of spatial frequency or environ-
ment but peculiarly their interactions showed significant results for c and nearly significant
ones for d’. It seems that the effect of natural environments compared to urban environments
on participant sensitivity is reversed between spatial frequency groups, being positive for LSF
but negative for HSF. The interaction effects of environment and spatial frequency on Δd’ and
Δc are visualized on figure 2.
20
∆c ∆d'
urban nature urban nature
0.1
0.2
0.3
0.4
0.5
−0.05
0.00
0.05
0.10M
argi
nal M
eans
Spatial Filter
Low Pass High Pass
Figure 3: Predicted values for dependent variables by mixed ANOVA models. Interestinglythe effects of environment are reversed between spatial frequency conditions.
Categorization of images
To exclude the possibility of bias caused by some images’ environments being harder to recog-
nize than others, all participants categorized 100 images each into one of four groups,HSF ur-
ban, HSF nature, LSF urban or LSF nature. Of all 6000 categorization trials, 5892 (98.2%)
were categorized correctly. As seen on table 3 there is no easily discernible pattern of error
between the image groups.
Table 2: Percent correctly categorized images
Image Filter
Environment High Pass Low Pass
Nature 98.1% 97.5%Urban 98.7% 98.6%
Significance tests were performed to quantify any systematic differences. Since
each participant categorized all four image groups the tests performed were 2 (environment;
within subjects) x 2 (spatial frequency; within subjects) repeated measures ANOVA. The
dependent variable was percent correct responses and the independent variables were the type
of filter applied (high pass/low pass) and environment shown (nature/urban). There were no
significant main effects for spatial frequency, F(1, 59) = 0.79, p = 0.377, but the main effect
of environment was nearly significant, F(1, 59) = 3.69, p = 0.059. Interaction effects were
non-significant, F(1, 59) = 0.15, p = 0.699. To investigate this effect of environment, images
21
were sorted by decreasing error rates and the images with the highest error rates compared to
each other.
After sorting the images by error rates the hardest image was found to have an error
rate of 47% while the second 20%. It turned out that the anomalous image was a high pass
filtered image of a garden with fences and buildings in the background. Another significance
test was performed after removing this image. The main effects were now non-significant for
environment, F(1, 59) = 1.67, p = .201, and filter, F(1, 59) = 2.66, p = .108, as well as a
non-significant interaction effect F(1,59) = 1.21, p = .275, implying that the difference in
sorting-difficulty were caused by this image.
In sum, after removing a stray natural image showing several manmade objects,
results showed that participants were adept at recognizing which environments were depicted,
regardless of the type of filter applied. The removed image can be seen in figure 4 in the
appendix.
Discussion
Results from this experiment showed that, when looking at only the high spatial frequency
(HSF) component of images, watching images of natural environments lead to worse perfor-
mance on the SART than after watching images of urban environments, an effect that was
not seen for the low spatial frequency (LSF) group. This was evidenced by the fact that
viewing images of natural environments had a more negative effect on c than urban ones,
i.e. caused participants to be more biased towards pressing the spacebar, as seen by signifi-
cant pairwise differences in estimated marginal means (EMM). Effects on d’, a measure of
sensitivity to stimuli, showed the same trend as for c, although ANOVA tests were significant
but not pairwise EMM comparisons. All together, these results show that in the HSF group,
participants who viewed images of natural environments where biased towards pressing the
space-bar more often, but were also worse at discriminating between stimuli. Since the SART
test places high demand on sustained directed attention so that participants can keep from de-
veloping an automatic pattern of pressing the space-bar, these results would suggest that HSF
natural images had a smaller restorative effect on directed attention than HSF urban images.
The reversal in the effects of environment type between filter groups was also
22
interesting. In the HSF group urban environments had a more positive effect than nature but
in the LSF group the effects of natural environments, although not significant, were more
positive. This might conform with the research by Kotabe et al. (2017) into how one can
have a positive preference for natural over urban environments, while disliking disorderly
environments, even though natural environments are consistently rated as more disorderly
(Kotabe et al., 2017). To explain this phenomenon, they came up with the nature-trumps-
disorder hypothesis, which states that the positive effects of nature on preference simply
overpower the negative effects of disorder (Kotabe et al., 2017). In this study, it seems that
the negative effects of disorder overpowered the positive effects of nature in the HSF group,
but not in the LSF group.
One possible explanation of this difference could be the images’ fractal complexity,
or the number of distinct objects found within them. Van Den Berg et al. (2016) found that
images’ fractal complexity had a highly mediating effect on their restorative effects, as rated
by participants (Van den Berg et al., 2016). Images’ LSF components contain few distinct
objects since they convey no outlines or edges, but the HSF components on the other hand
carry most of the information about object outlines.
Research has found evidence to support the restorative effects of natural environ-
ments on directed attention, as is theorized by Attentional Restoration Theory (ART) [Berman
et al. (2008); Lin et al. (2014); Faber Taylor & Kuo (2009); Lee et al. (2015)), even when
only experienced through images (Berman et al., 2008; Berto, 2005; Gamble et al., 2014), but
systematic reviews have found results to be mixed (Bowler et al., 2010; Ohly et al., 2016). It
is not entirely clear what aspects of natural environments are behind this effect. It might be
attributed to a positive affective response to the environment (Ulrich, 1983), a stimulation of
involuntary attention by soft fascination (Kaplan, 1995; Kaplan & Berman, 2010), or it might
have something to do with the visual features of the environment causing us to receive reward
from µ-opioid receptors in our ventral visual pathways (Biederman &Vessel, 2006; Yue et al.,
2006). Whatever the underlying factors may be, more research is required to identify them.
To our knowledge, this is the first study researching how natural and urban envi-
ronments affect directed attention when broken into distinct spatial frequency components.
The fact that effects differ so much by spatial frequencies promotes further research into this
subject, and knowledge of these effects could be of aid to architects and urban planners. If
23
our results are true, optimal restorative effects might by achieved when the HSF components
of urban environments are be mixed with the LSF components of natural environments.
These are only speculations as of yet and should not be directly applied without
further research. Participants in this study are not representative of the population as theywere
chosen by a convenience sample consisting mostly of university students and young adults.
Moreover, applications to real world scenarios are limited since this study utilized grayscale
images. Colour has been shown to be a vital feature in scene recognition, especially when
conveyed through LSF components of images (Oliva, 2005). Attribution of our results to the
nature-disorder-paradox are as of yet theoretical, as the images were not rated for observed
disorder.
In conclusion, this study has shown that the restorative effects of nature on directed
attention are not set in stone, and might rely heavily on the information conveyed by different
spatial frequencies. Further research is required to investigate whether this phenomenon is
linked to the nature-disorder-paradox or some other effect.
24
References
Bar, M. (2004). Visual objects in context. Nature Reviews Neuroscience, 5(8), 617. https:
//doi.org/10.1038/nrn1476
Berman, M. G., Jonides, J., & Kaplan, S. (2008). The cognitive benefits of interacting
with nature. Psychological Science, 19(12), 1207–1212. https://doi.org/10.1111/j.
1467-9280.2008.02225.x
Berto, R. (2005). Exposure to restorative environments helps restore attentional capacity.
Journal of Environmental Psychology, 25(3), 249–259. https://doi.org/10.1016/j.
jenvp.2005.07.001
Biederman, I., & Vessel, E. A. (2006). Perceptual pleasure and the brain: A novel theory
explains why the brain craves information and seeks it through the senses. American
Scientist, 94(3), 247–253. Retrieved from http://www.jstor.org/stable/27858773
Bowler, D. E., Buyung-Ali, L. M., Knight, T. M., & Pullin, A. S. (2010). A systematic review
of evidence for the added benefits to health of exposure to natural environments. BMC
Public Health, 10(1), 456. https://doi.org/10.1186/1471-2458-10-456
Brown, M., & Süsstrunk, S. (2011). Multi-spectral sift for scene category recognition. In
Computer vision and pattern recognition (cvpr), 2011 ieee conference on (pp. 177–
184). IEEE. https://doi.org/10.1109/CVPR.2011.5995637
Chae, B., & Zhu, R. (2013). Environmental disorder leads to self-regulatory failure. Journal
of Consumer Research, 40(6), 1203–1218. https://doi.org/10.1086/674547
Contet, C., Kieffer, B. L., & Befort, K. (2004). Mu opioid receptor: A gateway to drug
addiction. Current Opinion in Neurobiology, 14(3), 370–378. https://doi.org/10.
1016/j.conb.2004.05.005
Epstein, R. A., & Ward, E. J. (2009). How reliable are visual context effects in the parahip-
pocampal place area? Cerebral Cortex, 20(2), 294–303. https://doi.org/10.1093/
cercor/bhp099
Epstein, R., &Kanwisher, N. (1998). A cortical representation of the local visual environment.
Nature, 392(6676), 598. https://doi.org/10.1038/33402
25
Faber Taylor, A., & Kuo, F. E. (2009). Children with attention deficits concentrate better after
walk in the park. Journal of Attention Disorders, 12(5), 402–409. https://doi.org/10.
1177/1087054708323000
Fairman, H. S., Brill, M. H., & Hemmendinger, H. (1997). How the cie 1931 color-
matching functions were derived from wright-guild data. Color Research &
Application: Endorsed by Inter-Society Color Council, the Colour Group (Great
Britain), Canadian Society for Color, Color Science Association of Japan,
Dutch Society for the Study of Color, the Swedish Colour Centre Foundation,
Colour Society of Australia, Centre Français de La Couleur, 22(1), 11–23.
https://doi.org/10.1002/(SICI)1520-6378(199702)22:1<11::AID-COL4>3.0.CO;2-7
Fan, J., McCandliss, B. D., Sommer, T., Raz, A., & Posner, M. I. (2002). Testing the ef-
ficiency and independence of attentional networks. Journal of Cognitive Neuro-
science, 14(3), 340–347. https://doi.org/10.1162/089892902317361886
Gamble, K. R., Howard Jr, J. H., & Howard, D. V. (2014). Not just scenery: Viewing nature
pictures improves executive attention in older adults. Experimental Aging Research,
40(5), 513–530. https://doi.org/10.1080/0361073X.2014.956618
Hegdé, J. (2008). Time course of visual perception: Coarse-to-fine processing and beyond.
Progress in Neurobiology, 84(4), 405–439. https://doi.org/10.1016/j.pneurobio.
2007.09.001
Heintzelman, S. J., Trent, J., & King, L. A. (2013). Encounters with objective coherence and
the experience of meaning in life. Psychological Science, 24(6), 991–998. https:
//doi.org/10.1177/0956797612465878
Helton, W. S., Weil, L., Middlemiss, A., & Sawers, A. (2010). Global interference and spa-
tial uncertainty in the sustained attention to response task (sart). Consciousness and
Cognition, 19(1), 77–85. https://doi.org/10.1016/j.concog.2010.01.006
James, W. (1983). Talks to teachers on psychology and to students on some of life’s ideals
(Vol. 12). Harvard University Press.
Kaplan, S. (1995). The restorative benefits of nature: Toward an integrative framework.
Journal of Environmental Psychology, 15(3), 169–182. https://doi.org/10.1016/
26
0272-4944(95)90001-2
Kaplan, S., & Berman, M. G. (2010). Directed attention as a common resource for executive
functioning and self-regulation. Perspectives on Psychological Science, 5(1), 43–
57. https://doi.org/10.1177/1745691609356784
Kaplan, S., & Wendt, J. S. (1972). Preference and the visual environment: Complexity and
some alternatives. Environmental Design: Research and Practice, 6, 75–76. https:
//doi.org/10.1.1.471.472
Kardan, O., Demiralp, E., Hout, M. C., Hunter, M. R., Karimi, H., Hanayik, T., …Berman, M.
G. (2015). Is the preference of natural versus man-made scenes driven by bottom–up
processing of the visual features of nature? Frontiers in Psychology, 6, 471. https:
//doi.org/10.3389/fpsyg.2015.00471
Kauffmann, L., Ramanoël, S., & Peyrin, C. (2014). The neural bases of spatial frequency
processing during scene perception. Frontiers in Integrative Neuroscience, 8, 37.
https://doi.org/10.3389/fnint.2014.00037
Kotabe, H. P. (2014). The world is random: A cognitive perspective on perceived disorder.
Frontiers in Psychology, 5, 606. https://doi.org/10.3389/fpsyg.2014.00606
Kotabe, H. P., Kardan, O., & Berman, M. G. (2017). The nature-disorder paradox: A percep-
tual study on how nature is disorderly yet aesthetically preferred. Journal of Experi-
mental Psychology: General, 146(8), 1126. https://doi.org/10.1037/xge0000321
Lee, K. E., Williams, K. J., Sargent, L. D., Williams, N. S., & Johnson, K. A. (2015).
40-second green roof views sustain attention: The role of micro-breaks in
attention restoration. Journal of Environmental Psychology, 42, 182–189.
https://doi.org/10.1016/j.jenvp.2015.04.003
Lin, Y.-H., Tsai, C.-C., Sullivan, W. C., Chang, P.-J., & Chang, C.-Y. (2014). Does awareness
effect the restorative function and perception of street trees? Frontiers in Psychology,
5, 906. https://doi.org/10.3389/fpsyg.2014.00906
Livio, M. (2002). The golden ratio: The story of phi, the extra ordinary [ie extraordinary]
number of nature, art and beauty. Review.
Machado, P., Romero, J., Nadal, M., Santos, A., Correia, J., & Carballal, A. (2015). Com-
27
puterized measures of visual complexity. Acta Psychologica, 160, 43–57. https:
//doi.org/doi.org/10.1016/j.actpsy.2015.06.005
Mandelbrot, B. B. (2007). Scalebound or scaling shapes: A useful distinction in the visual
arts and in the natural sciences. Leonardo, 40(5), 494–496. https://doi.org/10.2307/
1574481
Manly, T., Robertson, I. H., Galloway, M., & Hawkins, K. (1999). The absent mind:: Further
investigations of sustained attention to response. Neuropsychologia, 37(6), 661–670.
https://doi.org/10.1016/S0028-3932(98)00127-4
Masuda, O., & Nascimento, S. M. (2013). Best lighting for naturalness and preference. Jour-
nal of Vision, 13(7), 4–4. https://doi.org/10.1167/13.7.4
Musel, B., Kauffmann, L., Ramanoël, S., Giavarini, C., Guyader, N., Chauvin, A., & Peyrin,
C. (2014). Coarse-to-fine categorization of visual scenes in scene-selective cortex.
Journal of Cognitive Neuroscience, 26(10), 2287–2297. https://doi.org/10.1162/
jocn_a_00643
Nummenmaa, L., Saanijoki, T., Tuominen, L., Hirvonen, J., Tuulari, J. J., Nuu-
tila, P., & Kalliokoski, K. (2018). µ-opioid receptor system mediates re-
ward processing in humans. Nature Communications, 9(1), 1500. https:
//doi.org/10.1038/s41467-018-03848-y
Ohly, H., White, M. P., Wheeler, B. W., Bethel, A., Ukoumunne, O. C., Nikolaou, V., &
Garside, R. (2016). Attention restoration theory: A systematic review of the attention
restoration potential of exposure to natural environments. Journal of Toxicology and
Environmental Health, Part B, 19(7), 305–343. https://doi.org/10.1080/10937404.
2016.1196155
Oliva, A. (2005). Gist of the scene. In Neurobiology of attention (pp. 251–256). Elsevier.
https://doi.org/10.1016/B978-012375731-9/50045-8
Oliva, A., & Schyns, P. G. (1997). Coarse blobs or fine edges? Evidence that information di-
agnosticity changes the perception of complex visual stimuli. Cognitive Psychology,
34(1), 72–107. https://doi.org/10.1006/cogp.1997.0667
Palmer, S. E., & Schloss, K. B. (2010). An ecological valence theory of human color pref-
28
erence. Proceedings of the National Academy of Sciences, 107(19), 8877–8882.
https://doi.org/10.1073/pnas.0906172107
Park, S., Brady, T. F., Greene, M. R., & Oliva, A. (2011). Disentangling scene content from
spatial boundary: Complementary roles for the parahippocampal place area and lat-
eral occipital complex in representing real-world scenes. Journal of Neuroscience,
31(4), 1333–1340. https://doi.org/10.1523/JNEUROSCI.3885-10.2011
Pau, G., Fuchs, F., Sklyar, O., Boutros, M., & Huber, W. (2010). EBImage—an r package
for image processing with applications to cellular phenotypes. Bioinformatics, 26(7),
979–981. https://doi.org/10.1093/bioinformatics/btq046
Peebles, D., & Bothell, D. (2004). Modelling performance in the sustained attention to
response task. In ICCM (pp. 231–236). Retrieved from https://www.researchgate.
net/profile/David_Peebles/publication/221548288_Modelling_Performance_in_
the_Sustained_Attention_to_Response_Task/links/09e4150a7b15b0018b000000/
Modelling-Performance-in-the-Sustained-Attention-to-Response-Task.pdf
Peirce, J. W. (2007). PsychoPy—psychophysics software in python. Journal of Neuro-
science Methods, 162(1-2), 8–13. https://doi.org/10.1016/j.jneumeth.2006.11.017
Peyrin, C., Baciu, M., Segebarth, C., & Marendaz, C. (2004). Cerebral regions and hemi-
spheric specialization for processing spatial frequencies during natural scene recogni-
tion. An event-related fMRI study. Neuroimage, 23(2), 698–707. https://doi.org/10.
1016/j.neuroimage.2004.06.020
Phillips, F., Norman, J. F., & Beers, A. M. (2010). Fechner’s aesthetics revisited. Seeing and
Perceiving, 23(3), 263–271. https://doi.org/10.1163/187847510X516412
Rajimehr, R., Devaney, K. J., Bilenko, N. Y., Young, J. C., & Tootell, R. B. (2011). The
“parahippocampal place area” responds preferentially to high spatial frequencies in hu-
mans and monkeys. PLoS Biology, 9(4), e1000608. https://doi.org/10.1371/journal.
pbio.1000608
rayryeng. (2016). How to write fftshift and ifftshift in r? [Closed]. StackOverflow question.
https://stackoverflow.com/questions/38230794/how-to-write-fftshift-and-ifftshift-in-r;
StackOverflow.
29
R Core Team. (2018). R: A language and environment for statistical computing. Vi-
enna, Austria: R Foundation for Statistical Computing. Retrieved from https://www.
R-project.org/
Robertson, I. H., Manly, T., Andrade, J., Baddeley, B. T., & Yiend, J. (1997). Oops!’:
Performance correlates of everyday attentional failures in traumatic brain injured
and normal subjects. Neuropsychologia, 35(6), 747–758. https://doi.org/10.1016/
S0028-3932(97)00015-8
RStudio Team. (2016). RStudio: Integrated development environment for r. Boston, MA:
RStudio, Inc. Retrieved from http://www.rstudio.com/
Schyns, P. G., & Oliva, A. (1994). From blobs to boundary edges: Evidence for time-and
spatial-scale-dependent scene recognition. Psychological Science, 5(4), 195–200.
https://doi.org/10.1111/j.1467-9280.1994.tb00500.x
Shalgi, S., O’Connell, R. G., Deouell, L. Y., & Robertson, I. H. (2007). Absent
minded but accurate: Delaying responses increases accuracy but decreases
error awareness. Experimental Brain Research, 182(1), 119–124. https:
//doi.org/10.1007/s00221-007-1054-5
Snow, G. (2013). Blockrand: Randomization for block random clinical trials. Retrieved
from https://CRAN.R-project.org/package=blockrand
Stanislaw, H., & Todorov, N. (1999). Calculation of signal detection theory measures. Be-
havior Research Methods, Instruments, & Computers, 31(1), 137–149. https://doi.
org/10.3758/BF03207704
Stothart, C. (2017). A python version of the sustained attention to response task (sart) as
detailed by robertson et al. (1997). GitHub repository. https://github.com/cstothart/
sustained-attention-to-response-task; GitHub.
Theeuwes, J., Atchley, P., & Kramer, A. F. (2000). On the time course of top-down and
bottom-up control of visual attention. Control of Cognitive Processes: Attention
and Performance XVIII, 105–124.
Ulrich, R. S. (1983). Aesthetic and affective response to natural environment. In Behav-
ior and the natural environment (pp. 85–125). Springer. https://doi.org/10.1007/
30
978-1-4613-3539-9_4
Valtchanov, D., & Ellard, C. G. (2015). Cognitive and affective responses to natural
scenes: Effects of low level visual properties on preference, cognitive load
and eye-movements. Journal of Environmental Psychology, 43, 184–195.
https://doi.org/10.1016/j.jenvp.2015.07.001
Van den Berg, A. E., Joye, Y., & Koole, S. L. (2016). Why viewing nature is more fascinating
and restorative than viewing buildings: A closer look at perceived complexity. Urban
Forestry & Urban Greening, 20, 397–401. https://doi.org/10.1016/j.ufug.2016.10.
011
Wechsler, D., Coalson, D. L., & Raiford, S. E. (1997). WAIS-iii: Wechsler adult intelligence
scale. Psychological Corporation San Antonio, TX.
Yager, L. M., Garcia, A. F., Wunsch, A. M., & Ferguson, S. M. (2015). The ins and outs of
the striatum: Role in drug addiction. Neuroscience, 301, 529–541. https://doi.org/
10.1016/j.neuroscience.2015.06.033
Yue, X., Vessel, E. A., & Biederman, I. (2006). The neural basis of preference for natural
scenes. Journal of Vision, 6(6), 474–474. https://doi.org/10.1167/6.6.474
Zeidman, P., Mullally, S. L., Schwarzkopf, D. S., & Maguire, E. A. (2012). Exploring the
parahippocampal cortex response to high and low spatial frequency spaces. Neurore-
port, 23(8), 503. https://doi.org/10.1097/WNR.0b013e328353766a
31
Appendix
Removed Image
Figure 4: The photo with the highest error rate. The presence of benches, fences and abuilding may have caused participants to categorize it as an urban image.
32
Spatial Frequency FilteringLow Pass Filtering
low_pass_filt <- function(im, s = 0.1) {source("fftshift.R")source("ifftshift.R")require(EBImage)m <- dim(im)[1]n <- dim(im)[2]imff <- fft(im)imff <- fftshift(imff)x <- 1:m - ((m + 1)/2)y <- 1:n - ((n + 1)/2)z <- outer(x, y, FUN = function(X, Y) (sqrt(X * X + Y * Y)))z <- z/max(z)z <- exp(-z^2/(2 * s^2))imff <- imff * zimff <- ifftshift(imff)imffinv <- fft(imff, inverse = TRUE)imffinv <- Re(imffinv)imffinv <- imffinv * sum(im)/sum(imffinv)as.Image(imffinv)
}
33
High Pass Filtering
hi_pass_filt <- function(im, s = 0.02) {source('fftshift.R')source('ifftshift.R')require(EBImage)m <- dim(im)[1]n <- dim(im)[2]imff <- fft(im)imff <- fftshift(imff)x <- 1:m - ((m + 1)/2)y <- 1:n - ((n + 1)/2)z <- outer(x, y, FUN = function(X, Y) (sqrt(X*X + Y*Y)))z <- z/max(z)z <- 1 - exp(-z^2/(2*s^2))imff <- imff * zimff <- ifftshift(imff)imffinv <- fft(imff, inverse = TRUE)imffinv <- Re(imffinv)imffinv <- imffinv * sum(im) / sum(imffinv)as.Image(imffinv)
}
34
FFT Shift
fftshift <- function(img_ff, dim = -1) {rows <- dim(img_ff)[1]cols <- dim(img_ff)[2]swap_up_down <- function(img_ff) {
rows_half <- ceiling(rows/2)return(rbind(img_ff[((rows_half+1):rows), (1:cols)],
img_ff[(1:rows_half), (1:cols)]))}swap_left_right <- function(img_ff) {
cols_half <- ceiling(cols/2)return(cbind(img_ff[1:rows, ((cols_half+1):cols)],
img_ff[1:rows, 1:cols_half]))}if (dim == -1) {
img_ff <- swap_up_down(img_ff)return(swap_left_right(img_ff))
}else if (dim == 1) {
return(swap_up_down(img_ff))}else if (dim == 2) {
return(swap_left_right(img_ff))}else {
stop("Invalid dimension parameter")}
}
35
Inverse FFT Shift
ifftshift <- function(img_ff, dim = -1) {rows <- dim(img_ff)[1]cols <- dim(img_ff)[2]swap_up_down <- function(img_ff) {
rows_half <- floor(rows/2)return(rbind(img_ff[((rows_half+1):rows),
(1:cols)], img_ff[(1:rows_half), (1:cols)]))}swap_left_right <- function(img_ff) {
cols_half <- floor(cols/2)return(cbind(img_ff[1:rows, ((cols_half+1):cols)],
img_ff[1:rows, 1:cols_half]))}if (dim == -1) {
img_ff <- swap_left_right(img_ff)return(swap_up_down(img_ff))
}else if (dim == 1) {
return(swap_up_down(img_ff))}else if (dim == 2) {
return(swap_left_right(img_ff))}else {
stop("Invalid dimension parameter")}
}
36