Learning 5
OVERVIEW
Enduring Issues in Learning
Classical Conditioning
• Elements of Classical Conditioning
• Establishing a Classically Conditioned Response
• Classical Conditioning in Humans
• Classical Conditioning Is Selective
Operant Conditioning
• Elements of Operant Conditioning
• Establishing an Operantly Conditioned Response
• A Closer Look at Reinforcement
• Punishment
• Learned Helplessness
• Shaping Behavioral Change Through Biofeedback
Factors Shared by Classical and Operant Conditioning
• The Importance of Contingencies
• Extinction and Spontaneous Recovery
• Stimulus Control, Generalization, and Discrimination
• New Learning Based on Original Learning
• Summing Up
Cognitive Learning
• Latent Learning and Cognitive Maps
• Insight and Learning Sets
• Learning by Observing
• Cognitive Learning in Nonhumans
ISBN 1-256-37427-X
Understanding Psychology, Ninth Edition, by Charles G. Morris and Albert A. Maisto. Published by Prentice Hall. Copyright © 2010 by Pearson Education, Inc.
What do the following anecdotes have in common?
• In Mozambique, a giant pouched rat the size of a cat scurries across a field, pauses, sniffs the air, turns, sniffs again, and then begins to scratch at the ground with her forepaws. She has discovered yet another land mine buried a few inches underground. After a brief break for a bit of banana and a pat or two from her handler, she scurries off again to find more land mines.
• In the middle of a winter night, Adrian Cole, 4 years old and three feet tall, put on his jacket and boots and drove his mother’s car to a nearby video store. When he found the store closed, he drove back home. Since he was driving very slowly with the lights off and was also weaving a bit, he understandably attracted the attention of police officers, who followed him. When he got home, he collided with two parked cars and then backed into the police cruiser! When the police asked him how he learned to drive, he explained that his mother would put him on her lap while she drove and he just watched what she did.
• “I just can’t stand to eat shrimp. I don’t like the smell of it, or the sight of it. Once when I was young, I had some for dinner while vacationing at the beach and it made me sick for the rest of the week. Now just the thought of it disgusts me.”
The common element in all these stories, and the topic of this chapter, is learning. Although most people associate learning with classrooms and studying for tests, psychologists define it more broadly. To them, learning occurs whenever experience or practice results in a relatively permanent change in behavior or in potential behavior. This definition includes all the examples previously mentioned, plus a great many more. When you remember how to park a car or where the library water fountain is, you are showing a tiny part of your enormous capacity for learning.
Human life would be impossible without learning; it is involved in virtually everything we do. You could not communicate with other people or recognize yourself as human if you were unable to learn. In this chapter, we explore several kinds of learning. One type is learning to associate one event with another. When pouched rats associate the smell of TNT with receiving food, or when a person associates the sight or smell of a food with illness, they are engaging in two forms of learning called operant and classical conditioning. Because psychologists have studied these forms of learning so extensively, much of this chapter is devoted to them. But making associations isn’t all there is to human learning. Our learning also involves the formation of concepts, theories, ideas, and other mental abstractions. Psychologists call it cognitive learning, and we discuss it at the end of this chapter.
Our tour of learning begins in the laboratory of a Nobel Prize–winning Russian scientist at the turn of the 20th century. His name is Ivan Pavlov, and his work is helping to revolutionize the study of learning. He has discovered classical conditioning.
ENDURING ISSUES IN LEARNING
This chapter addresses how humans and other animals acquire new behaviors as a result of their experiences. Thus, it bears directly on the enduring issue of Stability versus Change (the extent to which organisms change over the course of their lives). The events that shape learning not only vary among different individuals (diversity–universality) but also are influenced by an organism’s inborn characteristics (nature–nurture). Finally, some types of learning can affect our physical health by influencing how our body responds to disease (mind–body).
LEARNING OBJECTIVES
• Define learning.
• Describe the elements of classical conditioning, distinguishing between unconditioned stimulus, unconditioned response, conditioned stimulus, and conditioned response. Describe the process of establishing a classically conditioned response, including the effect of intermittent pairing.
• Provide examples of classical conditioning in humans, including desensitization therapy. Explain the statement that “classical conditioning is selective” and illustrate with examples of conditioned taste aversions.
CLASSICAL CONDITIONING
How did Pavlov discover classical conditioning?
The Russian physiologist Ivan Pavlov (1849–1936) discovered classical (or Pavlovian) conditioning almost by accident. In this form of learning, a response naturally elicited by one stimulus comes to be elicited by a different, formerly neutral stimulus. Pavlov was studying digestion, which begins when saliva mixes with food in the mouth. While measuring how much saliva dogs produce when given food, he noticed that they began to salivate even before they tasted the food. The mere sight of food or the sound of his footsteps made them drool. This aroused Pavlov’s curiosity. How had the dogs learned to salivate to sights and sounds?
learning The process by which experience or practice results in a relatively permanent change in behavior or potential behavior.
classical (or Pavlovian) conditioning The type of learning in which a response naturally elicited by one stimulus comes to be elicited by a different, formerly neutral, stimulus.
unconditioned stimulus (US) A stimulus that invariably causes an organism to respond in a specific way.
unconditioned response (UR) A response that takes place in an organism whenever an unconditioned stimulus occurs.
conditioned stimulus (CS) An originally neutral stimulus that is paired with an unconditioned stimulus and eventually produces the desired response in an organism when presented alone.
conditioned response (CR) After conditioning, the response an organism produces when a conditioned stimulus is presented.
Figure 5–1 Pavlov’s apparatus for classically conditioning a dog to salivate. The experimenter sits behind a one-way mirror and controls the presentation of the conditioned stimulus (touch applied to the leg) and the unconditioned stimulus (food). A tube runs from the dog’s salivary glands to a vial, where the drops of saliva are collected as a way of measuring the strength of the dog’s response.
To answer this question, Pavlov sounded a bell just before presenting his dogs with food. A ringing bell does not usually make a dog’s mouth water, but after hearing the bell many times right before getting fed, Pavlov’s dogs began to salivate as soon as the bell rang. It was as if they had learned that the bell signaled the appearance of food; and their mouths watered on cue even if no food followed. The dogs had been conditioned to salivate in response to a new stimulus: the bell, which normally would not prompt salivation (Pavlov, 1927). Figure 5–1 shows one of Pavlov’s procedures in which the bell has been replaced by a touch to the dog’s leg just before food is given.
Elements of Classical Conditioning
How might you classically condition a pet?
Figure 5–2 diagrams the four basic elements in classical conditioning: the unconditioned stimulus, the unconditioned response, the conditioned stimulus, and the conditioned response. The unconditioned stimulus (US) is an event that automatically elicits a certain reflex reaction, which is the unconditioned response (UR). In Pavlov’s studies, food in the mouth was the unconditioned stimulus, and salivation to it was the unconditioned response. The third element in classical conditioning, the conditioned stimulus (CS), is an event that is repeatedly paired with the unconditioned stimulus. For a conditioned stimulus, Pavlov often used a bell. At first, the conditioned stimulus does not elicit the desired response. But eventually, after repeatedly being paired with the unconditioned stimulus, the conditioned stimulus alone comes to trigger a reaction similar to the unconditioned response. This learned reaction is the conditioned response (CR).
Classical conditioning has been demonstrated in virtually every animal species, even cockroaches, bees, and sheep (Abramson & Aquino, 2002; Johnson, Stanton, Goodlett, & Cudd, 2008; Krasne & Glanzman, 1995; Watanabe, Kobayashi, Sakura, Matsumoto, & Mizunami, 2003; Watanabe & Mizunami, 2006). You yourself may have inadvertently classically conditioned one of your pets. For instance, you may have noticed that your cat begins to purr when it hears the sound of the electric can opener running. For a cat, the taste and smell of food are unconditioned stimuli for a purring response. By repeatedly pairing the can opener whirring with the delivery of food, you have turned this sound into a conditioned stimulus that triggers a conditioned response.
Establishing a Classically Conditioned Response
If you once burned your finger on a match while listening to a certain song, why doesn’t that song now make you reflexively jerk your hand away?
As shown in Figure 5–3, it generally takes repeated pairings of an unconditioned stimulus and a cue before the unconditioned response eventually becomes a conditioned response. The likelihood or strength of the conditioned response increases each time these two stimuli are paired. This learning, however, eventually reaches a point of diminishing returns. The amount of each increase gradually becomes smaller, until finally no further learning occurs. The conditioned response is now fully established.
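The leveling-off described above can be illustrated with a small numerical sketch. It is not taken from the chapter: the update rule is borrowed from Rescorla-Wagner-style models of conditioning, and the names `acquisition_curve`, `learning_rate`, and `max_strength`, along with their values, are assumptions chosen only for illustration. On each pairing the response gains a fixed fraction of the distance still separating it from its maximum, so each gain is smaller than the one before.

```python
def acquisition_curve(n_trials, learning_rate=0.3, max_strength=1.0):
    """Return the CR strength after each CS-US pairing.

    Assumed rule (not from the chapter): every pairing closes a fixed
    fraction of the remaining gap between current and maximum strength.
    """
    strength = 0.0
    history = []
    for _ in range(n_trials):
        strength += learning_rate * (max_strength - strength)
        history.append(strength)
    return history


curve = acquisition_curve(10)
# Gain contributed by each trial; the first trial starts from zero strength.
gains = [curve[0]] + [later - earlier for earlier, later in zip(curve, curve[1:])]

for trial, (strength, gain) in enumerate(zip(curve, gains), start=1):
    print(f"trial {trial:2d}: strength={strength:.3f}  gain={gain:.3f}")
```

Printed out, the strength column climbs toward 1.0 while the gain column shrinks on every trial, which is the shape of the acquisition curve in Figure 5–3.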
It is fortunate that repeated pairings are usually needed for classical conditioning to take place (Barry Schwartz, 1989). There are always a lot of environmental stimuli present whenever an unconditioned stimulus triggers an unconditioned response. If conditioning occurred on the basis of single pairings, all these usually irrelevant stimuli would generate some type of CR. Soon we would be overwhelmed by learned associations. Because a number of pairings are usually needed to produce a conditioned response, only a cue consistently related to the unconditioned stimulus typically becomes a conditioned stimulus.
Figure 5–3 Response acquisition. At first, each pairing of the US and CS increases the strength of the response. After a number of trials, learning begins to level off; and eventually it reaches a point of diminishing returns. (Graph axes: number of trials, horizontal; increase in strength of CR, vertical.)
Figure 5–2 A model of the classical conditioning process. Before conditioning: the bell (CS) produces no response, but food (US) produces salivation (UR). During conditioning: the bell (CS) is followed by food (US), which produces salivation (UR). After conditioning: the bell (CS) alone produces salivation (CR).
desensitization therapy A conditioning technique designed to gradually reduce anxiety about a particular object or situation.
Desensitization therapy is based on the belief that we can overcome fears by learning to remain calm in the face of increasingly fear-arousing situations. Here people being desensitized to a fear of heights are able to swing high above the ground without panicking.
The spacing of pairings is also important in establishing a classically conditioned response. If pairings of the CS and US follow each other very rapidly, or if they are very far apart, learning the association is slower. If the spacing of pairings is moderate—neither too far apart nor too close together—learning occurs more quickly. It is also important that the CS and US rarely, if ever, occur alone. Pairing the CS and US only once in a while, called intermittent pairing, reduces both the rate of learning and the final strength of the learned response.
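The effect of intermittent pairing can be sketched with a toy simulation. It is an illustration under stated assumptions, not the chapter's own model: the rule is a Rescorla-Wagner-style update in which a trial with the US pulls response strength toward 1 and a trial with the CS alone pulls it toward 0. The function name `condition` and the `learning_rate` value are hypothetical, and the sketch does not model the spacing of pairings in time.

```python
def condition(trials, learning_rate=0.3):
    """Return final CR strength after a sequence of CS presentations.

    trials: iterable of booleans, True when the US accompanies the CS.
    Assumed rule (not from the chapter): paired trials move strength
    toward 1.0, CS-alone trials move it toward 0.0.
    """
    strength = 0.0
    for us_present in trials:
        target = 1.0 if us_present else 0.0
        strength += learning_rate * (target - strength)
    return strength


# Continuous pairing: the US follows the CS on every one of 20 trials.
continuous = condition([True] * 20)
# Intermittent pairing: the US follows the CS on every other trial only.
intermittent = condition([i % 2 == 0 for i in range(20)])

# Early learning is slower as well: compare the first four trials.
early_continuous = condition([True] * 4)
early_intermittent = condition([True, False, True, False])

print(f"continuous:   {continuous:.3f}")
print(f"intermittent: {intermittent:.3f}")
```

Under these assumptions the intermittently paired response both grows more slowly and settles at a lower level, in line with the statement above.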
Classical Conditioning in Humans
What is an example of classical conditioning in your own life?
Classical conditioning is as common in humans as it is in other animals. For example, some people learn phobias through classical conditioning. Phobias are intense, irrational fears of particular things or situations, such as spiders or flying. In Chapter 1, we discussed the study in which John Watson and his assistant, Rosalie Rayner, used classical conditioning to instill a phobia of white rats in a 1-year-old baby named Little Albert (J. B. Watson & Rayner, 1920). They started by pairing a loud noise (an unconditioned stimulus) with the sight of a rat. After a few pairings of the rat and the frightening noise, Albert would cry in fear at the sight of the rat alone.
Several years later, psychologist Mary Cover Jones demonstrated a way that fears can be unlearned by means of classical conditioning (M. C. Jones, 1924). Her subject was a 3-year-old boy named Peter who, like Albert, had a fear of white rats. Jones paired the sight of a rat with an intrinsically pleasant experience: eating candy. While Peter sat alone in a room, a caged white rat was brought in and placed far enough away so that the boy would not be frightened. At this point, Peter was given candy to eat. On each successive day, the cage was moved closer, after which Peter was given candy. Eventually, he showed no fear of the rat, even without any candy. By being repeatedly paired with a stimulus that evoked a pleasant emotional response, the rat had become a conditioned stimulus for pleasure.
In more recent times, psychiatrist Joseph Wolpe (1915–1997) adapted Jones’s method to the treatment of certain kinds of anxiety (Wolpe, 1973, 1990). Wolpe reasoned that because irrational fears are learned (conditioned), they could also be unlearned through conditioning. He noted that it is not possible to be both fearful and relaxed at the same time. Therefore, if people could be taught to relax in fearful or anxious situations, their anxiety should disappear. Wolpe’s desensitization therapy begins by teaching a system of deep-muscle relaxation. Then the person constructs a list of situations that prompt various degrees of fear or anxiety, from intensely frightening to only mildly so. A person with a fear of heights, for example, might construct a list that begins with standing on the edge of the Grand Canyon and ends with climbing two rungs on a ladder. While deeply relaxed, the person imagines the least distressing situation on the list first. If he or she succeeds in remaining relaxed, the person proceeds to the next item on the list, and so on until no anxiety is felt. In this way, classical conditioning is used to change an undesired reaction: A fear-arousing thought is repeatedly paired with a muscular state that produces calmness until eventually the formerly fearful thought no longer triggers anxiety. Desensitization therapy has been used successfully to treat a variety of disorders such as phobias and posttraumatic stress disorder (Morris, Kratochwill, Schoenfield, & Auster, 2008; S. M. Silver, Rogers, & Russell, 2008). More recently, desensitization therapy has taken on a new form using virtual reality simulation. For instance, a person with a fear of flying may learn to relax while in a flight simulator rather than actually aboard an airplane. Therapy using virtual reality desensitization is still in its infancy, but the early results are promising (Parsons & Rizzo, 2008).
intermittent pairing Pairing the conditioned stimulus and the unconditioned stimulus on only a portion of the learning trials.
preparedness A biological readiness to learn certain associations because of their survival advantages.
conditioned taste aversion Conditioned avoidance of certain foods even if there is only one pairing of conditioned and unconditioned stimuli.
Classical Conditioning Is Selective
Why are people more likely to develop a phobia of snakes than of flowers?
If people can develop phobias through classical conditioning, why don’t we acquire phobias of virtually everything that is paired with harm? For example, many people get shocks from electric sockets, but almost no one develops a socket phobia. Why should this be the case?
Psychologist Martin Seligman (1971) has offered an answer: The key, he says, lies in the concept of preparedness. Some things readily become conditioned stimuli for fear responses because we are biologically prepared to learn those associations. Among the common objects of phobias are heights, snakes, and the dark. In our evolutionary past, fear of these potential dangers probably offered a survival advantage, and so a readiness to form such fears may have become “wired into” our species.
Preparedness also underlies conditioned taste aversion, a learned association between the taste of a certain food and a feeling of nausea and revulsion. Conditioned taste aversions are acquired very quickly. It usually takes only one pairing of a distinctive flavor and subsequent illness to develop a learned aversion to the taste of that food. Readily learning connections between distinctive flavors and illness has clear benefits. If we can quickly learn which foods are poisonous and avoid those foods in the future, we greatly increase our chances of survival. Other animals with a well-developed sense of taste, such as rats and mice, also readily develop conditioned taste aversions, just as humans do (Chester, Lumeng, Li, & Grahame, 2003; Guitton, Klin, & Dudai, 2008).
Mind–Body: Classical Conditioning and the Immune System
In another example of classical conditioning in humans, researchers have devised a novel way to treat autoimmune disorders, which cause the immune system to attack healthy organs or tissues. Although powerful drugs can be used to suppress the immune system and thus reduce the impact of the autoimmune disorder, these drugs often have dangerous side effects, so they must be administered sparingly. The challenge, then, was to find a treatment that could suppress the immune system without damaging vital organs. Researchers discovered that they could use formerly neutral stimuli either to increase or to suppress the activity of the immune system (Hollis, 1997; Markovic, Dimitrijevic, & Jankovic, 1993). Here’s how it works: As US, the researchers use immune-suppressing drugs and pair them with a specific CS, such as a distinctive smell or taste. After only a few pairings of the drug (US) with the smell or taste (CS), the CS alone suppresses the immune system (the CR) without any dangerous side effects! In this case, classical conditioning works on the mind but ultimately affects the body. While the use of classical conditioning to treat autoimmune disorders shows promise, additional research is still necessary to validate its effectiveness and evaluate its potential application as a therapy to treat these disorders (Bovbjerg, 2003; Gregory Miller & Cohen, 2001). ■
Nature–Nurture: The Evolutionary Basis of Fear
To what extent does our evolutionary heritage condition our fears; and to what extent are fears the result of our experiences? Recent studies suggest that the two work in tandem (Mineka & Oehman, 2002). For example, some stimuli unrelated to human survival through evolution, but which we have learned to associate with danger, can serve as CSs for fear responses. Pictures of handguns and butcher knives, for example, are as effective as pictures of snakes and spiders in conditioning fear in some people (Lovibond, Siddle, & Bond, 1993). These studies suggest that preparedness may be the result of learning rather than evolution. Other studies have shown that people who do not suffer from phobias can rather quickly unlearn fear responses to spiders and snakes if those stimuli appear repeatedly without painful or threatening USs (Honeybourne, Matchett, & Davey, 1993). Thus, even if humans are prepared to fear these things, that fear can be overcome through conditioning. In other words, our evolutionary history and our personal learning histories interact to increase or decrease the likelihood that certain kinds of conditioning will occur. ■
A bird’s nervous system is adapted to remember sight–illness combinations, such as the distinctive color of a certain berry and subsequent food poisoning. In mammals, by contrast, taste–illness combinations are quickly and powerfully learned.
Seligman’s theory of preparedness argues that we are biologically prepared to associate certain stimuli, such as heights, the dark, and snakes, with fear responses. In our evolutionary past, fear of these potential dangers probably offered a survival advantage.
CHECK YOUR UNDERSTANDING
1. The simplest type of learning is called ____________ ____________. It refers to the establishment of fairly predictable behavior in the presence of well-defined stimuli.
2. Match the following in Pavlov’s experiment with dogs:
___ unconditioned stimulus    a. bell
___ unconditioned response    b. food
___ conditioned stimulus    c. salivating to bell
___ conditioned response    d. salivating to food
3. The intense, irrational fears that we call phobias can be learned through classical conditioning. Is this statement true (T) or false (F)?
4. A learned association between the taste of a certain food and a feeling of nausea is called ____________ ____________ ____________.
5. Teaching someone to relax even when he or she encounters a distressing situation is called ____________ ____________.
6. In the experiment with Little Albert, the unconditioned stimulus was __________ ___________.
Answers: 1. classical conditioning. 2. unconditioned stimulus—b; unconditioned response—d; conditioned stimulus—a; conditioned response—c. 3. T. 4. conditioned taste aversion. 5. desensitization therapy. 6. loud noises.
APPLY YOUR UNDERSTANDING
1. Which of the following are examples of classical conditioning?
a. eating when not hungry just because we know it is lunchtime
b. a specific smell triggering a bad memory
c. a cat running into the kitchen to the sound of a can opener
d. All of the above are examples of classical conditioning.
2. You feel nauseated when you read about sea scallops on a restaurant menu, because you once had a bad episode with some scallops that made you sick. For you in this situation, the menu description of the scallops is the
a. US.
b. CS.
c. CR.
Answers: 1. d. 2. b.
OPERANT CONDITIONING
How are operant behaviors different from the responses involved in classical conditioning?
LEARNING OBJECTIVES
• Explain how operant conditioning differs from classical conditioning.
• Explain the law of effect (the principle of reinforcement) and the role of reinforcers, punishers, and shaping in establishing an operantly conditioned response. Differentiate between positive reinforcers, negative reinforcers, and punishment. Explain the circumstances under which punishment can be effective and the drawbacks to using punishment.
• Explain what is meant by learned helplessness.
• Describe how biofeedback and neurofeedback can be used to change behavior.
Around the turn of the 20th century, while Pavlov was busy with his dogs, the American psychologist Edward Lee Thorndike (1874–1949) was using a “puzzle box,” or simple wooden cage, to study how cats learn (Thorndike, 1898). As illustrated in Figure 5–4, Thorndike confined a hungry cat in the puzzle box, with food just outside where the cat could see and smell it. To get to the food, the cat had to figure out how to open the latch on the box door, a process that Thorndike timed. In the beginning, it took the cat quite a while to discover how to open the door. But on each trial, it took the cat less time, until eventually it could escape from the box in almost no time at all. Thorndike was a pioneer in studying the kind of learning that involves making a certain response due to the consequences it brings. This form of learning has come to be called operant or instrumental conditioning. The pouched rat described at the opening of this chapter learned to find land mines through operant conditioning.
Elements of Operant Conditioning
What two essential elements are involved in operant conditioning?
One essential element in operant conditioning is emitted behavior. This is one way in which operant conditioning is different from classical conditioning. In classical conditioning, a response is automatically triggered by some stimulus, such as a loud noise automatically triggering fear. In this sense, classical conditioning is passive in that the behaviors are elicited by stimuli. However, this process is not true of the behaviors involved in operant conditioning. Thorndike’s cats spontaneously tried to undo the latch on the door of the box. You spontaneously wave your hand to signal a taxi to stop. You voluntarily put money into machines to obtain food. These and similar actions are called operant behaviors because they involve “operating” on the environment.
operant (or instrumental) conditioning The type of learning in which behaviors are emitted (in the presence of specific stimuli) to earn rewards or avoid punishments.
operant behaviors Behaviors designed to operate on the environment in a way that will gain something desired or avoid something unpleasant.
reinforcers Stimuli that follow a behavior and increase the likelihood that the behavior will be repeated.
punishers Stimuli that follow a behavior and decrease the likelihood that the behavior will be repeated.
law of effect (principle of reinforcement) Thorndike’s theory that behavior consistently rewarded will be “stamped in” as learned behavior, and behavior that brings about discomfort will be “stamped out.”
Figure 5–4 A cat in a Thorndike “puzzle box.” The cat can escape and be rewarded with food by tripping the bolt on the door. As the graph shows, Thorndike’s cats learned to make the necessary response more rapidly after an increasing number of trials. (Graph axes: number of trials, 5 to 25, horizontal; time in seconds, 50 to 150, vertical.)
Figure 5–5 A rat in a Skinner box. By pressing the bar, the rat releases food pellets into the box; this procedure reinforces its bar-pressing behavior.
A second essential element in operant conditioning is a consequence following a behavior. Thorndike’s cats gained freedom and a piece of fish for escaping from the puzzle boxes. Consequences like this one, which increase the likelihood that a behavior will be repeated, are called reinforcers. In contrast, consequences that decrease the chances that a behavior will be repeated are called punishers. Imagine how Thorndike’s cats might have acted had they been greeted by a large, snarling dog when they escaped from the puzzle boxes. Thorndike summarized the influence of consequences in his law of effect: Behavior that brings about a satisfying effect (reinforcement) is likely to be performed again, whereas behavior that brings about a negative effect (punishment) is likely to be suppressed. Contemporary psychologists often refer to the principle of reinforcement, rather than the law of effect, but the two terms mean the same thing.
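The law of effect can be illustrated with a minimal sketch in which each behavior has a numerical "tendency" and the chance of emitting it is its share of the total. Nothing here comes from Thorndike's data: the behavior names, the starting tendencies, and the `step` and `floor` values are hypothetical, chosen only to show reinforcers raising and punishers lowering the likelihood of the behavior they follow.

```python
def choice_probabilities(tendencies):
    """Convert behavior tendencies into probabilities of being emitted."""
    total = sum(tendencies.values())
    return {behavior: weight / total for behavior, weight in tendencies.items()}


def apply_consequence(tendencies, behavior, effect, step=0.5, floor=0.1):
    """Return updated tendencies after `behavior` meets a consequence.

    A reinforcer strengthens the behavior it follows; a punisher weakens
    it, but never below `floor` (an assumed lower bound).
    """
    updated = dict(tendencies)
    if effect == "reinforcer":
        updated[behavior] += step
    elif effect == "punisher":
        updated[behavior] = max(floor, updated[behavior] - step)
    else:
        raise ValueError(f"unknown effect: {effect!r}")
    return updated


# A hypothetical cat in a puzzle box, with three equally likely behaviors.
cat = {"trip_latch": 1.0, "scratch_walls": 1.0, "meow": 1.0}
before = choice_probabilities(cat)["trip_latch"]

# Tripping the latch is followed by food on five occasions.
for _ in range(5):
    cat = apply_consequence(cat, "trip_latch", "reinforcer")
after = choice_probabilities(cat)["trip_latch"]

# Scratching the walls is followed once by something unpleasant.
cat = apply_consequence(cat, "scratch_walls", "punisher")
punished = choice_probabilities(cat)["scratch_walls"]

print(f"trip_latch: {before:.2f} -> {after:.2f}; scratch_walls now {punished:.2f}")
```

In this toy setup the reinforced behavior comes to dominate and the punished one becomes rare, which is the pattern the law of effect describes.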
Establishing an Operantly Conditioned Response
How might an animal trainer teach a tiger to jump through a flaming hoop?
Because the behaviors involved in operant conditioning are voluntary ones, it is not always easy to establish an operantly conditioned response. The desired behavior must first be performed spontaneously in order for it to be rewarded and strengthened. Sometimes you can simply wait for this action to happen. Thorndike, for example, waited for his cats to trip the latch that opened the door to his puzzle boxes. Then he rewarded them with fish. But when there are many opportunities for making irrelevant responses, waiting can be slow and tedious. If you were an animal trainer for a circus, imagine how long you would have to wait for a tiger to decide to jump through a flaming hoop so you could reward it. One way to speed up the process is to increase motivation. Even without food in sight, a hungry animal is more active than a well-fed one and so is more likely, just by chance, to make the response you’re looking for. Another strategy is to reduce opportunities for irrelevant responses, as Thorndike did by making his puzzle boxes small and bare. Many researchers do the same thing by using Skinner boxes to train small animals. A Skinner box (named after B. F. Skinner, another pioneer in the study of operant conditioning) is a small cage with solid walls that is relatively empty, except for a food cup and an activating device, such as a bar or a button. (See Figure 5–5.) In this simple environment, it doesn’t take long for an animal to press the button that releases food into the cup, thereby reinforcing the behavior.
Usually, however, the environment cannot be controlled so easily; hence a different approach is called for. Another way to speed up operant conditioning is to reinforce successive approximations of the desired behavior. This approach is called shaping. To teach a tiger to jump through a flaming hoop, the trainer might first reinforce the animal simply for jumping up on a pedestal. After that behavior has been learned, the tiger might be reinforced only for leaping from that pedestal to another. Next, the tiger might be required to jump through a hoop between the pedestals to gain a reward. And finally, the hoop is set on fire, and the tiger must leap through it to be rewarded.
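The tiger example can be sketched as a simple procedure: reinforce only the behavior that matches the current step, and raise the criterion once that step has been learned. The chapter does not say when a step counts as learned, so the `needed` parameter (two reinforced performances) is an assumption, as are the function name `shape` and the sequence of observed behaviors.

```python
STEPS = [
    "jump onto pedestal",
    "leap from pedestal to pedestal",
    "jump through hoop",
    "jump through flaming hoop",
]


def shape(observed, steps=STEPS, needed=2):
    """Reinforce successive approximations of a target behavior.

    observed: behaviors the animal emits, in order.
    needed:   reinforced performances before a step counts as learned
              (an assumed criterion, not one given in the chapter).
    Returns (log, stage): log pairs each behavior with whether it was
    rewarded; stage is the number of steps mastered.
    """
    stage = 0
    successes = 0
    log = []
    for behavior in observed:
        # Only the current approximation earns a reward.
        rewarded = stage < len(steps) and behavior == steps[stage]
        log.append((behavior, rewarded))
        if rewarded:
            successes += 1
            if successes == needed:
                stage += 1  # raise the criterion to the next step
                successes = 0
    return log, stage


observed = [
    "pace the cage",
    "jump onto pedestal",
    "jump onto pedestal",
    "jump onto pedestal",  # already mastered, so no longer rewarded
    "leap from pedestal to pedestal",
    "leap from pedestal to pedestal",
    "jump through hoop",
    "jump through hoop",
    "jump through flaming hoop",
    "jump through flaming hoop",
]
log, stage = shape(observed)
for behavior, rewarded in log:
    print(f"{behavior:32s} {'reward' if rewarded else '-'}")
```

The third jump onto the pedestal goes unrewarded because the criterion has already moved on; that withdrawal of reward for an earlier approximation is what pushes behavior toward the final target.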
As in classical conditioning, the learning of an operantly conditioned response eventually reaches a point of diminishing returns. If you look back at Figure 5–4, you’ll see that the first few reinforcements produced quite large improvements in performance, as indicated by the rapid drop in time required to escape from the puzzle box. But each successive reinforcement produced less of an effect until, eventually, continued reinforcement brought no evidence of further learning. After 25 trials, for instance, Thorndike’s cats were escaping from the box no more quickly than they had been after 15 trials. The operantly conditioned response had then been fully established. Can operant conditioning influence human behavior? See “Applying Psychology: Modifying Your Behavior,” above, to learn about how you can use operant conditioning to modify your own behavior.
positive reinforcers Events whose presence increases the likelihood that ongoing behavior will recur.
negative reinforcers Events whose reduction or termination increases the likelihood that ongoing behavior will recur.
Remember that the new, more desirable behavior need not be learned all at once. You can use shaping or successive approximations to change your behavior bit by bit. A person who wants to become more sociable might start by giving rewards just for sitting next to another person in a classroom rather than picking an isolated seat. The person could then work up to rewarding increasingly sociable behaviors, such as first saying hello to another person, then striking up a conversation.
A Closer Look at Reinforcement
What is the difference between positive and negative reinforcement? What are some of the unintentional effects that reinforcement can have?
We have been talking about reinforcement as if all reinforcers are alike, but in fact this is not the case. Think about the kinds of consequences that would encourage you to perform some behavior. Certainly these include consequences that give you something positive, like praise, recognition, or money. But the removal of some negative stimulus is also a good reinforcer of behavior. When new parents discover that rocking a baby will stop the infant’s persistent crying, they sit down and rock the baby deep into the night; the removal of the infant’s crying is a powerful reinforcer.
These examples show that there are two kinds of reinforcers. Positive reinforcers, such as praise, add something rewarding to a situation, whereas negative reinforcers, such as
Skinner box A box often used in operant conditioning of animals; it limits the available responses and thus increases the likelihood that the desired response will occur.
shaping Reinforcing successive approximations to a desired behavior.
Modifying Your Own Behavior
Can you modify your own undesirable behaviors by using operant conditioning techniques? Yes, but first you must observe your own actions, think about their implications, and plan a strategy of intervention.
1. Begin by identifying the behavior you want to acquire: This is called the “target” behavior. You will be more successful if you focus on acquiring a new behavior rather than on eliminating an existing one. For example, instead of setting a target of being less shy, you might define the target behavior as becoming more outgoing or more sociable.
2. The next step is defining the target behavior precisely: What exactly do you mean by “sociable”? Imagine situations in which the target behavior could be performed. Then describe in writing the way in which you now respond to these situations. For example, you might write, “When I am sitting in a lecture hall, waiting for class to begin, I don’t talk to the people around me.” Next, write down how you would rather act in that situation: “In a lecture hall before class, I want to talk to at least one other person. I might ask the person sitting next to me how he or she likes the class or the professor or simply comment on some aspect of the course.”
3. The third step is monitoring your present behavior: You may do so by keeping a daily log of activities related to the target behavior. This will establish your current “base rate” and give you something concrete against which to gauge improvements. At the same time, try to figure out whether your present, undesirable behavior is being reinforced in some way. For example, if you find yourself unable to study, record what you do instead (Get a snack? Watch television?) and determine whether you are inadvertently rewarding your failure to study.
4. The next step, which is the basic principle of self-modification, is providing yourself with a positive reinforcer that is contingent on specific improvements in the target behavior: You may be able to use the same reinforcer that now maintains your undesirable behavior, or you may want to pick a new reinforcer. For example, if you want to increase the amount of time you spend studying, you might reward yourself with a token for each 30 minutes of study. Then, if your favorite pastime is watching movies, you might charge yourself three tokens for an hour of television, whereas the privilege of going to a movie might cost six.
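The token arrangement in step 4 can be worked through as simple bookkeeping. The short Python sketch below is only an illustration: the earning rate and the two prices come from the example above, while the function names and the rule that an unfinished 30-minute block earns nothing are assumptions made for the sketch.

```python
# Hypothetical token ledger for the self-reinforcement plan in step 4.
# Rates and prices are taken from the example in the text; everything
# else (names, the partial-block rule) is an assumption for illustration.

STUDY_BLOCK_MINUTES = 30                    # one token per 30 minutes of study
REWARD_COSTS = {"tv_hour": 3, "movie": 6}   # tokens charged for each reward


def tokens_earned(study_minutes: int) -> int:
    """Tokens for completed 30-minute blocks (assumed: partial blocks earn none)."""
    return study_minutes // STUDY_BLOCK_MINUTES


def spend(balance: int, reward: str) -> int:
    """Return the balance after buying a reward; refuse if it is not yet earned."""
    cost = REWARD_COSTS[reward]
    if cost > balance:
        raise ValueError(f"{reward} costs {cost} tokens; only {balance} available")
    return balance - cost


balance = tokens_earned(200)          # six completed blocks, so six tokens
balance = spend(balance, "tv_hour")   # an hour of television leaves three
```

Because `spend` refuses a purchase that has not been earned, the reward stays contingent on the target behavior, which is the point of the step: with three tokens left, a movie is out of reach until another 90 minutes of study have been logged.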
punishment Any event whose presence decreases the likelihood that ongoing behavior will recur.
The use of punishment has potential drawbacks. It cannot “unteach” unwanted behavior, only suppress it. Punishment may also stir up negative feelings in the person who is punished or inadvertently provide a model of aggressive behavior.
stopping an aversive noise, subtract something unpleasant. Animals will learn to press bars and open doors not only to obtain food and water (positive reinforcement), but also to turn off a loud buzzer or an electric shock (negative reinforcement).
Both positive and negative reinforcement result in the learning of new behaviors or the strengthening of existing ones. Remember, in everyday conversation when we say that we have “reinforced” something, we mean that we have strengthened it. Similarly, in operant conditioning, reinforcement, whether positive or negative, always strengthens or encourages a behavior. A child might practice the piano because she or he receives praise for practicing (positive reinforcement) or because it gives her or him a break from doing tedious homework (negative reinforcement), but in either case the end result is a higher incidence of piano playing.