|
Last
updated March 6, 2003
Operant
Training vs. Classical Conditioning
I think
we are all familiar with Jean Donaldson's Open Bar/Closed Bar
method to help provide a positive emotional response
to the approach of a scary/threatening thing.
ie: dog or person appears, treat treat treat, dog or person leaves,
treats stop.
(If
you aren't, pick up a copy of Culture Clash!!)
Let's
put it in practical use. Here's the scenario:
Dog
in shelter run barks and leaps aggressively at the gate when people
walk by.
We put a bucket of treats at the front of the run and instruct everyone
who walks down the aisle to drop treats into the run and walk away.
Here is the Q of the week:
What
behavior is the treats helping?
For instance, if the dog is snarling/growling when
you approach the cage, and you then give it
a treat, is the dog actually learning that strangers bring GOOD
things instead of bad,
or is the dog being reinforced for snarling/growling?
WHY?
From
Cathy:
At the very first few repetitions, you are, in a way, reinforcing
the snarling/growling, but classical conditioning means associating
one thing with another.This takes many many repetitons. You are
not asking the dog for any behavior at all -- simply person approaching
the cage--give treat. After a few of these approaches to the cage
and delivering the treat, the dog will start to associate the stranger
coming toward his cage with the delivery of a treat, and will most
likely settle in anticipation of that delivery, thus he is being
reinforced for the quiet behavior. With many many repetitions of
this exercise, it is very possible to have that same snarling/growling
dog when seeing a stranger approach his cage, he will stop growling,
sit at the front of his cage and wait for his treat. Of course to
get to that point, one would have to do this exercise many times,
and change the picture of the approach VERY slowly -- i.e. coming
toward the cage and holding out for a couple of seconds before delivering
the treat -- this may result in the dog sitting; or coming toward
the cage and simply placing your hand on the latch and then delivering
the treat -- this could result in a dog not jumping up like a looney
toon when someone goes to open his cage.
Classical
conditioning is one of the most important ways we can help shelter
dog get adopted. This can work with the dogs like the one described
above, or shy dogs who sit at the back of their cages looking scared
to death.
Although
initially my motivation for volunteering at Spokanimal was a selfish
one (gaining experience and learning from the dogs behavior so I
could be a better trainer) it has grown to being able to truly make
a difference in these dogs lives and making them more adoptable
--- and this after only a couple of times being there.
From
Lisa:
It depends on the timing of the treat delivery relative to each
dog. Both classical and operant training is at work. The dog is
being classically conditioned to like the approach of treat-slinging
strangers. It is operantly learning that when they bark and bang
the kennel door, strangers appear with treats.
If the treat slingers wait until the dog is sitting quietly before
delivering the treat, the dog operantly learns that sitting quietly
earns a treat.
From
Diane:
I'd like to respond to this question and offer another.
(question is below)
The
dog in the shelter run has an already established opinion and fear
of people, based on his past experiences, breeding, etc. So, tossing
a cookie to him in passing is not creating this snarling fearful
behavior, it's changing the pattern of what happens, and his opinion
of people, by offering something new. He doesn't expect cookies
from humans, so at first he has to process the new info and then
learn to respond in a new way. If the humans add a nice word and
a smile in a non threatening way he'll be able to be less fearful
faster. Build on this and soon there will be a dog excited to see
people instead of snarling at them. The rate of progress depends
on lots of factors, and some dogs may not be able to rehab.
Fearful
dog snarls---recieves new and positive input from previously scarey
situation----questions what the deal is-----positive input repeated-----dog
realizes scarey situation no longer happens----looks forward for
cookie distrubitors to come by! Yummmmmm
BEAUTIFULLY
PUT, LISA AND CATHY - AND HOORAY!! GREAT ANSWER FROM DIANE, TOO!
And
a great answer from Kim Imel:
Operant
Training Vs. Classical Conditioning
At first it would seem we are rewarding the bad behavior, but if
the treats are powerful enough, and the dog wants them, he will
soon learn that they only come when a person approaches. So people
mean good things. I think that the attitude of the human involved
is also key. The human needs to, in the beginning, be giving calming
signals as they pass the cage and toss the treat. No direct eye
contact or verbal contact, approach from the side vs. direct on.
As the behavior calms, up the ante to eye contact, and talking,
to walking head on looking at the dog. I actually got to do this
with a wolf-hybrid that boarded at the hospital I worked for in
Calif. Went from having her do the "wolf stance" growling in the
back of the cage to singing and wagging her tail as soon as she
saw me and throwing herself against the front of the cage as I approached
so I could pet and sweet talk her.
Why? I think it is a matter of building confidence in the dog. For
some reason the dog is not confident that people coming near his
cage are safe. Or people in general are not safe. Some dogs run
away or cower in the back of the cage, others charge/show aggression.
Flight or fight. However when we give treats each time we walk by,
they are being conditioned that way cool things happen, slowly building
confidence in humans as providers of good. It is like skiing. You
start out at the top of the slope and think, "Oh, my God… I'm
gonna die", then you head down screaming in your head "oh my God-oh
my God-oh my God", but you make it down with out falling. The next
run you head down saying in your head "oh my-oh my-oh my" and guess
what, another safe run. Run three you whisper "oh-oh-oh" all the
way down, again a clean run. Run four, you swish down and end with
a triumphant "woo-hoo!!". By the end of the day, you are racing
to get to the lift so you can make another run. It is a matter of
confidence when you face the unknown it is a little scary at first,
but with each success, you build your confidence to the point that
you look forward to the event happening again. For us, it is another
run down the slopes, for the dog it is another human walking by
his cage.
My
response:
Classical
conditioning: Learning by association
Operant conditioning: offering a behavior
to earn a reward/avoid a punishment.
The
purpose of the repeated pattern of "human appearance equals food"
is to classically condition a new emotional response to humans
passing by the kennel run... the theory being that one can in time
change the dog's behavioral response to humans passing by. The dog
will soon look forward to, instead of becoming alarmed at, the stranger's
approach.
Of
course it would seem that giving food would reward what EVER the
behavior the dog is presenting at the time of the food delivery.
Here's
the answer to why it doesn't:
Scientific studies show that classical conditioning
overrides operant.
When
a dog is in a limbic, or emotional state, he is unable to learn
through operant conditioning. When a dog is totally stressed out,
it *can* learn, but only by association. Learning via consequences
doesn't happen in this state. So, the dog will associate the person
with the treat, but not learn that the treat is a consequence of
its behavior. There is learning going on, but not the kind where
we say "sit," the dog sits, the dog gets a cookie, the dog *chooses*
to sit again.
So
by dropping treats regardless of the dog's behavior, we CAN classically
condition his emotional state by pairing a positive outcome with
the appearance of a human at the front of the cage. When this happens,
the main or sole effect of dropping treats is to help the dog calm
down and begin to have a more pleasant association with humans passing
by. The end result, the aggressive behavior decreases as the dog's
comfort level increases.
Even
dogs who are behaving more operantly (able to learn), are probably
behaving in this way to drive people away because, for whatever
reason, people make them nervous. They repeat the behavior because
it works. Thus, dropping the treat still changes the emotional response
and the dog will likely feel less need to behave in the repelling
manner after several repetitions.
*
A very few dogs are fully operant when they are
barrier-guarding at shelters (will observe that by being aggressive
they can cause a person to appear and give treats.). Even in these,
we can create a happier emotional response to the presence of humans.
As the dog begins to calm down, we need to keep notes, observe,
quantify the behavior of the animal: "is it improving or not?" and
proceed accordingly. As the dog gains confidence and is able to
concentrate on how his behavior affects the outcome, then do more
operant conditioning oriented work with him. Mark appropriate behaviors
as they happen until he starts to consciously offer them, and then
switch to Premack: "If you sit quietly, then I'll
give you a treat."
Additional
comment from Lisa: "Emotional learning overrides
all else -- it is why dogs do not generalize well when we train
operantly and they DO generalize emotionally driven behaviors."
QUESTION
FROM DIANE: Okay, here's my question about redirection.
Sometimes I see this. Dog snarls at other dogs. Handler redirects,
rewards. Dog doing better, gets this down. Most dogs do fine from
there. What about the occasional dog who learns the pattern of snarl
(or whatever reaction they do), turn to mom and look for cookie,
and keeps repeating this pattern specifically for the reward? Some
dogs almost literally smile at mom as they turn away from the dog
they snarl at, and they seem relaxed even as they do the snarl.
They learned to redirect, and that they need not worry about the
other dog, just look to mom, but not to omit the snarl. What's the
next step?
(Gee,
could this be SOFI the BC/Aussie? *G*)
See " * " paragraph above Lisa's
additional comment, as it applies to this situation.
This is
usually the result of a handler working with a dog who is incredibly
sharp and observant AND one of the rare dogs who is operant when
is a limbic state. Note that classical conditioning is at work -
the dog is "smiling" when it performs the reinforced behavior!
The
dog is simply doing what it's been taught. It has learned the sequence.
The other dog becomes cue to lunge (and the lunge itself is self
reinforcing behavior - the dog gets a buzz out of it), handler redirects
& from the dog's point of view is rewarding a predictable
sequence of chained behaviors.
The
primary goal of this exercise from start to finish is to redirect
BEFORE the lunge - before the dog takes in its breath - while the
dog is NOT reactive. The redirection is to PREVENT the lunge, the
lunge is not the cue to redirect.
This
is often the result of poor handler timing. When working with incredibly
fast dogs who show little if any body language signals (like Sofi),
this can be extremely difficult to catch at first. If the handler
has a proactive plan - redirecting *at first notice BEFORE the dog
erupts* and jackpots any choice NOT to erupt, she can get rid of
this accidentally reinforced behavior. It won't be easy, because
part of the behavior is self-reinforcing.
I
would personally add a verbal NRM (neutral no reward marker - ah-ah!)
at the point of lunge and redirect as always but no longer C/T the
resulting attention if *I* missed the pre-signals and a lunge happened.
(I would take responsibility for being the one who missed it and
would not set the dog up to lunge so I could correct it!) If the
dog spun around following the lunge and gave me its attention, I
would quietly acknowledge the attention but not C/T, increase the
distance from what precipitated the lunge in order to set the dog
up to succeed, and jackpot success on the next pass.
Lisa
- perhaps you could share how a correction situation
classically conditioned your Bouvier student to hate the approach
of other dogs!
Lisa's
response to Diane's question AND the Bouv story:
Diane
asked a GREAT question and your resonse, was, like "oh yeah, of
course!" Have to redirect before the explosion. How about in a "de-exploding"
training set-up -we get a handful of well-trained dogs and handlers
to walk by Sofie at varying distances (starting farther away and
gradually getting closer). Sophie is tied up. Sophie's mom simply
feeds, feeds, feeds as the dog goes by. If Sophie explodes the other
handler and dog are to stop with the other dog facing away from
Sophie and mom walks away. [Hooray for John Fisher - good plan!]
When Sophie turns to look for mom, the other dog/handler can then
walk away. OR in a classroom situation, if Sophie explodes, the
food goes away and mom turns into a tree. After they are successful
at this, the next step is to reward Sophie for voluntarily turning
away from the other dog, whether or not she looks at her mom. In
other words, click a lookaway when Sophie offers it instead of exploding.
The last time I saw Sophie in class her mother was often not even
looking at Sophie during the entire explode, re-direct, click and
feed sequence. She has actually trained Sophie (thanks to us) to
perform the sequence of behaviors. With Denali, I rewarded voluntary
look-aways. And the calm looks at other dogs. This is the ticket.
Reward a dog's decision to NOT explode.
The
Bouv Story:
Classical conditioning at work ...
Chloe,
the Bouvier, is one of four Bouvier's in this family. They are show/obedience
dogs. Chloe is 14 mos old and started exhibiting "aggressive" behavior
towards other dogs in the show ring. When getting a history on the
dog, I learned that there are many rude dogs where they train. Starting
when Chloe was 6 mos old, whenever dogs got loose and/or in Chloe's
space her handler would pop the leash and jerk Chloe away. Soon,
Chloe was growling as the other dogs approached, which earned her
another leash correction. Only now, Chloe's mom, not understanding
what was happening also became extremely tense when other dogs approached
so whenever other dogs got close she sucked in her breath and tightened
her muscles in preparation for whatever was going to happen. Karen's
response convinced Chloe that bad things happened when other dogs
came close, so she worked harder at chasing them away with bigger
growls. She was telling all other dogs, "GET AWAY NOW Pleeeeeeaaaaaase,
so I don't get in trouble" In other words, Chloe developed a negative
association to dogs approaching when she was on-leash.
The
biggest lesson I could offer was to teach Karen how to read her
dog and be her advocate and re-direct prior to the explosion. Following
is an excerpt from a recent email from Karen:
"We
have had two obedience classes with Barb since we met with you.
Chloe was good in the first class, and she seemed to be even more
relaxed in last night's class. I spend more time reading her than
actual training, but it seems to be paying off. I'm keeping her
out of the major crush of dogs, but we still are a part of the class.
We had a Golden puppy get loose last night, and I just shielded
Chloe with my body. I'm pleased with her progress. I will set up
another session after the next class. I read the article that you
recommended (He Just Wants to Say Hi, by Suzanne Clothier), and
I ordered the book. I recognized myself in the article."
Thank
you, Lisa!
QUESTION OF THE WEEK, PART TWO:
GIVE
EXAMPLES OF HOW THIS THEORY
MIGHT APPLY IN A CLASS SITUATION?
The
reactive behavior of the shelter dog is seen in almost every first
week class.
The
dog who erupts barking, the dog who hides under the chair, the dog
who
is shy and withdrawn or who growls at any approach or eye contact.
We see them every day.
Every
time there is a dog with a stress-based issue in class, we must
use classical
conditioning (learning by association) to address it. It's imperative
that you have a working
understanding of learning theory and how it applies to the class
situation in order
to provide the most optimum help to this dog and student.
Given the laws of learning theory, we know we can't "train"
them to "feel better" ...
an enforced down, holding their face and making them look at
you,
making them come out from under the chair, scolding,
leash corrections, binaca or squirt bottles might temporarily
inhibit the visible aspects of the behavior
but it won't change the internal SOURCE of the behavior which is
stress
- in fact, they will increase stress and make the dog's emotional
state worse.
We will HAVE to use classical conditioning to get there. HOW?
BRAINSTORM TIME!
WHAT
SPECIFIC WAYS CAN WE USE CLASSICAL CONDITIONING
TO CHANGE A STRESSED STUDENT OR DOG'S EMOTIONAL STATE IN CLASS?

YOUR ANSWER!

|