Table of Contents |
You may recall that operant conditioning is when learning occurs through the association of consequences to behaviors. Operant conditioning is concerned with what happens after and as a result of a person's actions, which makes it more or less likely for them to repeat that action later.
Reinforcement is anything that follows a behavior and makes it more likely for a response to be repeated.
Reinforcement is grouped into two different categories: positive reinforcement and negative reinforcement. It is important not to think of “positive” and “negative” as good and bad. Rather, these terms refer to the effects or the consequences that change behaviors; thus, one isn't better than the other.
Positive reinforcement is anything that is presented or given to the subject that makes it more likely for the response to be repeated.
EXAMPLE
Examples of positive reinforcement would include such things as candy, stickers, a high five, or even verbal praise, given to a child after they do a good job on a math test, for instance. In turn, the child is more likely to perform well in math later, because they received that positive reinforcement.In contrast, negative reinforcement is anything that is taken away from a subject that makes it more likely for a response to be repeated. Remember, reinforcement is all about making it more likely for something to occur; therefore, negative reinforcement is generally taking away something that is unpleasant to compel someone to want to perform an action more.
EXAMPLE
Suppose there is an annoying sound in your room, and it can be turned off by pushing a button. The negative reinforcement is the annoying sound, which makes you more likely to perform the behavior of pushing the button. Similarly, you may perform a chore so that your roommate doesn’t nag at you. In either case, you're more likely to do what you're supposed to be doing.Punishment follows a response and makes it less likely for a behavior to be repeated.
In other words, punishments are trying to suppress a behavior. There has been much research, particularly in the field of childhood development and education, that has shown how effective punishment is.
Under punishment, there is also positive and negative reinforcement:
The difference between these two types of aversive consequences is that negative reinforcement makes it more likely for a behavior to repeat, whereas punishment makes it less likely for the behavior to repeat. Both of these concepts deal with unpleasant things, but the consequences, or results, are different.
Punishment can be more powerful than reinforcement, especially in the short term. It is possible to change somebody's behavior rather quickly as a result of punishment, but it can have some negative or unwanted results.
For instance, when you're using punishment, you don't teach a person to be positive, or to learn prosocial kinds of behaviors. Generally, you're simply teaching them either escape learning or avoidance learning, which are ways to end an aversive stimulus quickly.
Escape learning is learning to respond in a way that ends an aversive stimulus quickly, by trying to get away from the aversive stimulus. Avoidance learning, similarly, is learning to respond in a way that postpones or prevents an aversive stimulus from happening.
Instead of learning to deal with problems, like bullying, for example, a child instead learns to escape or avoid. This, in turn, rewards them with some relief so that they feel better about the situation. This makes them more likely to escape or avoid again. Therefore, punishment isn't necessarily teaching somebody in the best possible way.
There are a few other notable issues or problems with punishment:
Reinforcers can be used in more abstract and complex ways to explain how all human behavior occurs. This tutorial will explain how some of those higher-level reinforcers help to create a wide range of human behaviors.
There are different levels of reinforcers, which have gradually become more abstract and more complex in the way that they are applied. This tutorial will focus mainly upon positive reinforcers as examples, but keep in mind that negative reinforcers also apply to these concepts.
Primary reinforcers are basic types of reinforcers that are rewarding and desirable in and of themselves.
These are things that people don't have to learn to like, but rather things that people typically innately like.
Primary reinforcers are generally very basic things related to biological needs, such as:
EXAMPLE
If somebody offers you a glass of water on a hot day, you don’t have to wonder if you need it. You’re programmed biologically to need hydration.Secondary reinforcers are reinforcers that people have to learn to value. These are things that are rewarding and desirable generally because they're related to a primary reinforcer. They're not necessarily something that, as a child, you would automatically know that you should like and want.
There are different categories of secondary reinforcers:
EXAMPLE
For example, money is a secondary reinforcer because it's not something that you necessarily value in any way on its own, because it's just paper. However, you know that you can use it to buy things like food or other things that you want.EXAMPLE
Children tend to associate attention with biological primary reinforcers like food or physical contact. However, it's not necessarily something that they innately know they should like or want.EXAMPLE
If you’re playing a video game, elements like the background music or the flashing colors let you know exactly how you’re performing in the game. Thus, this feedback improves the likelihood of you modifying your behavior to do what you're supposed to be doing.It is important to note that secondary reinforcers are not less powerful than primary reinforcers; the desire for things like money or social attention can oftentimes be more powerful.
Self and Social Awareness: Skill Reflect |
When and how a behavior is rewarded can determine whether it is learned as well as the strength of that learning. A schedule of reinforcement is a plan that determines when a behavior will or will not be reinforced.
There are several different kinds of schedules of reinforcement under operant conditioning. They fall under two basic categories:
1. The first category is continuous reinforcement. This means giving reinforcement every time a behavior is performed.
EXAMPLE
Every single time a child performs a behavior you're hoping for, you give them a piece of candy.There are some pros and cons of this type of schedule:
The second category is partial reinforcement, which tends to produce more resistance to extinction. This means that the learning is less likely to go away since a subject is less likely to get tired or satiated by the reinforcer.
There are four basic types of partial reinforcement that are used:
The first kind of partial reinforcement is fixed ratio reinforcement or FR. In FR, reinforcement is given after a specific, predetermined number of correct behaviors.
EXAMPLE
Every third or fourth time a child performs the behavior you want, you give them a reward.This generally leads to some very quick, consistent responses; it is easy for the learner to catch on.
EXAMPLE
A rat that is placed within a box with a button that dispenses a treat will press the button many times, one after the other. It will continually press the button because it realizes that after a certain number of times, it will get the reward.In contrast, the second type of partial reinforcement is the variable ratio (VR) schedule of reinforcement. In VR, reinforcement is given after a variable, unpredictable number of correct behaviors.
Usually, reinforcements are given within a range, such as between every two to five times on average. Each time the subject performs the behavior, they won't know whether or not they will receive the reward. This generally leads to very high, consistent levels of responses.
In addition to ratios, there are also intervals. A fixed interval (FI) is when a reinforcer is given after a specific amount of time has passed since the last reinforcement.
EXAMPLE
A subject receives a reward after 30 seconds have passed from the last time the subject performed the desired behavior.This leads to some very slow responses from the subject. Generally, there will be more response towards the end of the interval, because the subject begins to predict exactly when they can get the reward. Thus, they only perform the behavior when they know they're going to receive reinforcement.
EXAMPLE
One example of FI might be getting paid a salary at the end of every week. You don't necessarily work harder at the end of the week before you get paid. Instead, you generally tend to work harder at the beginning of the week.A variable interval (VI) schedule of reinforcement is when a reinforcer is given after an unpredictable, variable amount of time has passed since the last reinforcement. It might be on average every 30 seconds, but the subject doesn't realize this, nor does the experimenter generally realize it.
This generally leads to some very slow, but also very steady responses that don't tend to go away very easily, because the subject doesn't realize when they're going to get the reward. Therefore, they have to produce the behavior at regular intervals because they need to be ready to receive the reinforcement that they desire.
Source: THIS TUTORIAL WAS AUTHORED BY SOPHIA LEARNING. PLEASE SEE OUR TERMS OF USE.