Operant Conditioning Quiz: Reinforcement and Punishment Practice

Operant Conditioning Quiz: Reinforcement and Punishment Practice

9 – 55 Questions 14 min
This quiz drills introductory psychology learning theory—operant conditioning—with emphasis on classifying consequences as positive/negative reinforcement or punishment and recognizing fixed/variable ratio and interval schedules from response patterns. Expect AP Psychology and MCAT Psych/Soc depth: translate vignettes into Behavior → Consequence → change in future frequency, then diagnose shaping, extinction, and avoidance.
Choose quiz length
1Negative reinforcement increases a behavior by removing or preventing an aversive stimulus.

True / False

2Variable interval schedules are known for producing a scalloped ("studying increases as test time approaches") response pattern.

True / False

3A dog used to get a treat every time it sat on command, but the owner stops giving treats. Over time, the dog sits less often. This decrease is an example of:
4A driver starts buckling their seatbelt more often because it immediately stops the loud warning beep. What type of consequence is maintaining seatbelt buckling?
5Which reinforcement schedule is most associated with a post-reinforcement pause (brief drop in responding right after reinforcement)?
6A child starts cleaning their room more often because cleaning makes a parent stop nagging immediately. Cleaning increases. This is:
7What is shaping in operant conditioning?
8Students tend to check the course website more frequently as Friday approaches because quizzes are always posted every Friday at 3 p.m. What schedule best describes quiz posting?
9A parent adds extra chores after a teen breaks curfew, and curfew violations decrease. This is:
10A student begins submitting assignments early because doing so prevents the teacher from sending repeated reminder emails. Submitting early becomes more frequent. This is best classified as:
11During practice, a coach adds 10 extra push-ups immediately after a player makes a sloppy pass. Sloppy passing decreases over the next week. The extra push-ups function as:
12A child whines in the grocery store, and the parent gives candy to stop the whining. The child’s whining becomes more frequent on later shopping trips. The candy is functioning as:
13In operant conditioning, a consequence is a reinforcer only if it increases the future frequency of the target behavior for that individual in that context.

True / False

14A coffee shop gives a free drink after every 10 drinks purchased. What reinforcement schedule is this?
15A teen argues with a parent. The parent responds by canceling the teen’s chores for the week. After this, the teen argues more often in similar situations. For the teen’s arguing, the parent’s response is:
16A supervisor praises an employee each time they stay late. Despite the praise, staying late becomes less frequent over the month. In this context, the praise is best described as:
17Arrange these shaping steps from earliest to latest when teaching a rat to press a lever for food.

Put in order

1Reinforce the rat for moving toward the lever.
2Reinforce the rat for pressing the lever.
3Reinforce the rat for touching the lever.
4Reinforce the rat for facing the lever.
18Arrange the steps for identifying a reinforcement schedule from a vignette in the most useful order.

Put in order

1Decide ratio vs interval.
2Name the schedule (FR/VR/FI/VI).
3Decide fixed vs variable.
4Determine whether reinforcement depends on responses or time.
19Select all that apply. What information is necessary to correctly label a consequence as one of the four operant-conditioning quadrants?

Select all that apply

20Arrange the decision steps for classifying a consequence into positive/negative reinforcement/punishment, from first to last.

Put in order

1Identify the target behavior.
2Decide whether a stimulus is added or removed.
3Label it reinforcement (increase) or punishment (decrease).
4Name the quadrant (positive/negative reinforcement/punishment).
5Decide whether the behavior increases or decreases in the future.
21A person takes an aspirin when they have a headache, and they start taking aspirin more quickly the next time a headache begins. For the behavior of taking aspirin, this is:
22Select all that apply. Which statements about reinforcement schedules and response patterns are accurate?

Select all that apply

23Select all that apply. Which statements about schedules and extinction are correct?

Select all that apply

24Select all that apply. Which are examples of a variable interval (VI) schedule?

Select all that apply

25A teacher wants to increase hand-raising, but repeatedly gives attention whenever a student calls out without raising a hand. Calling out increases over time. Which behavior is being reinforced?
26Select all that apply. Which consequences are examples of negative punishment?

Select all that apply

27Avoidance behavior (doing something to prevent an aversive event from happening) is maintained by negative reinforcement.

True / False

28A child’s tantrums were previously reinforced by attention. When attention is withheld, the tantrums briefly become louder and more frequent before gradually decreasing. That brief increase is called:

Operant Conditioning Pitfalls: Reinforcement vs Punishment and Schedule Traps

1) Treating “negative” as “bad” instead of “removal”

Mistake: Calling negative reinforcement “punishment” because an aversive stimulus is involved.

Fix: Decide reinforcement vs punishment by the future frequency of the target behavior. If the behavior increases, it’s reinforcement—even if it increases because something annoying stops (escape/avoidance).

2) Labeling the consequence by intention (“reward”) rather than effect

Mistake: Treating praise, stickers, or privileges as “reinforcers” automatically.

Fix: In operant conditioning, a reinforcer is defined functionally: it must increase the behavior in that situation. If the behavior does not increase, it wasn’t reinforcement (even if it was meant to be).

3) Picking a quadrant before identifying the behavior being shaped

Mistake: Focusing on what the parent/teacher “did” instead of what the learner did.

Fix: Write a one-line contingency: Behavior → Consequence → Future behavior change. If you can’t state which behavior is supposed to increase/decrease, you can’t classify the consequence.

4) Confusing negative punishment with negative reinforcement

Mistake: Calling time-out or loss of screen time “negative reinforcement” because something is taken away.

Fix: Ask whether the organism experiences relief (escape) or a loss (removal of a desired stimulus). Relief that increases behavior = negative reinforcement; loss that decreases behavior = negative punishment.

5) Mixing up schedule definitions and their “signature” patterns

  • FR: high responding with a post-reinforcement pause (after “earning” the reinforcer).
  • VR: very high, persistent responding with minimal pause (hard to extinguish).
  • FI: “scallop” (slow after reinforcement, accelerating as the interval ends).
  • VI: steady, moderate responding (because timing is unpredictable).

Reinforcement & Punishment: Five Decision Rules You Should Apply on Every Vignette

Use these rules as a repeatable workflow when you review missed items; they turn “vibes” into a classification you can defend.

  1. Start with behavior change, not the stimulus. Ask: after the consequence, does the target behavior become more likely (reinforcement) or less likely (punishment)? Only then decide positive (add) vs negative (remove).

  2. Define “positive/negative” as add/remove with one concrete noun. Write what is added or removed (e.g., noise, attention, chores, phone access). This prevents swapping negative reinforcement (remove aversive) with negative punishment (remove appetitive).

  3. Separate escape/avoidance from loss-of-privilege. If the behavior produces relief (seatbelt buckling stops a beep; turning in homework ends parental nagging), and the behavior increases, that’s negative reinforcement. If the behavior produces loss (time-out, token removal), and the behavior decreases, that’s negative punishment.

  4. Memorize schedule “signatures,” then confirm with the definition. Ratio = based on number of responses; interval = based on time. Fixed = predictable; variable = unpredictable. Use the graph pattern as a clue, but verify whether the contingency is response-count or time-based.

  5. Watch for shaping and extinction clues that change the correct answer. “Successive approximations” indicates shaping (reinforcing closer-and-closer versions of the behavior). “Reinforcement stops” indicates extinction, often preceded by an extinction burst (a temporary spike in responding that is not “proof” reinforcement is still happening).

Authoritative Operant Conditioning Readings (Definitions + Schedules)

Operant Conditioning FAQ: Quadrants, Extinction Bursts, and Reinforcement Schedules

How do I decide “reinforcement vs punishment” if the question doesn’t explicitly state what happens later?

Infer the future behavior change from the outcome and the goal described in the vignette. If the consequence is presented as making the behavior more likely (e.g., “so the child starts doing it more”), it’s reinforcement; if it makes it less likely, it’s punishment. Then classify positive vs negative by whether a stimulus is added or removed.

Why isn’t negative reinforcement the same thing as punishment?

They move behavior in opposite directions. Negative reinforcement increases a behavior because it removes or prevents an aversive stimulus (escape/avoidance learning). Punishment decreases a behavior, either by adding something aversive (positive punishment) or removing something desirable (negative punishment).

Is time-out negative reinforcement because attention is removed?

Usually not. Time-out is typically negative punishment: access to reinforcement (attention, play, activities) is removed to decrease a behavior such as hitting or yelling. It would only be negative reinforcement if the “time-out” removed something aversive and the problem behavior increased because it produced that escape.

What’s the fastest way to identify FR, VR, FI, and VI schedules in a story problem?

Translate the contingency into one phrase: “After how many responses?” (ratio) or “after how much time?” (interval), and then ask whether that number/time is predictable (fixed) or unpredictable (variable). Use response patterns as confirmation: FI scallop, FR post-reinforcement pause, VR persistent high responding, VI steady moderate responding.

What does “extinction” mean in operant conditioning, and what is an extinction burst?

In operant conditioning, extinction is the decrease in a behavior when reinforcement for that behavior stops. An extinction burst is a common early phase of extinction where responding briefly increases in intensity or frequency (e.g., louder complaining) before it declines—so a temporary spike does not imply the consequence is still reinforcing.

How is shaping different from simply reinforcing the final behavior?

Shaping reinforces successive approximations toward a target behavior (e.g., first sitting near the desk, then opening the book, then completing a problem), which is essential when the target response is rare or absent. Reinforcing only the final behavior works best when the learner already performs something close enough to the target that it can occur and be reinforced.