Skip to content
Publicly Available Published by De Gruyter January 12, 2016

Optimal Penalties for Repeat Offenders – The Role of Offence History

  • Alfred Endres EMAIL logo and Bianca Rundshagen

Abstract

Within an infinite and a corresponding finite game framework we analyse intertemporal punishment for repeat offenders. The legal authority is assumed to maximize social welfare by minimizing the sum of harm from crimes and cost of punishment. We show that the time horizon considerably affects the structure of the optimal penalty scheme. In the finite game framework decreasing as well as escalating penalty schemes may be optimal. For the more appropriate infinite game framework we show three main results: First, any penalty scheme can be replaced by a (weakly) escalating penalty scheme that leads to the same criminal activity and the same social penalization cost. Second, the optimal penalty scheme is of the escalating type. Third, the socially optimal level of crime under escalating penalties may be higher than the level which would be optimal under uniform penalties.

JEL: K14; C73

1 Introduction

This paper is in the economics of crime literature tradition building upon the fundamental work of Becker (1968). Here, the decision on whether to commit a crime is modelled as a rational choice of the potential offender. It is rational in that it is based on utility maximization balancing expected benefits from crime against expected costs. There is an extensive literature on the benefits and costs of crime starting with the seminal papers by Ehrlich (1973, 1975) highlighting the importance of the expected size of punishment. [1]

Of course, criminals impose costs on society, particularly the damage done to their victims but also indirect costs like higher uncertainty in economic and social interactions. Consequentially, society allocates scarce resources to crime prevention. Consistent with the understanding of the potential criminals’ decisions sketched above, the decision of society on how much to spend on crime prevention is also interpreted to be a rational one balancing benefits of crime reduction and its costs. [2]

Obviously, the decisions of potential criminals and the decisions of the designers of criminal law are highly interdependent. Not only do potential criminals acknowledge the size and structure of penalties as an important determinant of their decisions for (against) crime. Also, the lawmakers should take the cost-benefit-analysis of potential criminals into account in the making of criminal law. Any assessment of the deterrence effect of a certain form and size of punishment (and thereby of the benefits of this punishment) must be based on a hypothesis about how potential criminals might adjust their decisions to this specific punishment.

To date, most of the contributions to the literature tradition referred to above use static economic models. A price to be paid for the analytical simplicity thereby achieved is that certain intertemporal aspects of crime and punishment cannot be satisfactorily analysed. [3] One of these aforementioned aspects is the intertemporal profile of punishment for repeat offenders. [4]

A central question in this context is: “We observe that repeat offenders are punished more severely than first time offenders. What (if any) is the economic rationale for this escalating penalties structure?”

Papers that explore the escalating (or synonymously: increasing) penalties question address two issues: First, how does a potential criminal react to alternative intertemporal punishment structures for first time and repeat offenders? Second, what is the optimal intertemporal structure of penalties (given the criminal’s cost benefit analysis explored in the first step)? The optimality concept used for the choice of the intertemporal penalties structure varies within the “escalating penalties literature”. Most authors assume that the lawmaker strives to design the penalty structure which maximizes social welfare in that the social cost of crime is minimized. [5] Typically the social cost of crime consists of the damage suffered by the victims and the cost of crime prevention. Some authors differentiate the cost of crime reduction distinguishing between the cost of apprehending offenders and the cost of punishment. [6] Others differentiate the benefits of crime reduction subtracting the utility losses suffered by the criminals from the gains of the potential victims which are spared. [7]

Most of the contributions to the escalating penalties literature use a finite game framework and confine the analysis to two periods and two sanction levels, one for first time and one for repeat offenders. [8] The advantage of this framework is analytical simplicity. The drawback is well-known in the game theoretic literature: The approach suffers from all kinds of “finite horizon paradoxes” (Rubinstein 1998, 165), the discussion of which began with the seminal contributions of Luce and Raiffa (1957) and Selten (1978), and is ongoing.

Apart from this methodological problem the results produced by the established escalating penalties literature are puzzling, in that a decreasing penalty scheme (one that punishes repeat offenders less severely than first time offenders) is optimal at least for certain parameter values in many of these finite models. [9] However, this kind of penalty scheme is nowhere to be observed in reality, and seems implausible.

To solve the “puzzle of escalating penalties” (Dana 2001), we analyse the impact of the time horizon on the structure of the optimal penalty schemes. In the main parts of the paper (Sections 2–5) we extend the model of the “criminal career” of an individual from the two-period case presented in the literature to an infinite game framework in order to formalize the unknown endpoint of lifetime more appropriately. Specifically, we assume that a decision maker who is alive in period t expects still to be alive in the next period with a positive probability q<1. [10],[11]

In Section 6 we consider a corresponding traditional two-period framework and show that in this finite model the optimal penalty scheme might be of the escalating as well as of the decreasing type as it is also the case in Emons (2007), Friehe (2009), Miceli (2013) and Polinsky and Rubinfeld (1991). This highlights the fact that the time horizon may have a crucial impact on the optimal penalty structure. [12]

The infinite game framework is similar to the one of Endres and Rundshagen (2012). However, opposed to the paper at hand, in Endres and Rundshagen (2012) only two sanction levels are considered, i. e., the sanction for repeat offenders is not allowed to depend on the number of previous records, which is an important feature of the paper at hand. This allows us to rule out the optimality of hybrid penalty schemes, which combine decreasing and increasing steps of penalization. Moreover, Endres and Rundshagen (2012) do not compare their outcome with a corresponding finite game, which is a central part of the paper at hand.

In both frameworks of this paper, the infinite and the finite one, the game is modelled in three stages. In the first two stages the legal system is chosen. First a regulator, who strives to minimize the sum of harm from crimes and cost of punishment chooses a target criminality level and in the second stage he chooses the corresponding optimal penalty scheme. In the third stage which comprises the infinite game described above or a traditional two-period game, respectively, potential offenders decide in each period of their life whether they commit a crime.

For the infinite game framework we derive three main results: First, any decreasing or hybrid penalty scheme can be replaced by a (weakly) escalating penalty scheme that leads to the same criminal activity and the same social penalization cost. Second, for any (exogenously) given aspired criminality level (chosen in stage one), the cost minimizing penalty scheme (equilibrium of stage two) is monotonically increasing in the number of previous police records but not strictly increasing. Third, although crime prevention is cheaper under optimal escalating penalty schemes than under uniform penalty schemes, the socially optimal level of crime may be higher than the level which would be optimal under the restriction of uniform penalties.

Moreover we show that there is a fundamental difference between the equilibrium strategies in the finite and infinite game framework. In the infinite game a decreasing sanction scheme never leads to higher welfare than a uniform penalty scheme. The driving force behind this result is the fundamental divergence between the equilibrium strategy of a potential offender under a decreasing sanction scheme in a game with ex ante known number of periods, and this strategy in an infinite game. In the finite game under a decreasing penalty scheme the decision of the potential offender in the second (or a later) period, depends on whether the offender has been convicted for previously committed crimes. However, in the infinite game a potential offender either commits the crime in each period or in no period under a decreasing penalty scheme as well as under a uniform penalty scheme. In both cases he commits the crime if and only if his benefit from committing the crime exceeds his average expected penalty. Thus, for any decreasing penalty scheme which induces the same criminality level as a given uniform penalty, the average expected penalty coincides with the uniform penalty. Hence, the critical benefit that separates offenders from non-offenders is identical under both penalty schemes, too. This equality also implies identical penalization costs. To prove this conjecture we proceed as follows: for a given penalty scheme we relate the decision of a potential offender to commit a crime to a sequence of critical benefits {bn}n=0,1,. For criminal record level n an outsider will observe that the offender commits the crime if and only if his benefit is larger than bn. We then show that each penalization scheme that leads to the same sequence of critical benefits (as it is the case, e. g. for a decreasing and corresponding uniform penalty scheme) implies identical penalization cost. Moreover, for the infinite game we show that each set of penalty schemes that lead to the same sequence of critical benefits contains an escalating penalty scheme (including the uniform penalty scheme as border case). Hence the task of finding the overall optimal penalty scheme is reduced to the task of finding the optimal penalty scheme within the set of escalating ones (including the uniform penalty schemes as border case). We show that the structure of the unique optimal penalty scheme is of the following type. There exists at most one criminal record level n0 for which the penalty level is non-degenerate (0<sn0<smax). All the other records result in either zero penalty (sn=0 for n<n0), or the maximum penalty (sn=smax for n>n0).

The intuition behind this result may be explained as follows. Consider a uniform or escalating penalty scheme s which induces the maximum tolerable criminality level, as chosen by society. Now consider a change from this scheme to a scheme of escalating penalties sˆ which just induces the same criminality level but differs from s by a lower penalty for offenders with a low criminal record and a higher penalty for offenders with a high criminal record. The change in the penalties must be such that the increased criminal activity of offenders with low criminal record is exactly offset by the decrease in the criminal activity of offenders with high criminal record. However, even though the level of crime is unaffected by the envisaged change in the penalty structure, the aggregate cost of punishment goes down: the cost of punishment for the additional criminal activity of offenders with low criminal record is lower than the saved cost of punishment for the reduced criminal activity of offenders with high criminal record. This is so since the (new) penalty for offenders with low criminal record is lower than the (old) penalty for offenders with high criminal record and thus the aggregate penalization effort is reduced. [13],[14]

In Section 6 we demonstrate that in the finite game framework, as well as in the infinite one, cost might be reduced by using escalating instead of uniform penalties. However, due to the final round effect, different from the infinite scenario, they also might be reduced by using decreasing penalties instead of uniform ones. Here it depends on the parameter values whether increasing of decreasing penalties are globally optimal in this setting.

We proceed as follows: In Section 2 we present the basic model. In Section 3 we derive the equilibrium strategies of the potential offenders, that is, we first solve the infinite game of stage 3. In Sections 4 and 5 we derive the equilibrium strategies of the legal authority. In Section 6 we investigate the consequences of a change from our infinite game framework to a corresponding two-period framework. We conclude in Section 7.

2 The Basic Model – Assumptions

Suppose that some individuals, the “potential offenders”, receive non-negative, constant gross benefits, b, from crime. [15] We assume that in each period a set of potential offenders with benefits distributed according to a continuous distribution function F:B[0,1], B0+, is born. [16] Additionally we assume that the probability density is bounded away from zero and that the hazard rate [17] defined by h(b)=f(b)/1F(b) is increasing. [18] The probability that an individual which is alive in period t still lives in period t+1 is given by q(0,1). [19] Hence in each period the total number of potential offenders is given by 1/(1q). [20] Moreover, each individual is assumed to be able to commit at most one crime per period in which it is alive. [21]

We formalize the decision process as a three stage game. [22] In the first two stages the regulator chooses the maximum tolerable (“aspired”) number of crimes c per period (stage 1) and a penalty scheme (stage 2) {sn}n=0,1,... with 0snsmaxn0, where the index n denotes the number of previous convictions and smax the maximal possible sanction. [23] We illustrate our point using three types of penalty schemes: uniform penalty schemes defined by sn=sˉn0, decreasing penalty schemes, defined by sn+1snn0 and increasing penalty schemes, defined by sn+1snn0. [24] The Propositions however are also valid for hybrid penalty schemes for which there exist criminal histories n,m0, such that sm+1<smand sn+1>snhold. In the third stage potential offenders decide whether they commit a crime.

Let p(0,1) denote the (exogenously given) probability of apprehending offenders which is assumed to be independent of the type of offender (defined by his previous number of crimes and his benefit). [25] We assume that F(psmax)<1 holds, i. e., even if the regulator chooses the maximum available penalty for each offender there are some potential offenders who may not be deterred from committing a crime.

In this paper, we follow the lines of neoclassical welfare economics. There, the regulator is stylized to maximize social welfare. In our model the effect of crime on social welfare is two-fold. First, crime inflicts harm to the victim and second, punishing criminals is costly to society. [26] Thereby, the strive to maximize social welfare translates into a strive to minimize the sum of harm from crimes and cost of punishment. This implies that the optimal crime control policy is defined by using “the minimum amount of punishment to achieve any given level of crime control” (Kleiman 2009, 3). [27]

Harm from crimes is given by H(c) with H(c)>0c(0,cmax), where cn(s) denotes the number of crimes per period committed by the group of potential offenders with n police records given the penalty scheme s, c=n=0cn(s) denotes the aggregate number of crimes committed per period and cmax denotes the aggregate number of crimes committed if there are no sanctions at all. Penalization costs are given by Ppn=0cn(s)sn with P denoting a strictly increasing function of the sum of imposed sanctions. [28] Thus, the objective function of the regulator may be written as

[1]minsK=Hn=0cn(s)+Ppn=0cn(s)sn.

In the following three sections we analyse the game backwards and begin with the third stage. That is, we first solve the infinite game (where the potential offender plays versus nature, which determines the offender’s lifetime). Replacing the third stage by its equilibrium outcome reduces the infinite game to a finite one.

3 Third Stage: Equilibrium Number of Crimes – Individual Choice and Aggregation

3.1 The Choice of a Potential Offender

Consider the equilibrium behaviour of a potential offender with benefit b and n previous convictions in t0. A potential offender either commits the crime immediately or he does not commit a crime in any period tt0. The reason is that it is never worthwhile to delay an offence to a subsequent period t1>t0. If the potential offender would delay committing the offence he would receive the payoff 0 in each period t with t0t<t1 and in period t1 (provided that he is still alive) he would face the same decision problem as in period t0. Hence, the lifetime returns of the potential offender with n previous records may be expressed by the following recursive equation:

[2]V(n)=max0,bpsn+qpV(n+1)+(1p)V(n).

The potential offender either decides against committing the crime and receives V(n)=0, or he commits the crime. In this case his lifetime returns are composed of three parts. First, there is the benefit from committing the crime, b, second, the offender has to suffer from the expected penalty psn and third, his criminal history status changes from n to n+1 with probability p, which becomes relevant if the potential offender is still alive in the next period, which occurs with probability q.

The potential offender commits the crime if and only if V(n)>0 holds. For any sanction scheme {sn}n=0,1,... and any criminal history n lifetime returns V(n) are positive, if the benefit b which the potential offender receives from committing a crime (and which is constant over time) is sufficiently high. In the following we denote the threshold benefit level for an offender with n previous sanctions by b¯n. That is an offender with criminal record n would commit the crime if and only if b>b¯n holds. [29] However only if also b>b¯m for all m<n holds, the potential offender may receive n convictions (because otherwise he stops criminal activity earlier). Hence the criminal activity of potential offenders may be represented by a sequence of critical benefit levels {bn}n=0,1,... with bn=max{b¯m,0mn}. An offender with benefit level b becomes a repeat offender with criminal history n (provided that he is alive long enough), if and only if b>bn holds.

In the following we determine the sequences of critical benefit levels for uniform, decreasing and escalating penalty schemes.

  1. Under a uniform penalty scheme we have V(n+1)=V(n) with V(n)>0b>psˉ. That is, a potential offender commits the crime, if and only if the gross benefit of committing the crime exceeds the expected penalty. Hence the critical benefits are given by bn=b¯n=ps¯.

  2. Under an escalating penalty scheme we have 0V(n+1)V(n). Hence a potential offender commits a crime as long as his benefit exceeds his expected penalty, i. e., the critical benefits are given by bn=b¯n=psn.

  3. Under a decreasing penalty scheme a potential offender either commits a crime in each or in no period (as in the case of uniform penalties). If there is a criminal record mn such that V(n)>0 and the strict inequality sm+1<sm holds, we have V(n+1)>V(n), from which b¯n<psn follows. Thus, a potential offender is willing to receive an expected loss in the current period if this loss is outweighed by the positive effect of receiving the more favourable status as a repeat offender with higher criminal record for the subsequent periods. Moreover we have b¯n+1b¯n from which follows that bn=max{b¯m,0mn}=b¯0 holds. To determine b¯0, we consider a potential offender who is (without loss of generality) assumed to be born in period 0 and commits a crime in each period. If the potential offender has got exactly k convictions with penalties s0,,sk1 in the t periods 0,…,t–1, he receives penalty sk in period t provided that he is convicted (again) (which occurs with probability p). The probability for k previous convictions is given by the binomial distribution Bk|t=tk(1p)tkpk with the binomial coefficients tk=t¡k¡(tk)¡. Thus his expected penalty in period t (provided that he is still alive) is given by pk=0tBk|tsk. Hence a potential offender commits a crime in each period if and only if t=0bqt>t=0qtpk=0tBk|tsk holds, which is equivalent to b>ps with s:=(1q)t=0qtk=0tBk|tsk. Hence the critical benefits are given by bn=b¯0=ps. Note that the general structure of the optimal strategies under a uniform and decreasing penalty scheme is identical: The potential offender commits a crime in each period if and only if the benefit exceeds the average expected penalty. Otherwise he does not commit a crime.

With respect to the sequence of critical benefits {bn}n=0,1,[30] the following general statements apply:

Proposition 1: Sequence of critical benefit levels

  1. For each type of penalty scheme including the hybrid ones the sequence of critical benefits is weakly increasing, bn+1bnn0.

  2. Different punishment schemes may give the same sequence of critical benefits {bn}n=0,1,...

Proof

See Appendix.

Proposition 1.ii implies that a potential offender behaves identically under two punishment schemes which lead to the same sequence of critical benefits. He stopps commiting crimes when he receives criminal history n for which bbn holds.

3.2 Aggregation

Building on the previous analysis of the equilibrium behaviour of a single potential offender the equilibrium number of crimes per period may be determined.

Proposition 2: Equilibrium criminality levels

For a penalty scheme{sn}with corresponding sequence of critical benefits{bn}the aggregate and criminal record specific equilibrium criminality levels are given byc=n=0(1F(bn))t=nqtBn|tandcn=(1F(bn))t=nqtBn|t.

Proof

See Appendix.

The criminal activity of a potential offender for any criminal history only depends on the relation of his individual benefit b to the critical benefit levels. Thereby, the aggregate level of criminality also depends on the sequence of critical benefit levels only. Hence, two punishment schemes with the same sequence of critical benefit levels do also lead to the same criminality level.

4 Second Stage: Lawmaker’s Choice – The Optimal Time Profile of Sanctions

Having determined the equilibrium criminality levels, we now consider the optimization task (1) of the regulator. As already mentioned in Section 2 this task is divided into two steps. In the second stage the regulator has already chosen the aspired number of crimes cˉ per period. His remaining task is to choose a sanction scheme which achieves cˉ with minimal penalization cost. Since the penalization costs are a strictly increasing function of total sanctions (P(pn=0cn(s)sn) with P>0), without loss of generality we may assume that P equals the identity function. Hence in the remaining parts of Section 4 we consider the equivalent optimization task

[3]minsP(c(s),s):=pn=0cn(s)sns.t.c=n=0cn(s)c¯.

To assure that a reduction of crime (or equivalently an increase of a sanction) raises punishment cost, we need the following assumption:

Assumption 1

smaxs˜ with s˜ implicitly defined by ps˜h(ps˜)=1. [31],[32]

If the regulator chooses the minimal (maximal) sanction sˉ=0 (sˉ=smax) the corresponding criminality level is given by cmax=1/(1q) (cmin=1F(psmax)/(1q)).

In the remaining part of Section 4 we assume that for the criminality target cˉ(cmin,cmax) holds in order to assure that there exist different penalty schemes that enforce cˉ and thus the solution of eq. [3] is non-trivial. [33]

In Section 3 we have seen that a given criminality target cˉ can be induced by different penalty schemes, which either may lead to the same or to different sequences of critical benefits {bn}. Proposition 3 shows that the penalization cost under a given penalty scheme is determined by the sequence of critical benefits.

Proposition 3

Each penalty scheme{sn}that induces the same sequence of critical benefits{bn}(and hence the same criminality levelcˉ(cmin,cmax)) leads to the same cost of punishment.

Proof

See Appendix.

The intuition behind Proposition 3 can be explained by the following example. Consider two penalty schemes which lead to the same sequence of critical benefits {bn}=(1,1,1,5,5,5,5,7,7,8,12,). Under both penalty schemes a potential offender with benefit b=1 is indifferent between “never committing the crime” and incurring the cost of “committing the crime until he is caught 3 times”. From this indifference condition, it follows that under both penalty scheme the penalization cost of punishing any criminal for the first 3 times are identical. Analogously it can be argued that the aggregate penalization cost for punishing offenders for the fourth to seventh time are identical and so on.

Proposition 3 can be interpreted as follows: If two penalty schemes induce the same crime decisions their implementation costs the same to the society. This directly implies the following statement:

Corollary 1

Any decreasing penalty schemeswithc(s)=cˉleads to the same cost of punishment as the corresponding uniform penalty scheme. [34]

Proposition 4 shows that any penalty scheme can be replaced by an equivalent escalating penalty scheme (in terms of critical benefits and social cost).

Proposition 4

Each sequence of critical benefits{bn}(that corresponds with a criminality levelcˉ(cmin,cmax)) can be implemented by the (weakly) increasing penalty scheme{sn}withsn=bn/p.

The assertion directly follows from the fact that under an escalating penalty scheme the critical benefits are given by bn=psn (see Section 3).

Using Proposition 4, the task of finding the globally optimal penalty scheme can be reduced to the task of determining the optimal scheme within the class of escalating penalty schemes. From Proposition 2 directly follows that the corresponding aggregate and type specific criminality levels are given by c=n=0(1F(psn))t=nqtBn|t and cn=(1F(psn))t=nqtBn|t. That is, in case of escalating penalty schemes, cn is a function of sn only.

It turns out (see Proposition 5) that the optimal penalty scheme has the following structure: Offenders with a high criminal record receive the harshest punishment and offenders with a low criminal record are not punished at all. The number of offences which is “necessary” to receive a positive punishment depends on the criminality target. The maximal sanction is imposed the earlier, the lower the aspired criminality level is. In case of a strict criminality target the punishment for the first detected offence is already positive. (Only) the first positive sanction may differ from smax to assure that the criminality target is exactly fulfilled. In particular it is not optimal to choose a strictly increasing penalty scheme with multiple (i. e. more than three) sanction levels.

Proposition 5: Escalating penalty schemes

  1. Within the class of escalating penalty schemes (including uniform penalty schemes as border case) the cost minimizing one is of the following type:

    n00:sn=0n<n0,0<sn0smax,sn=smaxn>n0.
  2. The penalty schemesIdefined in a) is the unique cost minimizing penalty scheme.

  3. For any given criminality levelc(cmin,cmax)the penalization costs are strictly higher when a uniform penaltysUis used instead of the optimal escalating penalty schemesI.

Proof

See Appendix.

The proceeding of the proof of Proposition 5.a is as follows: We assume that in the initial situation we have an escalating penalty scheme s with c(s)=cˉ which is not of the type described in Proposition 5.a. We define the minimum number of previous convictions a potential offender must have under s to receive a positive punishment by n0, i. e. n0:=min{n|sn>0}. Since s is not of the type given in Proposition 5.a, there must be a number of previous records n˜>n0 for which the corresponding sanction sn˜ is not maximal. We show that in this case it is possible to increase one or more sanction(s) for offenders with high criminal record and instead decrease the number of offenders with rather low criminal record in such a way that the following three conditions are fulfilled: i) The new penalty scheme is of the escalating penalties type, too, ii) the criminality level remains constant and iii) the shift of sanctions decreases penalization cost.

To clarify the driving forces that generate smallest penalization cost for the escalating penalty scheme given by Proposition 5 we conclude this section with an example, which compares the penalization cost of two escalating penalty schemes.

Example 1

Let the distribution function of benefits be given by F:[0,10][0,1],F(b)=b/10, the survival probability by q=9/10, the probability of apprehending offenders by p=1/2 and the maximum available penalty by smax=10. Assume that the initial escalating penalty scheme is given by s=(1,2,10,10,...). This penalty scheme is of the escalating penalty type, but not optimal, since it is possible to decrease s0 and instead increase s1 in such a way that the criminality level remains constant. More formally we consider a second escalating penalty scheme sˆ=(0,sˆ1,10,10,...), which fulfills c(sˆ)=c(s). [35] For s the sequence of critical benefits is given by {bn}={1/2,1,5,5,...} and for sˆ it is given by {bˆn}={0,sˆ1/2,5,5,...}. Hence we have two potential offender types that change their criminality behaviour if the penalty scheme s is replaced by sˆ. Under s potential offenders with benefit b<1/2 do not commit crimes at all, whereas under sˆ they commit crimes until their first conviction. Potential offenders with b[1,sˆ1/2) do only under s proceed with committing crimes after their first conviction (until they receive their second one). Hence we have

c(sˆ)c(s)=c0(sˆ)c0(s)+c1(sˆ)c1(s)=(10)(1F(1/2))t=0qtB(0|t)+(1F(sˆ1/2))(1F(1))t=1qtB(1|t)=F(1/2)t=0qtB(0|t)F(sˆ1/2)F(1)t=1qtB(1|t)=120t=0qtB(0|t)sˆ120110t=1qtB(1|t)=1201.81sˆ1201101.487603306.

Solving c(sˆ)c(s)=0 for sˆ1 reveals sˆ1=3,2ˉ, i. e. sˆ=(0,3.2ˉ,10,10,).

Since the criminality level and hence also harm from crime is identical under both penalty schemes, we only have to show that penalization cost under sˆ is lower than under s.

Therefore we consider the change of penalization cost, which is given by

ΔP=pc0(sˆ)sˆ0c0(s)s0+pc1(sˆ)sˆ1c1(s)s1.

with

c0(sˆ)=(10)t=0qtB(0|t)=1.81818,c0(s)=(10.05)t=0qtB(0|t)=1.72727,
c1(sˆ)=(13.2ˉ/20)t=1qtB(1|t)=1.24793,c1(s)=(10.1)t=1qtB(1|t)=1.33884,

from which follows

ΔP=0.51.8181801.727271+0.51.247933.2ˉ1.338842=0.19191<0.

Hence, the increase of penalization cost for offenders with one previous record is overcompensated by the decrease of penalization cost for offenders without criminal record. The main reason behind this assertion is that the cost of punishment for the additional criminal activity of offenders without criminal record and with b[0,0.5) is lower (in our example it takes the value zero) than the saved cost of punishment for the reduced criminal activity of offenders with one previous sanction and b[1,1.61ˉ). This is so since the (new) penalty for offenders with low criminal record (sˆ0=0) is lower than the (old) penalty for offenders with high criminal record (s1=2) and thus the aggregate penalization effort is reduced. In addition to that we have the following effects. For offenders without criminal record and benefit b>0.5 penalization cost is also reduced (in our example they even vanish) due to the reduced sanction for first time offenders. For offenders with benefit b>1.61ˉ there is an additional increasing cost effect for offenders with one previous sanction. However for our equal distribution function this cost effect is just offset by the decreasing cost effect for offenders without previous sanction and with benefit b>1.61ˉ. [36] Adding all effects reveals that aggregate penalization cost is decreasing if the system switches from the suboptimal escalating penalty scheme to the optimal one.

5 First Stage: Lawmaker’s Choice – The Optimal Level of Crime

In the previous section we have compared penalization cost under different penalty schemes for a given criminality target c(cmin,cmax). Knowing the optimal penalty scheme for each criminality level, the regulator may determine the optimal level of crime after inserting the minimized value of punishment cost for each level of crime in the objective function [1]. We denote the optimal level of crime by cI in order to indicate that the optimal increasing penalty scheme is used to induce the criminality target. Only in the cases cI{cmin,cmax} the penalty scheme is of the uniform type as a border case.

Economic intuition might suggest that cIcU, with cU denoting the level of crime that would be optimal under the constraint of uniform penalties. After all the costs of reducing crime go down if society switches from the latter to the former penalisation scheme. However, Proposition 6 shows that this plausible presumption does not always hold.

Proposition 6: (Penalty type specific) socially optimal level of crimes

The socially optimal level of crime, cI, may be strictly higher than the criminality target, cU, which would be optimal under the restriction of uniform penalties.

Proof

See Example 2 below.

The statement of Proposition 6 is rather counterintuitive. To understand this counterintuition consider a marginal increase of the advised criminality level starting from the criminality level cmin, which is induced if the maximal sanction, smax, is applied irrespective of the number of previous convictions. We compare the welfare effects in the following two scenarios: in scenario 1 the criminality level is induced by the corresponding uniform penalty and in scenario 2 the criminality level is induced by the optimal escalating penalty scheme. In both scenarios the welfare effect is composed of two components: the increase of social harm from crimes and the decrease of enforcement cost. Whereas the increase of social harm coincides in both scenarios the decrease of enforcement cost is larger in the escalating penalty scenario than in the uniform scenario. Therefore it is possible to construct an example, as is done below, where in scenario 1 marginal welfare is strictly decreasing in the crime level, and thus cmin is the equilibrium criminality level, whereas in scenario 2 marginal welfare is positive in a non-empty interval (cmin,c˜) and thus the optimal level of criminality exceeds cmin.

Example 2

We again consider example 1. In the relevant range c[cmin,cmax]=[5,10] the marginal harm function is assumed to be given by H(c)=g(c)+εc with g(c)=2c10 and ε>0. Moreover, as in Section 4, we assume that P equals the identity function.

First note that H,H>0c(cmin,cmax), i. e., the marginal utility of crime prevention is positive and decreasing. Moreover note that g(c)=dPU/dc holds. [37] Thus, for the uniform penalty scheme we obtain dKUdc=H+dPUdc=ε>0 from which follows the optimal criminality level cU=cmin=5, with corresponding penalty scheme s˜=(smax,smax,smax,). Hence, to prove Proposition 6 we have to show that there exists an ε>0 such that within the class of escalating penalty schemes the penalty scheme of border type s˜=(smax,smax,smax,) is not optimal. Therefore it suffices to demonstrate that there exists an ε>0 such that the penalty scheme s=(0,smax,smax,) (with corresponding higher criminality level) leads to lower total cost than s˜. Since under penalty scheme s the corresponding criminality levels cn, n1 coincide with those under s˜ and only differ for potential offenders without criminal record, the aggregate criminality level under s can be determined by

c(s)=cminc0(s˜)+c0(s)=cmin+(F(psmax)F(0))t=0qtB(0|t)=5+0.5t=0qtB(0|t)=5.90909.

Thus a change from s˜ to s increases harm from crimes by

ΔH=cminc(s)H(c)dc=55.90909(2c10+εc)dc=0.82646+4.95868ε

On the other hand penalization costs under s are lower than under s˜.

Since the penalty and the criminality level coincide under both penalty schemes for potential offenders with at least one criminal record the change of penalization costs can be determined by

ΔP=ps0c0(s)p10c0(s˜)=050.5t=00.9tB(0|t)=4.54545

Hence the change of total cost is given by ΔK=ΔH+ΔP=4.95868ε3.71901 which implies that for ε<0.75 we have cI>cU.

To clarify the driving forces in our example in a first step we assume ε=0. In this case we obtain dKU/dc=0. Hence in case of uniform penalization each enforceable level of criminality and thus also cmin is optimal. I.e., if c is increased, the increase of harm is just outweighed by the corresponding decrease of penalization cost H(c)=g(c)=dPU/dc. However under the escalating penalty regime s, penalization cost is lower, from which follows

KI(c(s))=H(c(s)+PI(c(s))<H(c(s)+PU(c(s))=KU(c(s))=KU(cmin)=KI(cmin)

i. e., under escalating penalties cmin can’t be the optimal criminality level.

In the second step, we depart from the assumption ε=0 and discuss how the additional term εc>0 in the marginal harm function affects equilibrium criminality levels. Under uniform penalties ε>0 assures that the social cost function is strictly increasing in c, and cmin is the unique equilibrium. On the other hand if ε is small enough, the advantage of the escalating penalty regime with respect to marginal penalization cost is not overcompensated.

6 Infinite Versus Finite Horizon – The Consequences

In the following we show that if we change the infinite game framework presented above to a two-period and thus finite one, the optimal penalty scheme can be of the escalating as well as of the decreasing type. The result that a decreasing penalty scheme is optimal at least for certain parameter values has been received in many of the finite models presented in the literature. Other papers do not consider decreasing penalty schemes at all and only compare welfare under uniform and escalating penalty schemes. [38]

We restrict our attention to the cost minimizing implementation of a given criminality target. Hence we (only) proceed along the structure of Sections 24 of our main model. Obviously, what will be said for any given target also holds for the socially optimal one.

6.1 The Finite Model – Assumptions

We now assume that each individual lives exactly two periods t{0,1} and is able to commit at most one crime in each period. Since generations are overlapping, the population size of each period is normalized to two. A sanction scheme is given by s=(s0,s1) with 0sismax, where i{0,1} denotes the number of previous convictions. For a given criminality target cˉ the optimization task of the regulator may be written as

(4)minsP(c(s),s):=pc0(s)s0+c1(s)s1s.t.c=c0(s)+c1(s)cˉ.

6.2 Third Stage: Equilibrium Number of Crimes – Individual Choice and Aggregation

6.2.1 The Choice of a Potential Offender

Since in the finite game model there are only two criminal record levels, the sanction scheme is either of the decreasing, the uniform or the escalating type. Under a uniform or escalating penalty scheme the decision rule of a potential defender in the finite game framework is the same as in the infinite scenario: a potential offender commits the crime as long as his benefit exceeds the expected penalty. In case of decreasing penalties (i. e., s1<s0) the choice of a potential offender in period 1 fundamentally differs from the general structure of the infinite game framework. In period 1 the decision depends not only on the benefit but also on whether the offender has been convicted in period 0. A potential offender with (without) criminal record commits the crime if b>ps1(b>ps0) holds. In period 0 the decision is essentially the same as in the infinite game: a potential offender commits the crime if and only if b>p(s0(bps1)) holds, which is equivalent to b>ps with s:=(s0+ps1)/(1+p).

Summarizing, the optimal strategy under a decreasing penalty scheme is given by:

  1. bps: No offences at all.

  2. ps<bps0: Offence in period 0. Offence in period 1, if and only if the offender has been convicted in period 0.

  3. b>ps0: Offence in each period.

6.2.2 Aggregation

In Proposition 7, the following notation is used: cU,(D,I) denotes the aggregate equilibrium criminality level for a given uniform (decreasing, increasing) penalty scheme and cnU,(D,I) denotes the criminality level of potential offenders with n previous records, n{0,1}.

Proposition 7: Equilibrium criminality levels

  1. Under a uniform penalty scheme the aggregate and criminal record specific equilibrium criminality levels are given bycU=21F(psˉ)andc0U=c1U=1F(psˉ).

  2. Under a decreasing penalty scheme the aggregate and criminal record specific equilibrium criminality levels are given bycD=(1+p)(1F(ps))+(1p)(1F(ps0)), c0D=(1F(ps))+(1p)(1F(ps0))andc1D=p(1F(ps))withs=s0+ps1/(1+p).

  3. Under an increasing penalty scheme the aggregate and criminal record specific equilibrium criminality levels are given bycI=(2p)(1F(ps0))+p(1F(ps1)), c0I=(2p)(1F(ps0))andc1I=p(1F(ps1)).

Proofs

See Appendix.

6.3 Second Stage: Lawmaker’s Choice – The Optimal Time Profile of Sanctions

Having determined the equilibrium criminality levels, we now consider the optimization task eq. [4] of the regulator.

We proceed as follows. At first, the cost of punishment is derived under the assumption of uniform penalties. This serves as a benchmark against which the cost of punishment under decreasing and increasing penalties are measured. It is shown that the cost of punishment may be reduced by choosing an appropriate escalating penalty scheme instead of a uniform one but also by choosing a decreasing penalty scheme instead of the uniform one.

We consider an example to demonstrate that it depends on the parameter values whether a decreasing or increasing penalty scheme is optimal for a given criminality target.

6.3.1 Uniform Penalties

Lemma 1: Uniform penalty scheme

  1. The uniform penalty that exactly enforces the criminality levelcˉequalssˉ=F11cˉ/2/p.

  2. The corresponding cost of punishment isPU=psˉ21F(psˉ)=cˉF11cˉ/2.

Proof

See Appendix.

As for the infinite game the derivative dPU/dsˉ=2p1F(psˉ)sˉpf(psˉ) becomes negative if and only if psˉh(psˉ)>1 holds. Hence, to assure that a reduction of crime raises punishment cost, we again make Assumption 1. In the remaining part of Section 6 we assume that c¯(cmin,cmax) with cmin=2(1F(psmax)) and cmax=2 holds.

6.3.2 Escalating and Decreasing Penalties

In the following we show that (as in the infinite game framework) starting from s0=s1=sˉ penalization cost may be reduced by increasing s1 (and correspondingly decreasing s0). That means penalization cost may be reduced by choosing an escalating penalty scheme.

However, we also show that (in contrast to the infinite game framework) starting from s0=s1=sˉ penalization cost may be reduced by increasing s0 (and correspondingly decreasing s1). This means that penalization cost may also be reduced by choosing a decreasing penalty scheme. Proposition 8 discusses the structure of the overall optimal penalty scheme.

Proposition 8

  1. The uniform penalty scheme leads to the highest penalization cost.

  2. From the set of escalating penalty schemes the cost minimizing one is of the border type withs0=0ors1=smax.

  3. From the set of decreasing penalty schemes the cost minimizing one is of the border type withs1=0ors0=smax.

  4. The penalization cost under the optimal decreasing penalty scheme may be higher than, lower than or equal to the penalization cost under the optimal escalating penalty scheme.

Proof

See Appendix.

The fundamental divergence between the finite and infinite game framework is given by Proposition 8.c. Starting from a uniform penalty scheme penalization cost can not only be reduced by shifting the penalty from first time to repeat offenders but also by shifting the penalty from repeat to first time offenders in such a way that the crime level remains constant.

The driving forces behind this result are illustrated by the following example, which demonstrates that in the two period framework the (overall) optimal penalty scheme may be of the decreasing as well as of the escalating penalty type.

Example 3

Consider the uniform distribution function F:[0,a][0,1], F(b)=b/a and assume smax=a2p.

According to Proposition 7 the equilibrium criminality levels are given by

  1. cU=21psˉ/a and c0U=c1U=1psˉ/a under the uniform penalty scheme (s0,s1)=(sˉ,sˉ).

  2. cD=(1+p)(1ps/a)+(1p)(1ps0/a),c0D=(1ps/a)+(1p)(1ps0/a) and c1D=p(1ps/a) with s=s0+ps1/(1+p) under a decreasing penalty scheme and

  3. cI=(2p)(1ps0/a)+p(1ps1/a), c0I=(2p)(1ps0/a) and c1I=p(1ps1/a) under an increasing penalty scheme.

According to Lemma 1, the uniform penalty that exactly enforces the criminality level cˉ is given by sˉ=a1cˉ/2/p. The corresponding cost of punishment is PU=psˉ21psˉ/a.

For a=10 and alternative values of p (column 2) and cˉ (column 3) Table 1 displays the maximal sanction (column 4), the cost minimizing decreasing (column 5) and escalating (column 6) penalty scheme, as well as the uniform penalty scheme (column 7). Columns 8 to 10 display the corresponding penalization costs and columns 11 and 12 show the record specific criminality levels under the optimal decreasing and escalating penalty scheme.

Table 1:

(Type) specific optimal sanction schemes.

(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
pcˉsmaxsDsIsˉPIPDPUcIcD
(I)0.51.251010,06.6ˉ,107.54.5834.5834.688(1,0.25)0.92,0.33
(II)0.51.5106.6ˉ,03.3ˉ,1053.3333.7043.75(1.25,0.25)(1.11,0.39)
(III)0.51.1252020,017.143,2017.54.9114.8754.922(1,0.125)(0.98,0.15)

The example demonstrates that for each parameter constellation the penalization costs take the highest value under the uniform penalty scheme. Moreover the number of first time offenders is lower than under the other types of penalty schemes. [39] The reason is as follows: Under an escalating as well as under a decreasing penalty scheme only a subgroup of the offenders that commit a crime in t=0 do also commit a crime in t=1. [40] Whereas under an increasing penalty scheme the critical benefit level of period 1 increases in case of conviction in period 0, under a decreasing penalty scheme the critical benefit level of period 1 increases in case of non-conviction in period 0. Whether an increasing or decreasing penalty scheme leads to lower penalization cost depends on the relative strength of the following two countervailing effects. i) From s0I<sˉ<s0D follows that under an escalating penalty scheme penalization cost for a crime committed by a member of the larger group of offenders without criminal history may be reduced. This effect works in favor of the escalating penalty scheme being optimal. ii) Since not every offender of t=0 reaches the “criminal record“-status for the next period we have s0D+s1D<s0I+s1I. That is, aggregate sanctions for offenders that commit a crime in both periods are lower under the decreasing penalty scheme. This effect works in favor of decreasing sanctions being optimal. As Table 1 shows, it depends on the parameter values which effect is stronger. For parameter constellation (I) the two effects just offset each other. Whereas in case (II) the optimal penalty scheme is of the escalating type, in case (III) the decreasing penalty scheme leads to lowest penalization cost.

7 Summary and Outlook

We have analysed positive and normative issues of criminal law in an intertemporal framework. In contrast to the earlier literature our analysis compared an infinite game setting with a finite one. Criminals have been assumed to maximize their net benefits from crime and the lawmaker to minimize social cost. The decision variables have been the number of crimes committed, and the time profile of punishments.

For the infinite game framework it has been shown that for any given criminality level penalization costs under decreasing and uniform penalty schemes are identical. However penalization cost may be reduced by choosing an escalating penalty scheme. We have shown that the optimal penalty scheme shifts the punishment as far into the future as possible. We further have shown that the optimal criminality level under escalating penalties may exceed the optimal criminality level under uniform penalties. Moreover we have shown that in the corresponding finite game framework the optimal penalty scheme may be of the escalating as well as of the decreasing type.

Our basic infinite game approach may be generalized in several directions. One is to endogenize the apprehension probability. Following Polinsky and Shavell (1998) this can be done by adding an additional term A(p) (with A>0) which captures the costs of apprehending offenders. If we additionally assume that neither the choice of p=1 nor the choice of p=0 is socially optimal, [41] it can easily be proven by contradiction that also in this framework the optimal escalating penalty scheme still dominates the uniform and decreasing ones. Once the optimal level of p is determined, finding the optimal penalty levels boils down to the problem that is solved in Sections 4 and 5 of this paper.

Examples for future extensions allow for age-dependent benefits, risk averse individuals or more general distribution functions of offenders’ benefits. Moreover, all kinds of “criminal hysteresis” (Fajnzylber, Lederman, and Loayza 2002, 1328) are particularly suggestive to be integrated into our intertemporal model. For example the criminal history of a certain offender might affect his cost of carrying out a particular criminal activity, the probability of apprehension and other determinants of the decision for crime.

Acknowledgement

We gratefully acknowledge the comments of two anonymous referees who considerably helped to improve the paper.

Appendix

Proof of Proposition 1

  1. bn+1=max{bˉm,0mn+1}max{bˉm,0mn}=bn.

  2. E.g., for any given decreasing penalty scheme {sn} there is a corresponding uniform penalty scheme given by sn=sˉ with sˉ=s=(1q)t=0qtk=0tBk|tsk. (q.e.d.)

Proof of Proposition 2

The criminal record specific criminality levels cn of potential offenders with n previous records can be derived as follows: We consider an offender who is born in t0, commits a crime in each period t0,...,t0+t1 since his benefit exceeds bn and is still alive in period t0+t. The probability that this offender has n previous convictions at the beginning of period t0+t is given by B(n|t) (see Section 3.1). Thus the criminality level of potential offenders with n previous convictions is given by cn=(1F(bn))t=nqtBn|t. Finally note, that c=n=0cn holds. (q.e.d.)

Proof of Proposition 3

Let 1m denote the number of different elements in the sequence {bn}. [42] Further define the sequence of critical indices as follows: {jk}k=1m with j1:=maxn{bn=b0} and jk+1:=maxn{bn=bjk+1}. [43] Further in case of m< define bjm+1=. Potential offenders with benefit bbj1 do not commit a crime at all. Potential offenders with benefit bjk<bbjk+1 commit crimes until they have jk+1 previous records. The penalization costs are unambigously determined by the critical benefits as follows: A potential offender with benefit bj1 is indifferent between “never committing the crime” and incurring the cost of “committing the crime until he is caught j1+1 times”. From this indifference condition, we can deduce the penalization cost of punishing any criminal for the first j1+1 times as =0j1t=qtpB|tbj1. Analogously a potential offender with benefit bjk+1 will commit the crime until he is caught jk+1 times. Once he gets to the criminal record level of jk+1 he is indifferent between “stopping committing crimes from now on” and “continuing committing crimes” until he is caught jk+1jk more times and gets to the record level jk+1+1. From this second indifference condition follows that the penalization costs of punishing a criminal for the jk+1jk more times must equal =jk+1jk+1t=qtpB|tbjk+1. In particular, they are identical for any penalty scheme that induces the same sequence of critical benefits. This proves the assertion. (q.e.d.)

Proof of Proposition 5

  1. We assume that in the initial situation we have an escalating penalty scheme s with c(s)=cˉ and n0=min{n|sn>0}. We show that the corresponding penalization cost may be decreased by marginal changes of the sanction levels if s is not of the type determined by Proposition 5.a. If s is not of this type it belongs to one of the following classes of escalating penalty schemes:

    Typea:n1>n0:sn1<sn1+1.
    Typeb:sn=s˜<smaxn>n0.
    1. We show for penalty schemes of type a that penalization cost may be reduced by a reduction of sn0 by ε>0 and a corresponding increase of sn1 in such a way that the crime level remains constant: cn0(sn0ε)+cn1(sn1+δ)=cn0(sn0)+cn1(sn1).

      The term ε must be small enough so that s0ε0 and s1+δsmax holds.

      Total differentiation of the equality above reveals that dδdε= cn0(sn0-ε)cn1(sn1+δ).

      Now we consider the change in the objective function.

      Differentiation of the penalization costs with respect to ε and δ yields

      dP=p(-cn0(sn0-ε)-(sn0-ε)cn0(sn0-ε))dε+p(cn1(sn1+δ)+(sn1+δ)cn1(sn1+δ))dδ.

      Using the value of dδ/dε this equals

      p(-cn0(sn0-ε)-(sn0-ε)cn0(sn0-ε)+cn1(sn1+δ)cn0(sn0-ε)cn1(sn1+δ)+(sn1+δ)cn0(sn0-ε))dε
      =-pcn0(sn0-ε)(cn0(sn0-ε)cn0(sn0-ε)+(sn0-ε)-cn1(sn1+δ)cn1(sn1+δ)-(sn1+δ))dε.

      According to Proposition 2 we have cn=(1F(psn))t=nqtBn|t and cn=-pf(psn)t=nqtB(n|t).

      Hence we obtain

      dP=pcn0(sn0ε)1p1F(p(sn0ε))f(p(sn0ε))+1p1F(p(sn1+δ))f(p(sn1+δ))+(sn0ε)(sn1+δ)dε
      =-cn0(sn0-ε)>0(1h(p(sn1+δ))-1h(p(sn0-ε))<0+p((sn0-ε)-(sn1+δ))<0)dε<0,

      since the hazard rate has been assumed to be increasing.

    2. The general structure of type b penalty schemes is given by s=(0,...,0,sn0,s˜,s˜,...) with 0<sn0s˜. We show that penalization cost may be reduced by a reduction of sn0 by ε>0 and a corresponding increase of sn=s˜n>n0 in such a way that the crime level remains constant:

      cn0(sn0ε)+n=n0+1cn(s˜+δ)=cn0(sn0)+n=n0+1cn(s˜).

      Total differentiation reveals that dδdε=cn0(sn0ε)n=n0+1cn(s˜+δ).

      Differentiation of the penalization costs P=p(sn0-ε)cn0(sn0-ε)+p(s˜+δ)Σn=n0+1cn(s˜+δ) with respect to ε and δ yields

      =pcn0(sn0ε)(1F(p(sn0ε))pf(p(sn0ε))+(sn0ε)n=n0+1[[1F(p(s˜+δ))]t=nqtB(n|t)]n=n0+1[pf(p(s˜+δ))t=nqtB(n|t)](s˜+δ))dε
      =cn0(sn0ε)([1F(p(s˜+δ))]n=n0+1[t=nqtB(n|t)]f(p(s˜+δ))n=n0+1[t=nqtB(n|t)]1F(p(sn0-ε))f(p(sn0-ε))+p((sn0ε)(s˜+δ)))dε
      =-cn0(sn0-ε)(1-F(p(s˜+δ))f(p(s˜+δ))-1-F(p(sn0-ε))f(p(sn0-ε))+p((sn0-ε)-(s˜+δ)))dε
      =-cn0(sn0-ε)>0(1h(p(s˜+δ))-1h(p(sn0-ε))<0+p((sn0-ε)-(s˜+δ))<0)dε<0

      Hence a marginal decrease of sn0 and a corresponding increase of snn>n0 decreases penalization cost.

  2. The sequence of critical benefits of the penalty scheme sI is given by {bn} with bn=0n<n0, bn0=ps0 and bn=smaxn>n0. Since this sequence contains only uniform regions of the border type bn=0 or bn=smax, these regions cannot be induced by decreasing regions of the penalty scheme.

  3. Follows directly from Proposition 5.b. (q.e.d.)

Proof of Proposition 7

We prove exemplarily Proposition 7.b. The number of offenders without criminal record (c0D) is composed of (1F(ps)) young offenders and (1p)(1F(ps0)) old offenders, which have not been convicted in period 0. The number of offenders with criminal record (c1D) follows from the fact that each potential offender with benefit b>ps commits a crime in period 0, and also in period 1, provided that he received the status of a repeat offender. Finally cD follows from cD=c0D+c1D. (q.e.d.)

Proof of Lemma 1

Solving cU=cˉ for sˉ and inserting the results in PU=pcˉsˉ proves the assertions. (q.e.d.)

Proof of Proposition 8

  1. Follows from the proof of Proposition 8. b and c.

  2. We assume that in the initial situation we have a uniform or increasing penalty scheme s=(s0,s1) with c(s)=cˉ and with 0<s0sˉs1<smax. We show that the corresponding penalization cost may be reduced by a reduction of s0 by ε>0 and a corresponding increase of s1 in such a way that the crime level remains constant. c0(s0ε)+c1(s1+δ)=c0(s0)+c1(s1). The term ε must be small enough so that s0ε0 and s1+δsmax holds.

Total differentiation of the equality above reveals that dδdε=c0(s0ε)c1(s1+δ).

Now we consider the change of penalization cost.

Differentiation of the penalization cost with respect to ε and δ yields

dP=pc0(s0ε)(s0ε)c0(s0ε)dε+pc1(s1+δ)+(s1+δ)c1(s1+δ)dδ.

Using the value of dδ/dε this equals

pc0(s0ε)(s0ε)c0(s0ε)+c1(s1+δ)c0(s0ε)c1(s1+δ)+(s1+δ)c0(s0ε)dε
=pc0(s0ε)c0(s0ε)c0(s0ε)+(s0ε)c1(s1+δ)c1(s1+δ)(s1+δ)dε.

According to Proposition 7.c we have c0=(2p)(1F(p(s0ε))), c1=p(1F(p(s1+δ))) and c0=(2p)pf(p(s0ε)), c1=p2f(p(s1+δ)).

Hence we obtain

dP=pc0(s0ε)1F(p(s0ε))pf(p(s0ε))+1F(p(s1+δ))pf(p(s1+δ))+(s0ε)(s1+δ)dε
=c0(s0ε)>01h(p(s1+δ))1h(p(s0ε))<0+p(s0ε)(s1+δ)<0dε<0

since the hazard rate has been assumed to be increasing.

  1. Now we assume that in the initial situation we have a uniform or decreasing penalty scheme s=(s0,s1) with c(s)=c and with 0<s1ss0<smax.

We show that the corresponding penalization cost may be reduced by a marginal increase of s0 by ε>0 and a corresponding reduction of s1 in such a way that the crime level remains constant. c0(s0+ε)+c1(s1δ)=c0(s0)+c1(s1). The term ε must be small enough so that s0+εsmax and s1δ0 holds. Total differentiation of the equality above reveals that dδdε=c0(s0+ε)c1(s1δ).

Now we consider the change of penalization cost.

Differentiation of the penalization cost with respect to ε and δ yields

dP=pc0(s0+ε)+(s0+ε)c0(s0+ε)dε+pc1(s1δ)(s1δ)c1(s1δ)dδ.

Using the value of dδ/dε this equals

pc0(s0+ε)+(s0+ε)c0(s0+ε)c1(s1δ)c0(s0+ε)c1(s1δ)(s1δ)c0(s0+ε)dε
=pc0(s0+ε)c0(s0+ε)c0(s0+ε)+(s0+ε)c1(s1δ)c1(s1δ)(s1δ)dε

According to Proposition 7.c we obtain

dP=pc0(s0+ε)(1F(p(s1δ)))pf(p(s1δ))(1F(p(s0+ε)))pf(p(s0+ε))+(s0+ε)(s1δ)dε=c0(s0+ε)<01h(p(s1δ))1h(p(s0+ε))>0+p((s0+ε)(s1δ))>0dε<0.

(d) See Example 3. (q.e.d).

References

Abrams, D. 2012. “Estimating the Deterrent Effect of Incarceration Using Sentencing Enhancements.” American Economic Journal: Applied Economics 4:32–56.10.1257/app.4.4.32Search in Google Scholar

Baker, M. J., and N. J. Westelius. 2013. “Crime, Expectations, and the Deterrence Hypothesis.” In Research Handbook on Economic Models of Law, edited by T. J. Miceli, and M. J. Baker, 235–80. Edward Elgar: Cheltenham.10.4337/9781781000151.00017Search in Google Scholar

Barbarino, A., and G. Mastrobuoni. 2014. “The Incapacitation Effect of Incarceration: Evidence From Several Italian Collective Pardons.” American Economic Journal: Economic Policy 6:1–37.10.1257/pol.6.1.1Search in Google Scholar

Becker, G. 1968. “Crime and Punishment: An Economic Approach.” Journal of Political Economy 76:169–217.10.1086/259394Search in Google Scholar

Binmore, K. 2007. Playing for Real – a Text on Game Theory. Oxford and New York: Oxford University Press.10.1093/acprof:oso/9780195300574.001.0001Search in Google Scholar

Burnovski, M., and Z. Safra. 1994. “Deterrence Effects of Sequential Punishment Policies: Should Repeat Offenders Be More Severely Punished?” International Review of Law and Economics 14:341–50.10.1016/0144-8188(94)90048-5Search in Google Scholar

Chu, C. Y. C., S. -C. Hu, and T. -Y. Huang. 2000. “Punishing Repeat Offenders More Severly.” International Review of Law and Economics 20:127–40.10.1016/S0144-8188(00)00024-7Search in Google Scholar

Cooter, R., and T. Ulen.2012. Law and Economics, 6th international ed. Boston: Pearson Education International.Search in Google Scholar

Dana, D. A.. 2001. “Rethinking the Puzzle of Escalating Penalties for Repeat Offenders.” The Yale Law Journal 110:733–83.10.2307/797607Search in Google Scholar

Dominguez Alvarez, R., and M. L. Loureiro. 2012. “Stigma, Ex-Convicts and Labour Markets.” German Economic Review 13:470–86.10.1111/j.1468-0475.2012.00580.xSearch in Google Scholar

Ehrlich, I. 1973. “Participation in Illegitimate Activities: A Theoretical and Empirical Investigation.” Journal of Political Economy 81:521–65.10.1086/260058Search in Google Scholar

Ehrlich, I. 1975. “The Deterrent Effect of Capital Punishment: A Question of Life and Death.” American Economic Review 65:397–417.10.3386/w0018Search in Google Scholar

Emons, W. 2003. “A Note on the Optimal Punishment for Repeat Offenders.” International Review of Law and Economics 23:253–9.10.1016/j.irle.2003.09.002Search in Google Scholar

Emons, W. 2007. “Escalating Penalties for Repeat Offenders.” International Review of Law and Economics 27:170–8.10.1016/j.irle.2007.06.005Search in Google Scholar

Endres, A. 2011. Environmental Economics – Theory and Policy. Cambridge and New York: Cambridge University Press.Search in Google Scholar

Endres, A., and B. Rundshagen. 2012. “Escalating Penalties – a Supergame Approach.” Economics of Governance 13:29–49.10.1007/s10101-011-0101-7Search in Google Scholar

Fajnzylber, P., D. Lederman, and N. Loayza. 2002. “What Causes Violent Crime?” European Economic Review 46:1323–57.10.1016/S0014-2921(01)00096-4Search in Google Scholar

Friehe, T. 2009. “Escalating Penalties for Repeat Offenders: A Note on the Role of Information.” Journal of Economics 97:165–83.10.1007/s00712-009-0058-ySearch in Google Scholar

Hylton, K. N. 2005. “The Theory of Penalties and the Economics of Criminal Law.” Review of Law and Economics 1:175–201.10.2202/1555-5879.1024Search in Google Scholar

Kaplow, L. 1992. “The Optimal Probability and Magnitude of Fines for Acts That Definitely Are Undesirable.” International Review of Law and Economics 12:3–11.10.3386/w3008Search in Google Scholar

Kleiman, M. A. R. 2009. When Brute Force Fails – How to Have Less Crime and Less Punishment. Princeton and Oxford: Princeton University Press.10.1515/9781400831265Search in Google Scholar

Luce, R. D., and H. Raiffa. 1957. Games and Decisions. New York: John Wiley.Search in Google Scholar

Mailath, G. J., and L. Samuelson. 2006. Repeated Games and Reputations – Long-Run Relationships. Oxford and New York: Oxford University Press.10.1093/acprof:oso/9780195300796.001.0001Search in Google Scholar

Miceli, T. J. 2013. “Escalating Penalties for Repeat Offenders: Why Are They so Hard to Explain?” Journal of Institutional and Theoretical Economics 169:587–604.10.1628/093245613X671193Search in Google Scholar

Miceli, T. J., and C. Bucci. 2005. “A Simple Theory of Increasing Penalties for Repeat Offenders.” Review of Law and Economics 1:71–80.10.2202/1555-5879.1011Search in Google Scholar

Motchenkova. 2014. “ Cost Minimizing Sequential Punishment Policies for Repeat Offenders.” Applied Economics Letters 21:360–5.10.1080/13504851.2013.861579Search in Google Scholar

Mungan, M. C. 2010. “Repeat Offenders: If They Learn, We Punish Them More Severely.” International Review of Law and Economics 30:173–7.10.1016/j.irle.2009.11.002Search in Google Scholar

Mungan, M. C. 2014. “A Behavioural Justification for Escalating Punishment Schemes.” International Review of Law and Economics 37:189–97.10.1016/j.irle.2013.10.002Search in Google Scholar

Persson, M., and C.-H. Siven. 2006. “Incentive and Incarceration Effects in a General Equilibrium Model of Crime.” Journal of Economic Behavior & Organization 59:214–29.10.1016/j.jebo.2004.03.011Search in Google Scholar

Polinsky, A. M., and D. L. Rubinfeld. 1991. “A Model of Optimal Fines for Repeat Offenders.” Journal of Public Economics 46:291–306.10.3386/w3739Search in Google Scholar

Polinsky, A. M., and S. Shavell. 1998. “On Offence History and the Theory of Deterrence.” International Review of Law and Economics 18:305–24.10.1016/S0144-8188(98)00009-XSearch in Google Scholar

Polinsky, A. M., and S. Shavell. 2000. “The Economic Theory of Public Enforcement of Law.” Journal of Economic Literature 38:45–76.10.3386/w6993Search in Google Scholar

Polinsky, A. M., and S. Shavell. 2007. “The Theory of Public Enforcement of Law.” In Handbook of Law and Economics, Volume 1, edited by A. M. Polinsky, and S. Shavell, 403–54. Elsevier: Amsterdam.10.1016/S1574-0730(07)01006-7Search in Google Scholar

Rubinstein, A. 1979. “An Optimal Conviction Policy for Offenses That May Have Been Committed by Accident.” In Applied Game Theory, edited by S. Brams, A. Schotter, and G. Schwödiauer, 406–13. Würzburg: Physica-Verlag.10.1007/978-3-662-41501-6_26Search in Google Scholar

Rubinstein, A. 1980. “On an Anomaly of the Deterrent Effect of Punishment.” Economics Letters 6:89–94.10.1016/0165-1765(80)90062-2Search in Google Scholar

Rubinstein, A. 1998. Modeling Bounded Rationality. Cambridge, MA and London: The MIT Press.10.7551/mitpress/4702.001.0001Search in Google Scholar

Selten, R. 1978. “The Chain Store Paradox.” Theory and Decision 9:127–59.10.1007/978-94-015-7774-8_2Search in Google Scholar

Shavell, S. 1992. “A Note on Marginal Deterrence.” International Review of Law and Economics 12:345–55.10.1016/0144-8188(92)90013-HSearch in Google Scholar

Shavell, S. 2004. Foundations of Economic Analysis of Law. Cambridge, MA and London: the Belknap Press of Harvard University Press.10.4159/9780674043497Search in Google Scholar

Wilde, L. L. 1992. “Criminal Choice, Nonmonetary Sanctions, and Marginal Deterrence: A Normative Analysis.” International Review of Law and Economics 12:333–44.10.1016/0144-8188(92)90012-GSearch in Google Scholar

Wilhite, A., and W. D. Allen. 2008. “Crime, Protection, and Incarceration.” Journal of Economic Behavior & Organization 67:481–94.10.1016/j.jebo.2006.10.013Search in Google Scholar

Published Online: 2016-1-12
Published in Print: 2016-6-1

©2016 by De Gruyter

Downloaded on 22.3.2023 from https://www.degruyter.com/document/doi/10.1515/bejte-2014-0098/html
Scroll Up Arrow