An introduction to the bell curve by Dr. Venstad

JMPicard · May 6, 2018, 8:46pm

The bell curve – an example

Today’s lecture is a minimal introduction to the normal distribution – often called the “bell curve” – and will be based around a simple example. Prerequisites are only the ability to add integers, and an open mind.

The lecture is given by Dr. Venstad, who has previously also taught various advanced applied mathematics courses (computational physics, algorithms and datastructures, etc.) at the Norwegian University of Science and Technology.

The normal distribution – what and why?

The normal distribution has its name because it is just that – a very normal distribution. It is typically the result of adding many indenpendently random variables, which is a surprisingly common both in physics and mathematics.

As an example, consider tossing a coin an even number of times: every time it lands heads up, you score one point, and every time it lands tails up, you lose one point. Your final score, after 2N tosses, is then a prime example of something that follows the normal distribution; the most likely score is … take a guess! … 0! Even so, with many throws, the chances of scoring exaclty 0, are minute. Scoring 1 is impossible, but scoring 2 is almost as likely as scoring 0. Scoring 4 is slightly less likely again, and so on, with increasing skew. The least likely result is a score of plus or minus 2N. It is, perhaps, not obvious why different scores have different likelihoods, but this is so simply because scoring, say, 2N, requires you to toss heads every time, while scoring 0 can be done in any number of ways, as long as the number of heads and tails are equal. Thus there are many more ways of scoring 0, each as likely as the other, or as likely as scoring 2N, and the total chance of scoring 0 is the sum of the chances for each of these. Shown below is the probability for scoring each of the possible results when performing 32 tosses.

Because of its shape, the graph of the normal distribution is often called “the bell curve”.

A simple example – taking an accurate shot.

Consider someone aiming to take a shot. They are aiming at a specific point; alas, various disturbances contribute random inaccuracy to the shot: the miniscule movement of each of hundreds of muscles in the body of the shooter, gusts of wind blowing in different directions at different points along the trajectory of the projectile, etc… The sum is inaccuracy in the horizontal, as well as the vertical, direction. One of the pleasant properties of the normal distribution is that smearing the initial point aimed at first in the x-direction, according to a normal distribution, and then in the y-direction, according to the same normal distribution, results in a two-dimensional distribution which is perfectly rotationally symmetric, like shown below. (A “Gaussian distribution” is yet another name.)

To make it feasible to do any computation by hand, we need to discretise this distribution. Let’s go with a grid as shown below. The point the shooter is aiming at is at the centre of the three green hexes.

WT34G

Now let’s assign a range of numbers to each cell, such that we can simulate a shot simply by drawing a random number and checking which cell has that number contained in its range. The size of the range in each cell is then proportial to the probability of hitting that cell. Below is the same grid with ranges assigned such that the probability of hitting each cell matches the normal distribution shown two images earlier, more or less.

WT34G

Although this grid is certainly crude, compared to the blurry one shown above, it still has roughly the same density distribution. Note that there is nothing outside the red cells; this is the same as saying no shot will ever land outside of those – a truncation of the normal distribution.

Time for excercises!

The answer to each exercise is shown on the line below, but is blurred; click to sharpen it.
Exercise 1a: What is the probability of hitting the upper green cell?

It contains 10 out of 123 numbers, so 10/123.

Exercise 1b: What is the probability of hitting any green cell?

The union of the ranges in the green cells has size 30, so 30/123.

Good. Let’s call the green shots highly accurate, as they are, after all, closest to the centre of aim. Now let’s examine the probability of taking a highly _in_accurate shot, i.e., hitting a red cell:

Exercise 2a: What is the probability of hitting the upper red cell?

It contains 2 out of 123 numbers, so 2/123.

Exercise 2b: What is the probability of hitting any red cell?

The union of the ranges in the red cells has size 30, so 30/123 again.

What, highly accurate and highly inaccurate shots are equally likely? Cool, that’s exactly what we want! Also, it matches what is written in a certain, fabled, .pdf of knowledge, written by what is probably one of the elder gods.

So, highly accurate and highly inaccurate shots are equally likely, but this tome of ancient lore mentioned
that mediocre shots are actually the most likely, because of some “bell curve”. We have a bell curve now,
haven’t we, so let’s see if we can confirm this last piece of information, passed down by the gods, as well:

Exercise 3a: What is the probability of hitting the upper yellow cell?

It contains 7 out of 123 numbers, so 7/123.

Exercise 3b: What is the probability of hitting any yellow cell?

The union of the ranges in the yellow cells has size 63, so 63/123.

Yes, hitting a yellow cell, i.e., a cell which lies in the middle between the centre and the outer boundary, is really the most likely of all outcomes. Fantastic!

Finally, it should be easy to convince ourselves that half the shots will land on the outside of the imagined yellow circle whose radius is half that of the truncation boundary, i.e., the outside of the red cells. We already know green and red is equally likely, and this half-radius circle cuts the yellow cells right at the middle, so it’s fair to say this criterion is satisfied as well.

But we started out with a bell curve which was centred between the green hexes … what is this magic!?

The magic is simply that there are many more yellow cells than green ones, and so it is much more likely to hit a yellow cell, even though each green cell is more likely to hit than each yellow. No more, no less.

In all fairness …

There is also a second bell curve here, but it’s not found directly in the images above; rather, if we make a plot with accuracy along the x-axis and probability of taking a shot of a given accuracy along the y-axis, this plot contains the three points (high, 30/123), (medium, 63/123), (low, 30/123). This is also acceptable as a bell curve.

Furthermore, this is the curve we would get if we plotted the probability of taking a given shot as a function of radius, while collapsing the angle portion of the polar coordinates we alternatively could have used to express our grid.

It should be noted that while both these curves – a slice along the x-axis or the one with the radius – can be made simultaneously normal in this example, this is not true in the continuous case; however, they can always be made simultaneously close to normal, and so the fear that has pervaded this forum lately, of a ring of extreme likelihoot precisely at a certain yellow circle, should be no more than phantoms: if this implementation has indeed been chosen, it is clearly in error, and will be remedied. I think it has not. (Some will still claim this, based on experience. However, human experience and statistics do not blend well, as discussed in, e.g., this fantastic book.)

Concluding remarks.

What we have seen today is that it’s perfectly possible for each of the following statements to be simultaneously true:

There is a bell curve with peak at the centre of aim.
There is a bell curve with peak at mediocre accuracy, i.e., at the half-radius of the cut-off.
Half our shots land inside the half-radius circle. (This circle may even be computed to fit this criterion?)
Highly accurate and highly inaccurate shots are equally likely.

Thanks for reading, and I hope you learned something!

Respectful questions will be answered.

WarrinGreyshield · May 6, 2018, 10:40pm

Give up. Those of us who understand probability do.

Those who insist on “perfect” probability plots in a very small number of samples, don’t.

Appreciate the effort at education, however unlikely to be successful.

{Edit - on reread, I realized the above appears really harsh, and directed at the poster, that was not my intent. The above post was actually potentially really helpful for people that don’t understand bell curves, and a nice visual for those who already have a good sense of them. I just don’t believe it will do anything to help resolve the 60+ comment thread about bell curves and optimal target placement, which appears to be equal parts people who understand probabilities, people who have fired enough firearms in reality to expect shot groups - not bullseyes, and people who want to use the outcomes of their last several shots to predict the outcome of their next shot. The above won’t help the latter, they’ve not internalized the concept of an independent event and the need for very, very large data sets before recorded events closely resemble predicted distributions.}

My apologies to JMPicard for any umbrage my post may have generated.

Potkeny · May 7, 2018, 2:54pm

My mind has built-in autocorrect, replaced the name with “Dr. Vahlen”.

Sorry

JMPicard · May 7, 2018, 7:50pm

No offense taken, my friend. And, sadly, you’re probably right

Aknazer · May 8, 2018, 4:01am

Here is the statement from Snapshot themselves in regards to their use of a Bell Curve for accuracy and can be found in the PDF that came with the build.

Red circle: All your shots will be randomly distributed in this circle (with a bell curve
distribution where extremes of inaccuracy and accuracy are less likely).

This means that they’re using a curve which prioritizes mediocre accuracy and thus it makes all guns inherently less accurate than one would initially think. This also means that there’s an invisible “band” for where shots are most likely to land but yet not displayed to the player (the yellow ring would cut this band in half assuming they’re using a symmetrical curve; especially since they state accurate/inaccurate shots are equally likely).

So you are correct in stating that shots of accuracy and inaccuracy are equally likely, but both of these are less likely than mediocre accuracy along the yellow ring. Yet you’re claiming it isn’t likely for them to be using the mediocre curve even though their own PDF and UV have stated as being the case. What we don’t know is how much this mediocre band is favored over the higher/lower accuracy shots and that is the concern of many. A 5% difference wouldn’t be that much, a 30%+ difference would be great. Regardless it changes up how a player would want to aim to optimize their shots and this info is hidden while the graphic is in such a manner than a standard player would reasonably assume that they would want to fill in as much of the yellow circle as possible instead of potentially adjusting the shot to catch more of the not-displayed mediocre band that the yellow ring likely cuts in half.

Your post is good info for those that don’t understand how a curve would affect the dispersion of shots but your hypothesis that PP doesn’t use the mediocre curve and that such a curve being used “is clearly in error” is ruined by Snapshot’s own statements.

JMPicard · May 8, 2018, 5:23am

Please take a look at this post by someone else at these forums. He put down a tremendous effort to show the numbers.

lompabo · May 8, 2018, 6:12am

Just my two cents: his argument about a Normal distribution being more physically likely is correct. I used a Beta in my post just because it is naturally bounded and easy to customize (hence a bit more game friendly).

Using a bi-variate Normal is also a viable choice: you can still do the sampling via marginalization. In fact, you can freely choose the kind of probability distribution that you want for a section.

Cheers!

Michele

Duskmare · May 8, 2018, 6:58am

While I can follow everything after “the normal distribution” explanation, it’s pretty pointless without understanding ‘the normal distribution’. So first off… What’s the Y axis on the graph? The one that goes 0-8000 and has no units of measurement. Without that your graph means nothing. You don’t even say what it could be in your description.

Additionally, @Aknazer is right. This doesn’t get around the issue of the developer stating that accurate and inaccurate shots were just as unlikely.

You then point over to @lompabo’s post, an excellently written guide to bell graphs. However that really doesn’t clear up the issue any better since his math is very complex and he uses a lot of jargon and algebra that most people, myself included, will get lost in. I made it about half way through before giving up, however I could see that it was a system designed to put the majority of shots within the centre, rather than around the middle of the circle. So while I said it’s something I would like to see in the game I don’t think it was describing what the developers have actually done. No disrespect to @lompabo but he was working from the premise that he wanted shots in the middle of the circle and how that would be possible with a bell curve. He’s not part of the dev team and his post is a standalone system. A great one, that I’d like to see in the game, but not necessarily the one they used in the backers build.

It’s just a very simple matter of contradicting what the dev said. You CAN’T have a HIGHER probability of shots in the centre if there’s an EQUALLY BAD chance of them being highly accurate or highly inaccurate. Therefore the highest probability is that the bullets will land between the centre and the edge of the circle, with a bell curve shape to the probability.
Sure, they could use a low, flat bell curve as opposed to a spire, meaning that shots have a lot of variance rather than being mostly likely to land in the centre between the two circles, but at the end of the day the majority of the shots won’t land toward the centre of the circle. I feel that’s a problem when the circle is already so big that targeting enemies more than 10 meters away will, in all likelihood, barely scratch them.

Geez, could you get any more passive aggressive? Not only are you bad at listening (because I never asked for “perfect” probability plots) you are also not in the right forum branch. This is about how a bell curve would work to give a 50/50 spread of probability in the calculation of bullet trajectories between the two circles we’re shown in game. It’s not about the UI giving percentage chances to hit or background processes that skew things in the player’s favour.
Also, if you’re going to mention someone by reference and insult them then just grow a pair, tag them and say it to their face. Pretty pointless writing a comment that’s nothing but negative if you’re not even going to address it to the right person.
(for any onlookers that don’t understand why this is aimed at me specifically check out this thread for context: Shots too random - #67 by lompabo )

Topic		Replies	Views
[Backer Build 1] Shot Circles and the Bell Curve Game Feedback	14	636	May 5, 2018
About randomness of bullet trajectories (with math!) Game Feedback	0	685	May 7, 2018
Shooting "grammar" and chances / visual presentation Game Feedback	2	831	July 8, 2018
[Backer Build 1] UI Information When Snap-Aiming Game Feedback	1	648	July 8, 2018
After 20 hours I still don't understand how manual aiming works Hints and Tips	11	1342	May 10, 2020