Benford's Law distribution of the 2019 Canadian election by party [data from elections.ca] - all polls - OmegaCanada

Benford's Law distribution of the 2019 Canadian election by party [data from elections.ca] - all polls (media.omegacanada.win)

posted 4 years ago by endr 4 years ago by endr +41 / -0

24 comments download

24 comments share download save hide report block hide replies

You're viewing a single comment thread. View all comments, or full comment thread.

Comments (24)

sorted by:

▲ 3 ▼

– RightOfSask 3 points 4 years ago +3 / -0

The leading number.

https://www.youtube.com/watch?v=XXjlR2OK1kM

permalink parent save report block reply

▲ 2 ▼

– Ham_Sandwich77 2 points 4 years ago +3 / -1

The leading number of what?

I understand Benford's law, I'm asking what the x axis is actually denoting. Those bars are counting how many times "the number" has a 1, 2, 3, etc as its first digit.

But what is "the number"? Vote counts per polling station? Vote counts per riding? Vote counts over time?

You have to know this because certain kinds of number sets can have artificial constraints that can give an unnatural distribution of first digits.

If we don't even know what X is supposed to be, we can hardly draw any conclusions from these graphs.

permalink parent save report block reply

▲ 3 ▼

– RightOfSask 3 points 4 years ago +3 / -0

You will have to ask OP.

If it's the leading number of votes in a electoral riding, then Benford's law won't work, because every electoral riding has between 60,000 to 80,000 voters. Thus the magnitude of the data doesn't vary enough. But maybe he plotted the vote counts in municipalities and townships, which should vary enough in magnitutde.

permalink parent save report block reply

▲ 2 ▼

– endr [S] 2 points 4 years ago +2 / -0

It's the poll results data from here:

https://elections.ca/content.aspx?section=res&dir=rep/off/43gedata&document=byed&lang=e

I separated all results across the country per party and per poll, then applied Benford's law to their sets.

You might be right about not getting enough variety in magnitude though... I'll try summing by polling station first

permalink parent save report block reply

▲ 2 ▼

– endr [S] 2 points 4 years ago +2 / -0

by district: https://i.maga.host/AsnD9H7.png

by polling station: https://i.maga.host/HOLwiqE.png

permalink parent save report block reply

... continue reading thread?

▲ 2 ▼

– RightOfSask 2 points 4 years ago +2 / -0

Even with polling stations the magnitude variation might be too small. You would need something like in the US, speak data on a county level. They have counties with only a few thousands votes and counties with a few million votes.

permalink parent save report block reply