Category: statistics

What and when and how often, revisited

Five years and one day ago I put up a table on the stats page to show how often and when, on average, items are found in the advent calendars that have been used for the competition. It was rudimentary and I always planned to upgrade it, and I’ve started a few times but always come unstuck in forming the database query that would drive it.

Until today, when I started from scratch, ran into the same problem again, googled it, found that I was on paper doing it right, looked up the syntax for the bit of the query that was clearly not working and tried something didn’t look like it would work but somehow did. So much, if not all, of this website was coded by trial and error!

Anyway, now the table shows not just how often each item has occurred, but how often on each individual day. On top I’ve layered mean and mode because I know how to calculate those. The table is not in the final form I envisaged – for that I’ll either use a scalable font size or a colour gradient on the numbers so higher ones stand out more, but that is for another day.

I also need to scan through the database and rationalise the names I’ve used for items, as, for example, Christmas Pudding appears twice as it must have an errant space or something similar to create a variety of spellings for the same item. I’ll try to get to that tomorrow!

Mild embarrassment

It’s fun to note that for a year I decreed that I want to try and have a bumper field, we are looking at having our lowest field since 2007 – admittedly I haven’t gone as gung-ho on promotion as I could have, because I’ve never really felt comfortable with the idea of it. Quality, not quantity, anyway!

Related to this I’ve fixed an element of the website that wasn’t working exactly as designed – the query that pulled the data from the database for the “number of new/returning players” bit of the stats page only actually worked if there was a new player registered for a given year – there are no new players this year so the table/graph didn’t recognise the existence of 2022. I have fixed this and all is now displaying as desired!

Tomorrow, if I have time, I might see if I can do my intended redesign of the “Item Stats” data on the same page – if not it will hopefully happen soon. The table as it stands is pretty naff!

Good luck to our 31 players, and it’s not too late to join yet if you haven’t!

2021 updates

This year I made it so that when you guess, it no longer reloads the same page but a specific page that simply says whether or not your update was successful and then offers a link back to the homepage so that a player can then see their guess in the table (as I assumed they’d want to), but I’m still getting the behaviour where a player is saying yes to reposting their form data and as a result the same guess is being entered multiple times in quick succession. A good design will stop this happening and I thought the bodge job that I’d done would be enough – after all, why would you want to refresh the page that only says your guess change was successful!? Anyway another rethink is required; I’ll have to do something clever like throw out an error if you are trying to change your guess to what it already is…

I haven’t had the time to dedicate to tweaking the website as much as I’d want but I’m well on the way to eliminating the ‘table-headers-at-the-bottom’ lunacy that was a direct result of imitating the design of the F1 digital graphics that I have mentioned previously. Hopefully that process is one I will complete in the next few days and then I’ll be able to move on to a better version of the which-items-have-appeared-on-what-days table that I’ve been planning for a while. I started this last week and found that it’s not as simple as I thought it would be so I’m scratching my head a bit, but with a bit of time I should be able to figure it out!

A player has entered this year with not their real name and this is as good a moment as any to remind players that if they want something similar then they can get in touch and their display name will be amended accordingly. I’m reluctant to give free choice (I know enough people that will choose either something rude or something akin to Boaty McBoatface to rule that option out) but I’m sure between us we can find something agreeable. It’s actually got me thinking about whether the default of full name is actually needed – if, for example, the all time top three were listed as Simon R, Mark C and Antony B I don’t think the competition would lose anything. I’m going to ponder this one and report back, and thoughts are welcome as ever!

A couple of players have pointed out on social media that there are twenty-five items to choose from and they are correct! This means that at least one day has more than one item behind its window, and I don’t think I’m giving too much away when I state this.

I got the Facebook and Twitter links working again and reverted them back to the “post everything” mode that they existed in when they were first created. In the interim the rate limiting of the third party services that I used meant that I had they had been restricted to only updates about window opening and points scoring, and after a couple of days it became clear to me that that had been a good thing so I have deliberately limited it myself now. Please continue to subscribe on Twitter and Facebook to get the important updates placed directly into your feeds (where their algorithms think best)!

Not long left to get your first guesses in on time, and remember a late start carries a penalty so don’t be late! Good luck everyone 🙂

The race to 1000

Couple of housekeeping measures before I begin – late last night I made a change that has no bearing over the actual results or standings of the competition but that I wanted to make for a while – where everything else is equal (score, number of correct guesses, highest scoring best guess etc) the current standings table now ranks players in the order they joined the competition this year. Previously it used a random and mildly nerdy method that is far to silly to explain. I’m happier now. I’ll reiterate that if all the other things are equal then the players are fully and unbreakably tied, and this is shown as such in the table!

If you head over to the stats page then you’ll see the all-time scores table in all it’s massive glory. Atop it sits Simon Rutter, who has been at the top of that table for fifteen straight years now. He is on 880, with Mark Coughlan 92 points behind in second (where he has sat for fourteen years himself), Antony Brown a further 47 back in third (no mean feat for a player who started four years after the others at the top) and Ross Turnbull is fourth, 18 points adrift of the top three. I highlight these players because all four could, in theory, hit a thousand if they play well this year!

Admittedly for Mark, Antony and Ross they’d require suspiciously high scores (212, 259 and 277 out of the maximum 300 respectively) but Simon is definitely within striking distance – 120 away is doable, last year’s victor scored 135 and Simon himself has a PB of 88 which isn’t too far away. I’m excited – this milestone is one that was never contemplated when this all started. I suppose, realistically, I shouldn’t get too excited. Simon averages 55 points per appearance so it’s statistically unlikely this year, or even next. But with a good performance, it could be this year. I’d love it to be. I might even make a first player to 1000 certificate (remember when I used to make certificates? I should start doing that again) to commemorate the occasion.

Please Simon, do well. It’d be epic!

Last quarter

We are now three-quarters of the way through this year’s Advent Calendar Challenge – yet again we have Santa appearing for the first time before day 24. A quick glance reveals that 2014 was the last time Santa made it’s first appearance on Christmas Eve. The downside is no-one scored today, but that was kinda predictable! I didn’t make the calendar so it’s obviously out of my control!

A quick note on the matters raised in the previous post. On dummy/red-herring answers, I will think about this before the 2020 challenge further, maybe with some thinking-out-loud analysis: watch this space! On the other matter, on whether to hide guesses after day 18 or not, I have a compromise. The compromise was partially inspired when I was asked by a competitor privately if I would hide guesses or not. I replied I intended to leave things as they were (i.e. hidden after window 18) and that I would make the announcement in this blog post today. The competitor pointed out that I had just given them a possible advantage in that they could now take a copy of everyone’s guesses, knowing what was about to come. It got me thinking – whilst it’s a slight advantage only because everyone could take such a copy at any time, there’s a half-way approach between the two I asked you to choose between in the last post, and it is that approach that I have settled on.

So from now on*, guesses made after the opening of window 18 will be hidden. Guesses made before that point will not be. Trial basis, and we’ll see how it goes.

Six days left. Good luck everyone!

-* it’s not been implemented properly – the official rules pdf is pending update, and the censorship still applies on individual profile pages. Those will be fixed tomorrow!

2019

I recently moved house and that’s been a bit of a focus of late – it was only earlier this afternoon that I got to thinking how it was only a week to go and I should do the necessary behind-the-scenes bits to get the website into 2019 mode.

An aside – 20 November being “a week to go” is a statistic I’m interested in testing. Using the date of the first database entry each year, we get an idea of the launch date each year!

(data only exists from 2006 onwards)

Had to do a couple of tweaks to make the database query work – one year the competition carried on into the new year because of reasons, and other years robots click buttons in random months and record things in the database. But there we have it, and displayed another way:

Data correct as at 20 November 2019

In short, my assertion today that I am a week away seems to be bang on – 27 November is by far the most popular date for me to launch, and last year’s 28 was in the early hours making it reasonably likely I sent those emails late the day before too! I’m also interested to see that I have started the whole thing as early as EIGHT days away a couple of times – that just seems crazy to me now!

On to news about this year’s competition now and it isn’t great (for me at least) this year. This website’s host has upgraded its servers and as a result the code that powers the ACC is COMPLETELY broken. It’ll probably take a week to get going, and even then it may well be a bit of a bodge job just to get the basic functionality back without necessarily doing it well. That complete rewrite is needed more than ever!

But anyway let this be my pledge, in this election season, that in the next week I’ll devote my time to this whenever I can and the 2019 Advent Calendar Challenge will launch, on time, on 27 November. Possibly the 28th. Heck, even if the website can’t be made to work, I reckon we can work something more manual out…

Item Analysis

This has been long overdue (by which I mean both the thing this post talks about, and this post itself – in the case of the latter, all I can say is “December is busy!”) but I’m glad to get something up, finally, in a basic form at least.

Antony Brown, three times Advent Calendar Challenge Champion, once told me how he had analysed (back in the days when guesses were a free choice and not from a restricted list) likely items by frequency of appearance and when they most often appeared. I have no idea if he, or any other player, still does that kind of analysis but now we all can – it is a new table on the “stats” page!

To develop the table took a little bit of work – firstly I had to a bit of tidying of the historic item data. I had written “Christmas Tree” in four different ways it seems, but I have edited that (in the database at least) to be only one now. This was the case for a few items – I guess there is possibly scope for a whole new “calendar items” database table with unique item codes but that’s for another time. Once that was done (over 2 days during which I lost the progress I’d already made) it was pretty simple to knock together that table that now exists. It’s a bit ugly and the worst example of why table headers at the bottom is terrible, but gets the data across, for now.

Items that are in this year’s calendar are yellow – to get that to show was a bit of a bodge itself and I need to find a more elegant (read: automated, cascading) solution for the future but don’t read too much into it: whilst items already seen will alter the table, items not yet seen do not. Put another way, this database doesn’t in any way know what hasn’t happened yet.

It’s fun to see that the most frequent items have appeared more times that number of years this has been running – because of the fact more than one item can be behind a window sometimes! I want to add more analysis based on this table but for now this is all I’ve got!

See the new table here.

Edit: just noticed a bug. Grr. I’ll fix it tomorrow!

Welcome, about and player numbers

Welcome to the optimistically titled “more depth” section of the Advent Calendar Challenge website. I’ve been hoping to add more content (and/or redesign the whole website) for ages now and I’ve come to the conclusion that a “blog” of sorts is how is to get things moving. I will aim to post every couple of days with an article that either examines statsitics, details an aspect of the history of the competition or explains what on earth is going on. Lets see how long it lasts!

As it’s day 1, I am minded to cast my eye over player numbers over time. I’m told this should be a line graph but it took me ages to get the charts script working as a stacked bar chart and so that it remains!

After a rapid early growth, the graph shows a steady decline is setting in – only once (2012, perhaps an Olympics hangover? Probably not…) have the player numbers topped 50. This year, at the time of writing, 44 players are signed up, comprised of 40 returning players and 4 new players. New players, by the way, are a statistic that varies apparently randomly:

I am not concerned by the wane in player numbers, as yet. Firstly, as it goes it isn’t a significant one. The rules of the competition almost arbitrarily set the minimum number of players as 12, which we are nowhere near (I can’t remember exactly, but I am fairly sure I originally chose 12 as that was the number of players who survived* the first edition, and so that became the benchmark). Secondly I am happy to keep the competition to people I know and their trusted friends, and of that field only the subset who want to play – the overall number might shrink but overall, there’s still a good few people who want to play. Frankly, I won’t be concerned until we only get 11 players one year…

The number of players each year graph is new and now updating live on the statistics page. See, this blog has done some good already 🙂

* thirteen played, but one was disqualified

Recent Posts

Categories

GiottoPress by Enrique Chavez