I am trying to research the effect of experience and breaks from play on performance (i.e., does spaced training improve ELO more than massed training?).
I can see that the author of this research did something similar; however, it seems he went well beyond that and built his own offline database (the setup looks like a LOT of work and is beyond the scope of what I need). Link: https://lichess.org/blog/YXgv5xMAAMxG4VeC/how-long-does-it-take-to-get-good
I am simply looking for ELO rating histories for, let's say, ~10K players over time (or from game to game). I can see the API lets you export this per user, but I would like to export it for multiple players, ideally from when they first joined Lichess.
There are two methods I envision:
- Use the Lichess API ("Get rating history of a user", which outputs at most one rating value per day) and pick, let's say, 10K players at random. I'm not sure whether the API will let me run it for 10K distinct users, though: this user appeared to experience issues with exporting rankings for 300 users.
- Download the massive Lichess open database (https://database.lichess.org/) and filter and transform the data until I get the users I want. Problem: the files are large (up to 34 GB), and because there is only one file per month, I will not capture performance changes spanning more than one month unless I compile all the hundreds of GB together.
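For the first method, here is a minimal sketch of what I have in mind, using only the Python standard library. The endpoint path, the response shape (a list of variants, each with `[year, month, day, rating]` points and a zero-based month), and the advice to send one request at a time and wait a minute after an HTTP 429 are my reading of the Lichess API documentation, so treat them as assumptions. The function names, the one-second pause, and the retry count are my own choices.

```python
import json
import time
import urllib.error
import urllib.request
from datetime import date

# Assumed endpoint for "Get rating history of a user".
API_URL = "https://lichess.org/api/user/{}/rating-history"


def parse_rating_history(payload, variant="Blitz"):
    """Return a sorted list of (date, rating) pairs for one variant."""
    for entry in payload:
        if entry.get("name") == variant:
            # Assumed point layout: [year, month, day, rating], month zero-based.
            return sorted((date(y, m + 1, d), r) for y, m, d, r in entry["points"])
    return []


def fetch_rating_history(username, variant="Blitz", retry_wait=60, max_retries=5):
    """Fetch one user's history, backing off when the server answers 429."""
    url = API_URL.format(username)
    for _ in range(max_retries):
        try:
            with urllib.request.urlopen(url) as resp:
                return parse_rating_history(json.load(resp), variant)
        except urllib.error.HTTPError as err:
            if err.code != 429:
                raise
            time.sleep(retry_wait)  # rate limited: wait before trying again
    raise RuntimeError(f"still rate limited after {max_retries} attempts: {username}")


def fetch_many(usernames, variant="Blitz", pause=1.0):
    """Fetch histories strictly one after another, never in parallel."""
    histories = {}
    for name in usernames:
        histories[name] = fetch_rating_history(name, variant)
        time.sleep(pause)  # hypothetical courtesy delay between users
    return histories
```

Calling `fetch_many(["some_user", "another_user"])` with real usernames would return a dict mapping each name to its dated ratings. At roughly one request per second, 10K users would take around three hours, assuming no rate limiting along the way.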
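For the second method, a rough sketch of the filtering step, assuming the monthly dumps are PGN text with the usual `White`, `Black`, `WhiteElo`, `BlackElo`, `UTCDate` and `UTCTime` header tags. The function name `iter_rating_rows` and the `wanted` parameter are hypothetical. It reads one line at a time, so a file never has to fit in memory, and it only looks at headers, not moves.

```python
def iter_rating_rows(lines, wanted=None):
    """Yield (username, utc_date, utc_time, rating) for each player of each game.

    `lines` is any iterable of PGN text lines; `wanted` is an optional set of
    usernames to keep (None keeps everyone).
    """
    headers = {}
    for line in lines:
        line = line.strip()
        if line.startswith("[") and line.endswith("]"):
            # Header lines look like: [WhiteElo "1500"]
            key, _, value = line[1:-1].partition(" ")
            headers[key] = value.strip('"')
        elif line and headers:
            # First movetext line: the header block is complete, emit the rows.
            for colour in ("White", "Black"):
                name = headers.get(colour)
                elo = headers.get(colour + "Elo", "")
                # Skip missing or non-numeric ratings such as "?".
                if name and elo.isdigit() and (wanted is None or name in wanted):
                    yield name, headers.get("UTCDate"), headers.get("UTCTime"), int(elo)
            headers = {}
```

The dumps are zstd-compressed, so one option is to decompress on the fly with the `zstd` command-line tool and pass `sys.stdin` as `lines`. Since every row carries a username and a date, rows from several monthly files could be appended to one table and sorted per user, which would cover changes spanning more than one month without merging the raw files.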

