There are currently two sources, from where online games are imported:
- TWIC
- LichessEliteDatabase
Both Sources are having a dedicated "SOURCE" tag, wich is like I wrote above.
Additionally I set for the games out of the LichessEliteDatabase the eventname (Tag EVENT) to the "timecontrol" Blitz, Rapid or Classical.
So, if you filter for Titled Tuesday in the TWIC Source and Blitz in the LichessEliteDatabase Source, your good to go by removing the most online games.
I don't see a way, to stay perfectly clean with these tags for the same reason, written here:
@kajalmaya said in #39:
The problem is games from many sources don't say blitz or rapid anywhere, but just have a TimeControl tag. The pgn format has a specification, but I think it should be updated, and then people should follow it, which is not going to happen. The same game from different sources may have even the main 7 tags written differently. Such things make curating any database a difficult task. It is admirable that OP is trying to do that.
That's another reason, why I took a very aggressive way to eliminate the duplicates.
There are currently two sources, from where online games are imported:
- TWIC
- LichessEliteDatabase
Both Sources are having a dedicated "SOURCE" tag, wich is like I wrote above.
Additionally I set for the games out of the LichessEliteDatabase the eventname (Tag EVENT) to the "timecontrol" Blitz, Rapid or Classical.
So, if you filter for Titled Tuesday in the TWIC Source and Blitz in the LichessEliteDatabase Source, your good to go by removing the most online games.
I don't see a way, to stay perfectly clean with these tags for the same reason, written here:
@kajalmaya said in #39:
> The problem is games from many sources don't say blitz or rapid anywhere, but just have a TimeControl tag. The pgn format has a specification, but I think it should be updated, and then people should follow it, which is not going to happen. The same game from different sources may have even the main 7 tags written differently. Such things make curating any database a difficult task. It is admirable that OP is trying to do that.
That's another reason, why I took a very aggressive way to eliminate the duplicates.