'convert pgn database to pandas dataframe

Helo!

Using chess.pgn to convert a Chess database into a dataframe, to read the nth game from the database do I need to read all the previous ones first? I can't jump directly to the game n? If I want to distribute the processing in a database with 10^8 games, I can't start reading in the 9e7th game?

import pandas as pd
import chess.pgn
from datetime import datetime as dt
import os
import glob

nome_arquivo = "Analises_01.pgn"
inicio = 0
numero_jogos = 1.47e8

arquivo = open(nome_arquivo, encoding="utf8")

ratings = []
for j in range(numero_jogos):
    first_game = chess.pgn.read_game(arquivo)
    if j >= inicio:
        try:
            Brancas = int(first_game.headers["WhiteElo"])
            Pretas = int(first_game.headers["BlackElo"])
            ratings.append([Brancas, Pretas])
        except:
            pass


Solution 1:[1]

import chess.pgn
import pandas as pd

pgn = open("your_pgn_path_here.pgn")

my_list = []
for i in pgn:
    i = chess.pgn.read_game(pgn)
    my_list.append(i)
    df = pd.DataFrame(my_list)

#shows 210 game in dataframe    
print(df[0][210])

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source
Solution 1 higor fabiano