feat(DB): Chunk SQL Inserts #68

jimtendo · 2024-12-28T09:10:00Z

THIS IS A DRAFT. DO NOT MERGE.

Currently, during initial sync, when a block is added to the database, all of its transactions are inserted in a single query. For larger blocks, this leads to very high memory usage on the Postgres side.

This PR allows the SQL INSERT queries to be chunked into smaller queries so that Chaingraph can be synced on memory-constrained devices/servers. The chunk size is set with the CHAINGRAPH_POSTGRES_INSERT_CHUNK_SIZE_MB env var (on an 8GB device, I used 1MB successfully). In this PR it defaults to 32MB, but chunking should probably be made optional in the future.
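To illustrate the general idea (this is not the code in this PR), here is a minimal TypeScript sketch of size-based chunking. The helper names `chunkByApproxSize` and `chunkSizeBytesFromEnv` are hypothetical, and the sketch assumes each row has already been rendered to an ASCII-only SQL values string, so string length approximates byte size. How the env var is actually parsed in Chaingraph may differ.

```typescript
// Hypothetical helper (not from the PR): group pre-rendered SQL value strings
// so that each group's combined length stays at or below `maxBytes`.
// String length is used as a byte estimate, which is exact only for ASCII text.
const chunkByApproxSize = (items: string[], maxBytes: number): string[][] => {
  const chunks: string[][] = [];
  let current: string[] = [];
  let currentBytes = 0;
  for (const item of items) {
    const itemBytes = item.length;
    // Start a new chunk when adding this item would exceed the budget.
    // A single oversized item still ends up in a chunk of its own.
    if (current.length > 0 && currentBytes + itemBytes > maxBytes) {
      chunks.push(current);
      current = [];
      currentBytes = 0;
    }
    current.push(item);
    currentBytes += itemBytes;
  }
  if (current.length > 0) chunks.push(current);
  return chunks;
};

// Assumed parsing of the env var described above: a size in MB that falls
// back to 32 when the variable is unset or not a positive number.
const chunkSizeBytesFromEnv = (
  env: Record<string, string | undefined>
): number => {
  const mb = Number(env['CHAINGRAPH_POSTGRES_INSERT_CHUNK_SIZE_MB'] ?? '32');
  return (Number.isFinite(mb) && mb > 0 ? mb : 32) * 1024 * 1024;
};
```

Each resulting chunk would then be sent as its own INSERT, ideally inside the same database transaction as the rest of the block so that a failure part-way through does not leave a partially saved block.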

NOTE: This HAS NOT been properly tested. While syncing appears to have worked for me, I have not validated that all data was inserted correctly. I'm putting this PR up now in case someone else is trying to do something similar and is running into memory issues (and wants to take the gamble of trying this PR). I intend to complete this PR eventually, but if someone else wants to take over, please do!

Signed-off-by: James Zuccon <[email protected]>

feat(DB): Chunk SQL Inserts

4cb6835

Signed-off-by: James Zuccon <[email protected]>
