Address performance issues #142
Labels
flag:discuss
Since the changes in #83, query performance has degraded to unacceptable levels, so much so that we had to remove timeout limits from the APIs (e.g. neurobagel/api#304).
I made a simple time benchmark of these steps locally:
Note: the curl time is the total run time minus the internal time logged by the API itself (i.e. when the API is done minus when the user is done).

Conclusions
All that is to say: I think we should take this moment to split the query flow into two parts:
The first phase can be highly optimized and will have a very small response.
The second phase will also be faster (because we don't have to search the entire graph),
and its response will be smaller unless someone really wants to get all the data at once.
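
To make the idea concrete, here is a rough sketch of what such a split could look like. It is only an illustration under assumptions: I am assuming the first phase returns nothing but a per-dataset match count and the second phase fetches full records for the datasets the user picked. The data, the `min_age` filter, and the function names `phase_one` and `phase_two` are all hypothetical and not part of the current API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    subject: str
    age: int


# Toy stand-in for the graph, keyed by dataset. All values are made up.
GRAPH: dict[str, list[Record]] = {
    "ds-a": [Record("sub-01", 25), Record("sub-02", 61)],
    "ds-b": [Record("sub-01", 34)],
    "ds-c": [Record("sub-01", 70)],
}


def phase_one(min_age: int) -> dict[str, int]:
    """Search every dataset, but return only a small summary (match counts)."""
    counts: dict[str, int] = {}
    for name, records in GRAPH.items():
        n = sum(1 for r in records if r.age >= min_age)
        if n:
            counts[name] = n
    return counts


def phase_two(min_age: int, datasets: list[str]) -> dict[str, list[Record]]:
    """Return full records, touching only the datasets the caller selected."""
    return {
        name: [r for r in GRAPH.get(name, []) if r.age >= min_age]
        for name in datasets
    }


# Phase 1: cheap overview. Phase 2: details for the selection only.
summary = phase_one(min_age=30)
details = phase_two(min_age=30, datasets=["ds-b"])
```

With this shape, the first response stays small no matter how many subjects match, and the second one only grows as large as the selection the user asks for.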