-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cdot responsible for fixing bad HGVS, allow warnings etc #27
Comments
It would be good to collect a huge test case of bad HGVSs (from search bars around the place) and then work out how to resolve them etc |
The two big issues we see in Shariant search (the examples aren't valid, just showing off the kinds of issues) :
|
Will run on each environment:
Then collect them all together. Have put scripts in "paper" directory in cdot github Emailed csv to James and myself to continue analysis (need to clean etc stuff from private servers before I share it) |
Web developers know to clean their user text, but the main use case of cdot would be bioinformaticians hacking together scripts I think We could run an evaluation of how many HGVSs resolve from the literature and ClinVar etc as well |
Few thoughts: At the moment 0 modification is done on HGVS import Search currently works via:
Few ideas:
|
There are plenty of bad HGVS strings out there, especially when people are typing into a search box - eg they put spaces in there, forget the colon, have unbalanced brackets etc.
VariantGrid has a lot of functionality to handle sloppy/bad HGVSs - mostly in search and HGVS Matcher
Ideally should move all of this functionality into cdot, so that it can be generally useful.
Would be nice to have a framework where you return a list of well structured warnings / errors etc.
The text was updated successfully, but these errors were encountered: