-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to represent "linkouts" as opposed to "cross-references" in OBO ontologies? #165
Comments
We have some stuff like this in FBbt - currently all using xref, which I agree could maybe be improved. I guess it would be important to define any new property well enough that it is clear when to use it (rather than xref). For me I would consider that the wikipedia page could be a source for the term definition - so should maybe be an xref annotation on the definition? The other database linkouts are different to this, providing some data/context, but not really sources for the concept definition (users of the term rather than sources for the term...) - my 'spidey senses' tell me that this might be an important distinction |
@Clare72 yep, important point!
This indeed makes the decision a tiny bit more complex. |
I wonder if the issue is as you describe above not the property per se - as you say, seeAlso was made for that- but rather a project specific desire to hide or show some properties to some users? Eg if using seeAlso for everything, you'd want to show GH issues to ontology developers, and general info websites to ontology users. Does this then become a UI rendering layer need on top of the resource rather than different properties? I'm thinking about the work eagle-i did in the past for example @carlotorniai and @ontowonka worked on |
@mcourtot Thank you for your position. In the end, you are right. If I would create a nicely defined property for "link to additional information about this resource", I have not solved the problem yet that I want to show some information to ontology editors, some to clinicians and some to researchers - I essentially squeeze the "rdfs:seeAlso" problem into a narrower space. I was hoping we could find a way to find a property that could work for 85% of the cases where a linked resource provides "additional information about the domain concept". @cmungall has repeatedly encouraged me to see if we can define the problem as a UI problem, but it creates a big burden on API and UI developers to actively employ a second standard for linkouts. You could imagine a linkout config like this (maintained by the ontology curation team, say Mondo): resources:
- id: clingen
linkout_url: https://search.clinicalgenome.org/kb/conditions/$1
id_processing:
type: curie
idspace:
- MONDO:123
- MONDO:124
- MONDO:987
- id: otar
linkout_url: https://platform.opentargets.org/disease/$1
id_processing:
type: curie
replacements:
":": "_"
idspace:
- MONDO:123
- MONDO:124
- MONDO:987 This includes both the list of IDs covered by the external resource, the endpoint and the ID processing rules required to say, change MONDO:123 to MONDO_123. But lets say we generate a config like this, we still
|
Completely agree with all your points; just trying to start from the use case you described (different rendering to different user types) and see what the options are. I can only think of
Maybe try with a specific example of 1 and see if that works well enough as starting point? If it doesn't fit then we can evaluate other options? |
I think I would vote for subproperties based on the content of the resource, rather than trying to guess what each type of user may want (and having the same users across all ontologies!)
|
By linkout, it looks like you mean include links to websites where ontology users (i.e. database curators) have linked information to a specific ontology term (looking at the CLINGEN example). Is that right? If so, I personally feel like this is beyond the scope of ontologies. I understand the desire to promulgate these links to as many people as possible in as simple a way as possible and I completely agree that it is super important to connect this data. In my opinion, what we really need is a simple, standardized query language that supports querying ontologies and databases... something like federated SPARQL queries that goes beyond querying RDF and also supports querying APIs/databases (and ideally also handles OWL logical axioms better). |
@allenbaron I think you are right in that standardising data access is the "right" thing to do - but realistically, we are decades away from a world where we can ask "do you dear database have information about this term?". Furthermore we still would have to encode somewhere the information which resources to even talk to. @cmungall argued also along the lines of "this should be solved at API/UI level". My argument is:
The ontologies themselves have better chances to curate this information "where else can you find Information about this disease". If there was a real semantic web, yes, this issue would not exist.. what do you think? Are you amenable to the "practicality" angle as an argument? @Clare72 your suggestion is probably very sane. I will need to think it through, fearing, as always, term proliferation and making things complicated for Ontology implementors. |
I'm a dreamer, I guess 😅, and a realist. I am absolutely amenable to a practical approach that can be implemented. I think @Clare72's suggestion is probably most practical at the current time and agree that if this is done at the level of the ontology, it would require less people to do the work of identifying and creating these links, since they'd propagate to anyone who uses the ontology. We already have But, when it comes to data linked to an ontology term, there are a number of questions that, in my mind, need to be addressed before something reasonable can be implemented:
I know that practicality often gets squashed by hypotheticals in discussions. I don't want to squash a good practical approach, especially one I can implement 😉. I think about the problem of linking data literally every day. What about using rdfs:seealso subproperties to publish a list of online resources that might have additional information about terms in an ontology, along with some sort of description of the resource (maybe including an API endpoint?), instead of creating individual links at the term level? That would at least make ontology users aware of resources out there, which is sorely needed. |
NCBI has been doing this forever https://www.ncbi.nlm.nih.gov/projects/linkout/ Many other resources also adopt a similar mechanism |
@allenbaron very reasonable arguments, I will get back to you below, but lets first look at @cmungall's suggestion. Here is the key documentation: https://www.ncbi.nlm.nih.gov/books/NBK3802/ This is essentially what I am suggesting; only that
The linkout system from ncbi is much more powerful than what I am asking here (including the ability to provide boolean queries for search and I think API endpoints). It also still requires manual work; every element on the side of the internal resource (say the ontology) still needs to be manually associated with every element in the target, external resource. No free lunch in terms of: all genetic diseases in Mondo can be accessed at that location (its all individual linkouts). TLDR: If anything, NCBIs linkout confirms that we need some system to orchestrate the linkouts. Back to @allenbaron
Why should there be a rule? Link the ones you (the ontology owner) wants to link for one reason or the other. The question is not much different from the question of which resources to xref to, right? And no one has regulated that. I do expect some issues of course for the ontologies themselves that wish to build a linkout list like this and need to decide what to include. On ontology level, your concern may be warranted, but I daresay that while maybe 1 order of magnitude larger in scope, its essentially the same as the xref problem. Indeed, you could always phrase it as such, as you can supply arbitrary linkouts as xrefs in your ontology.
I totally agree with these questions. They are a big concern for me as well, as practically speaking, most UIs will want to have super fine grained control over what to display on which part of the side (Clinical reports; genomics data; etc). Right now I dont have an answer. The question is: must we solve this problem conclusively before moving forward here? We could decide on a small set of properties like the ones @Clare72 suggested and then see where this takes us. My slightly fishy provisional answer would be to add
I think we dont need to answer this, indeed challenging, question to move this discussion forward. My current ideal would be to have the external resources provide the linkouts themselves. So this is the SOP:
The automated process (API crawling etc) is also an option, but for me, the most important case right now is the case where the external resource wants to be linked out to. |
@allenbaron if I see this correctly, you deal with this issue in DO by doing this:
Its exactly these kinds of cases I would like to solve here - without getting into the debate of if we should, but if we do, how do we best do it. |
In some sense, you're right. We do add URL links as xref annotations on definitions to publications that provide more information about a disease. The great majority of these "publications" were actually used to create the definition, probably including this one (rarely we can't find publications to cite that we feel sufficiently support the definition and use OMIM or NCI as the definition source). I don't know why they're formatted starting with "url:". I'm drawing a blank on how best to solve this problem. For the links that are focused on providing more info for a human to review:
It could be as simple as creating a more specific properties for each (like One the other hand, this one:
Seems like the kind of thing that a user would want a program to be able to access and aggregate data from for use in further analysis. That wouldn't be solved with a simple link to a website. |
Our paper https://hal-lirmm.ccsd.cnrs.fr/lirmm-02945170 was measuring the differences between the different types of XRefs And the recommendations were : So indeed rdfs: seeAlso was recommended (which I agree with you @matentzn that it is used by many other things.... but in a sense this is the Semantic Web, you do not control how a property will be used). If you don't like landingPage, then I would go up the hierarchy and take foaf:page. |
Thank you everyone for the discussion and contributions. None of the discussed solutions are perfect. My personal preference after re-reading everyones positions would be the subproperty of rdfs:seeAlso suggestion here. Unless there is any strong objection, I will move to implement it. |
Right now, we cannot agree community wide on a standard way to represent linkouts, not even whether or not they should be curated in the ontology at all. This is breaking a bit more with Mondo being a pure ontology vs a bit databasey, but right now, we just need a convenient way to link to curated content for our stakeholders. Here, we add the property so we can start using it, even if we might change the ID to an OMO id in the future. Context: information-artifact-ontology/ontology-metadata#165 #7071
* Add curated content resource property Right now, we cannot agree community wide on a standard way to represent linkouts, not even whether or not they should be curated in the ontology at all. This is breaking a bit more with Mondo being a pure ontology vs a bit databasey, but right now, we just need a convenient way to link to curated content for our stakeholders. Here, we add the property so we can start using it, even if we might change the ID to an OMO id in the future. Context: information-artifact-ontology/ontology-metadata#165 #7071 * added new property to the qc-permitted-properties.sparql * Update qc-permitted-properties.sparql * Update qc-permitted-properties.sparql * Change the if of curated content resource to the usual format Instead of using an OBO ID. * Update qc-permitted-properties.sparql to reflect correct property id --------- Co-authored-by: Sabrina Toro <[email protected]>
I am thinking hard about the following problem. I would like a way to simply add URLs with additional information to a class. We all know about mappings - linking a class or term from one ontology to another; we know cross-references, which includes mappings from ontology terms to database identifiers or schema elements. But in some cases, we want to just link to a website that provides “additional information” about a class. For example, all these pages provide additional information about “X-linked syndromic intellectual disability” (MONDO:0020119).
Now you could go all in on ugly and do this:
xref: clingen.condition:MONDO:0020119
, or, god forbid,xref: clingen.condition.mondo:0020119
xref: otar.disease: MONDO_0020119
xref: wikipedia:X-linked_intellectual_disability
I am not saying this is impossible, but isnt this going a bit far? I don't really consider these examples cross-references at all, as you can see by the issue I lined above, and this rough categorisation (my opinion only):
Alternatively, we could go ahead and say, well my friend, use
rdfs:seeAlso
that is what it has been designed for ("rdfs:seeAlso is an instance of rdf:Property that is used to indicate a resource that might provide additional information about the subject resource.").Here is my problem with this:
Obviously, what I would really want is to be able to uniformly display those URLs across all our tools - If you show a page with information about a disease, you should be able to show all the disease relevant links on that page. In my experience, we have used "see also" for basically everything: thousands of links to GitHub issues, papers, and documentation pages. This information should probably not be displayed alongside information "about the concept" and presented to the user. So my spidey senses tell me a new property would be better, like "additional information", "linkout" or some such. I am very much not sure, but I very much need to solve this problem at least in my projects until the end of the month.
If anyone has an opinion or ideas, let me know!
The text was updated successfully, but these errors were encountered: