-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Establish standard for CL database_cross_reference and contributor annotations #2013
Comments
Also, thank you, @ubyndr, for these SPARQL queries: |
This is a case for the techboard, please add to it; you need these queries to be totally safe:
I would make sure all three are there (ask @anitacaron, @ubyndr or @udp for help) in a PR, then fix all the errors rather than the other way around (fixing first), because this ensures that the tests are working. |
Regarding DOIs, my take is that they should be recorded as But they SHOULD NOT be recorded as IRI with values of the form We SHOULD NOT apply the same logic as for OBO/PURL identifiers, where |
For those interested in the extended debate of what @gouttegd is talking about (and with whom I agree for practical purposes, modulo some limitations on the RDF side of things), is information-artifact-ontology/ontology-metadata#59. One of my least favourite threads my work life 🗡️ For now, we should use |
Ah, I knew I had ranted about that before! :D |
@bvarner-ebi I recommend to not make that choice hastily, and instead |
For what it’s worth, FBbt is internally consistent on using |
@anitacaron, as discussed offline, please see the pending questions, with more context above:
|
The questions are all great and very important. Let's not decide these things on an ontology by ontology basis. In general, database cross references should be used only for two use cases:
PMID vs DOI is a great question that should be asked on the issue tracker but I would tend to "does not matter", and second best "DOI". All other uses (publications (no standard yet but should request), contributors (dcterms:contributor), URLs with additional information (rdfs:seeAlso) should be faded out). In case 2, we go against rule 1 for orcids, and use IRI syntax (orcid - always IRI), but only for orcids. DOIs are always curie syntax. |
Don't forget the third case: to represent cross-ontology mappings, even when the mapping object is a OBO term. (Yes, ideally we should all use SSSOM for mappings, but for now almost everyone is still using DB cross-refs.) For this use case, and at least for Uberon/CL, the value of the DB cross-ref MUST be a curie, because that's what the bridge generation script expects, and we're not ready to switch to anything else. |
Yeah I would like at least to move this to skos / semapv in the mid term. |
Do you mean, keep using annotations directly in the ontology (instead of SSSOM mapping sets in externally maintained files), but with dedicated properties from skis or semapv instead of oboInOwl:hasDbXref? |
As an intermediate step yes |
Not convinced it is worth it. Such an intermediate step would already require lots of effort to either adapt the existing bridge generation script or come up with a new one -- I'd rather have those efforts directed at supporting the use of SSSOM directly. |
Fine by that. I am 100% on your side when we are talking about xrefs that serve a direct purpose, like generating bridge files; but most ontologies provide mappings merely as an additional piece of metadata, and here, I think teaching people SSSOM may be overkill. But for CL and Uberon (and Mondo etc) you are right! |
Thank you, all, for the feedback.
For database_cross_reference, use CURIE format, enter in Protégé as Value on the “Literal” tab, leave Datatype empty. Non-CURIE values (e.g., URLs) are discouraged, but when used are entered the same way. The exception to this is when ORCIDs are used. ORCIDs should be entered as an IRI in the IRI field on the “IRI Editor” tab.
Yes.
See above.
No, but the bioregistry standard should be used, although this currently is not consistent across ontologies. Since DOIs may be more readily available, they can be consistently used by editors if preferred, written as a CURIE (doi:x) and entered in the “Literal” tab as described above.
IRIs entered on the “IRI Editor” tab. When ORCIDs are used to annotate definitions (e.g., a subject matter expert provides a definition with no readily citable source), the precedent is to add this as a database_cross_reference on the text definition/comment. This will be the standard until otherwise advised. When adding ORCID to identify a term contributor, the annotation property dcterms:contributor is used, and the ORCID is still added on the “IRI Editor” tab,
For ORCIDs, the whole IRI is preferred. In all other cases, the CURIE is preferred. |
Excellent summary. |
@bvarner-ebi I'd say they are in scope, yes. It's clearly something most if not all editors (and reviewers!) should be aware of. |
In the process of reviewing #1997, inconsistencies were noted in how annotations are recorded.
Annotation properties to be clarified:
database_cross_reference:
For example,
the DOI database_cross_reference for the text definition of CL:4023072 'brain vascular cell' where it is literal
vs
the DOI database_cross_reference for the exact synonym "hepatic progenitor cell" in CL:0002196 'hepatic oval stem cell' where it is an IRI
contributor:
For all of the above:
Is there a standard for casing?
-- doi should be lowercase, PMID should be uppercase
The text was updated successfully, but these errors were encountered: