back

Cheminformatics to improve Wikidata on chemical compounds

If you suspend your transcription on amara.org, please add a timestamp below to indicate how far you progressed! This will help others to resume your work!

Please do not press “publish” on amara.org to save your progress, use “save draft” instead. Only press “publish” when you're done with quality control.

Video duration
00:23:21
Language
English
Abstract
-

Chemistry has long been an important domain-specific corner in the Wikipedia and Wikidata communities. The two are not tightly linked, though increasingly information from Wikidata shows up on Wikipedia ChemBoxes. We have been using Wikidata content in our research into human metabolism and metabolic diseases. This requires the information about metabolites in Wikidata to be accurate. We have been using cheminformatics to support our manual work to add missing information and compounds and curate existing knowledge.
In this presentation it will be shown how the Chemistry Development Kit (Q2383032), Bioclipse (Q1769726), and QuickStatements (Q20084080) have been used in the past two years for these purposes (chem-bla-ics.blogspot.com/search?q=wikidata). We will demonstrate this infrastructure of Open Source tools, and how it can be used for using the Simplified molecular input line entry specification (Q466769) and International Chemical Identifier (Q203250) information to: link out to external databases (e.g. the EPA CompTox Chemistry Dashboard (Q26998510), MassBank (Q24088019), LIPID MAPS (Q20968889), etc); add physicochemical properties; add missing InChIs and chemical formulas using the SMILES; add new compounds based on a SMILES; and, detect incorrect or inconsistent information in Wikidata items on chemical compounds.

Talk ID
wikidatacon2019-1144
Event:
wikidatacon2019
Day
2
Room
Kepler
Start
4:30 p.m.
Duration
00:25:00
Track
None
Type of
Talk
Speaker
Egon Willighagen
Talk Slug & media link
wikidatacon2019-1144-cheminformatics_to_improve_wikidata_on_chemical_compounds
100.0% Checking done100.0%
0.0% Syncing done0.0%
0.0% Transcribing done0.0%
0.0% Nothing done yet0.0%
  

Work on this video on Amara!