Ycasd - a tool for capturing and scaling data from graphical representations

Abstract:

BACKGROUND\backslashr\backslashnMathematical modelling of biological processes often requires a large variety of different data sets for parameter estimation and validation. It is common practice that clinical data are not available in raw formats but are provided as graphical representations. Hence, in order to include these data into environments used for model simulations and statistical analyses, it is necessary to extract them from their presentations in the literature. For this purpose, we developed the freely available open source tool ycasd. After establishing a coordinate system by simple axes definitions, it supports convenient retrieval of data points from arbitrary figures.\backslashr\backslashnRESULTS\backslashr\backslashnAfter describing the general functionality and providing an overview of the programme interface, we demonstrate on an example how to use ycasd. A major advantage of ycasd is that it does not require a certain input file format to open and process figures. All options of ycasd are accessible through a single window which eases handling and speeds up data extraction. For subsequent processing of extracted data points, results can be formatted as a Matlab or an R matrix. We extensively compare the functionality and other features of ycasd with other publically available tools. Finally, we provide a short summary of our experiences with ycasd in the context of modelling.\backslashr\backslashnCONCLUSIONS\backslashr\backslashnWe conclude that our tool is suitable for convenient and accurate data retrievals from graphical representations such as papers. Comparison of tools reveals that ycasd is a good compromise between easy and quick capturing of scientific data from publications and complexity. Our tool is routinely applied in the context of biological modelling, where numerous time series data are required to develop models. The software can also be useful for other kinds of analyses for which published data are required but are not available in raw formats such as systematic reviews and meta-analyses. BACKGROUND Mathematical modelling of biological processes often requires a large variety of different data sets for parameter estimation and validation. It is common practice that clinical data are not available in raw formats but are provided as graphical representations. Hence, in order to include these data into environments used for model simulations and statistical analyses, it is necessary to extract them from their presentations in the literature. For this purpose, we developed the freely available open source tool ycasd. After establishing a coordinate system by simple axes definitions, it supports convenient retrieval of data points from arbitrary figures. RESULTS After describing the general functionality and providing an overview of the programme interface, we demonstrate on an example how to use ycasd. A major advantage of ycasd is that it does not require a certain input file format to open and process figures. All options of ycasd are accessible through a single window which eases handling and speeds up data extraction. For subsequent processing of extracted data points, results can be formatted as a Matlab or an R matrix. We extensively compare the functionality and other features of ycasd with other publically available tools. Finally, we provide a short summary of our experiences with ycasd in the context of modelling. CONCLUSIONS We conclude that our tool is suitable for convenient and accurate data retrievals from graphical representations such as papers. Comparison of tools reveals that ycasd is a good compromise between easy and quick capturing of scientific data from publications and complexity. Our tool is routinely applied in the context of biological modelling, where numerous time series data are required to develop models. The software can also be useful for other kinds of analyses for which published data are required but are not available in raw formats such as systematic reviews and meta-analyses. BACKGROUND Mathematical modelling of biological processes often requires a large variety of different data sets for parameter estimation and validation. It is common practice that clinical data are not available in raw formats but are provided as graphical representations. Hence, in order to include these data into environments used for model simulations and statistical analyses, it is necessary to extract them from their presentations in the literature. For this purpose, we developed the freely available open source tool ycasd. After establishing a coordinate system by simple axes definitions, it supports convenient retrieval of data points from arbitrary figures. RESULTS After describing the general functionality and providing an overview of the programme interface, we demonstrate on an example how to use ycasd. A major advantage of ycasd is that it does not require a certain input file format to open and process figures. All options of ycasd are accessible through a single window which eases handling and speeds up data extraction. For subsequent processing of extracted data points, results can be formatted as a Matlab or an R matrix. We extensively compare the functionality and other features of ycasd with other publically available tools. Finally, we provide a short summary of our experiences with ycasd in the context of modelling. CONCLUSIONS We conclude that our tool is suitable for convenient and accurate data retrievals from graphical representations such as papers. Comparison of tools reveals that ycasd is a good compromise between easy and quick capturing of scientific data from publications and complexity. Our tool is routinely applied in the context of biological modelling, where numerous time series data are required to develop models. The software can also be useful for other kinds of analyses for which published data are required but are not available in raw formats such as systematic reviews and meta-analyses. BACKGROUND Mathematical modelling of biological processes often requires a large variety of different data sets for parameter estimation and validation. It is common practice that clinical data are not available in raw formats but are provided as graphical representations. Hence, in order to include these data into environments used for model simulations and statistical analyses, it is necessary to extract them from their presentations in the literature. For this purpose, we developed the freely available open source tool ycasd. After establishing a coordinate system by simple axes definitions, it supports convenient retrieval of data points from arbitrary figures. RESULTS After describing the general functionality and providing an overview of the programme interface, we demonstrate on an example how to use ycasd. A major advantage of ycasd is that it does not require a certain input file format to open and process figures. All options of ycasd are accessible through a single window which eases handling and speeds up data extraction. For subsequent processing of extracted data points, results can be formatted as a Matlab or an R matrix. We extensively compare the functionality and other features of ycasd with other publically available tools. Finally, we provide a short summary of our experiences with ycasd in the context of modelling. CONCLUSIONS We conclude that our tool is suitable for convenient and accurate data retrievals from graphical representations such as papers. Comparison of tools reveals that ycasd is a good compromise between easy and quick capturing of scientific data from publications and complexity. Our tool is routinely applied in the context of biological modelling, where numerous time series data are required to develop models. The software can also be useful for other kinds of analyses for which published data are required but are not available in raw formats such as systematic reviews and meta-analyses.

DOI: 10.1186/1471-2105-15-219

Projects: Genetical Statistics and Systems Biology

Publication type: Journal article

Journal: BMC bioinformatics

Human Diseases: No Human Disease specified

Citation: BMC Bioinformatics 15(1),219

Date Published: 1st Dec 2014

Registered Mode: imported from a bibtex file

Authors: Arnd Gross, Sibylle Schirm, Markus Scholz

Help
help Submitter
Citation
Gross, A., Schirm, S., & Scholz, M. (2014). Ycasd– a tool for capturing and scaling data from graphical representations. In BMC Bioinformatics (Vol. 15, Issue 1). Springer Science and Business Media LLC. https://doi.org/10.1186/1471-2105-15-219
Activity

Views: 972

Created: 14th Sep 2020 at 13:35

Last updated: 7th Dec 2021 at 17:58

help Tags

This item has not yet been tagged.

help Attributions

None

Related items

Powered by
(v.1.13.0-master)
Copyright © 2008 - 2021 The University of Manchester and HITS gGmbH
Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig

By continuing to use this site you agree to the use of cookies