Contact us
|
Home
|
Login
| Users Online: 532
Feedback
Subscribe
Advertise
Search
Advanced Search
Month wise articles
Figures next to the month indicate the number of articles in that month
2022
January
[
3
]
2021
December
[
1
]
November
[
3
]
September
[
1
]
May
[
1
]
April
[
3
]
January
[
1
]
2020
December
[
1
]
October
[
1
]
July
[
1
]
2019
April
[
1
]
February
[
1
]
2018
December
[
1
]
September
[
1
]
June
[
1
]
May
[
2
]
April
[
3
]
2017
December
[
1
]
November
[
1
]
October
[
1
]
September
[
1
]
July
[
1
]
June
[
1
]
April
[
2
]
March
[
1
]
February
[
2
]
2016
December
[
1
]
November
[
1
]
October
[
1
]
September
[
2
]
July
[
1
]
May
[
1
]
April
[
1
]
February
[
1
]
January
[
1
]
2015
November
[
2
]
September
[
1
]
August
[
1
]
July
[
2
]
June
[
1
]
March
[
1
]
January
[
2
]
2014
November
[
1
]
September
[
1
]
August
[
1
]
July
[
3
]
March
[
1
]
2013
September
[
1
]
August
[
1
]
January
[
1
]
2012
November
[
1
]
June
[
1
]
April
[
1
]
2011
December
[
1
]
November
[
1
]
October
[
1
]
August
[
1
]
June
[
1
]
May
[
2
]
March
[
1
]
2010
October
[
1
]
May
[
1
]
» Articles published in the past year
To view other articles click corresponding year from the navigation links on the left side.
All
|
Abstracts
|
Book Review
|
Commentaries
|
Commentary
|
Editorial
|
Letters to Editor
|
Original Article
|
Original Articles
|
Original Research
|
Original Research Article
|
Research Article
|
Research Articles
|
Review Articles
|
Symposium
|
Technical Note
|
Technical Note: Software
Export selected to
Endnote
Reference Manager
Procite
Medlars Format
RefWorks Format
BibTex Format
Show all abstracts
Show selected abstracts
Export selected to
Add to my list
Technical Note:
Pathology report data extraction from relational database using R, with extraction from reports on melanoma of skin as an example
Jay J Ye
J Pathol Inform
2016, 7:44 (21 October 2016)
DOI
:10.4103/2153-3539.192822
PMID
:28066684
Background:
Different methods have been described for data extraction from pathology reports with varying degrees of success. Here a technique for directly extracting data from relational database is described.
Methods:
Our department uses synoptic reports modified from College of American Pathologists (CAP) Cancer Protocol Templates to report most of our cancer diagnoses. Choosing the melanoma of skin synoptic report as an example, R scripting language extended with RODBC package was used to query the pathology information system database. Reports containing melanoma of skin synoptic report in the past 4 and a half years were retrieved and individual data elements were extracted. Using the retrieved list of the cases, the database was queried a second time to retrieve/extract the lymph node staging information in the subsequent reports from the same patients.
Results:
426 synoptic reports corresponding to unique lesions of melanoma of skin were retrieved, and data elements of interest were extracted into an R data frame. The distribution of Breslow depth of melanomas grouped by year is used as an example of intra-report data extraction and analysis. When the new pN staging information was present in the subsequent reports, 82% (77/94) was precisely retrieved (pN0, pN1, pN2 and pN3). Additional 15% (14/94) was retrieved with certain ambiguity (positive or knowing there was an update). The specificity was 100% for both. The relationship between Breslow depth and lymph node status was graphed as an example of lesion-specific multi-report data extraction and analysis.
Conclusions:
R extended with RODBC package is a simple and versatile approach well-suited for the above tasks. The success or failure of the retrieval and extraction depended largely on whether the reports were formatted and whether the contents of the elements were consistently phrased. This approach can be easily modified and adopted for other pathology information systems that use relational database for data management.
[ABSTRACT]
[HTML Full text]
[PDF]
[Mobile Full text]
[EPub]
[Citations (4) ]
[PubMed]
[Sword Plugin for Repository]
Beta
Sitemap
|
What's New
Feedback
|
Copyright and Disclaimer
|
Privacy Notice
© Journal of Pathology Informatics | Published by Wolters Kluwer -
Medknow
Online since 10
th
March, 2010