Healthcare Innovations and Translational Informatics lab (HITI) @ Emory


Machine learning for Health, Imaging Informatics

Mammography dataset creation



This project aims to build deep learning algorithms to improve breast cancer screening, including decreasing recall rates, reducing biopsies, or automatically detecting negative studies. To accomplish this, we must build a dataset of mammography images with ground truth labels extracted from radiology and pathology reports. 
The project requires NLP expertise to extract ground-truth labels from free-text radiology and pathology reports, which would then be used to annotate the mammographic images. Once this is achieved, we will require computer vision/DL expertise to predict outcomes based on the imaging. 
Students will have access to raw clinical and imaging data from Emory.

Publications


Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers


Dmytro S. Lituiev, Hari Trivedi, Maryam Panahiazar, Beau Norgeot, Youngho Seo, Benjamin L. Franc, Roy Harnish, Michael Kawczynski, Dexter Hadley


Journal of Digital Imaging, vol. 32(2), 2019 Feb 31, pp. 228-233


Large Scale Semi-Automated Labeling of Routine Free-Text Clinical Records for Deep Learning


Hari M. Trivedi, Maryam Panahiazar, April Liang, Dmytro Lituiev, Peter Chang, Jae Ho Sohn, Yunn-Yi Chen, Benjamin L. Franc, Bonnie Joe, Dexter Hadley


Journal of Digital Imaging, vol. 32(1), 2019 31, pp. 30-37