Tue, Aug 13, 2013
Invited Talk: Applying Machine learning for Automated Identification of Patient Cohorts @ Sathyam Hall
Aug 13 @ 2:40 pm – 3:05 pm

Srisairam Achuthan, Ph.D.
Senior Scientific Programmer, Research Informatics Division, Department of Information Sciences, City of Hope, CA, USA


Applying Machine learning for Automated Identification of Patient Cohorts

Srisairam Achuthan, Mike Chang, Ajay Shah, Joyce Niland

Patient cohorts for a clinical study are typically identified based on specific selection criteria. In most cases, considerable time and effort are spent finding the most relevant criteria that could lead to a successful study. For complex diseases, this process can be more difficult and error-prone because relevant features may not be easily identifiable. Additionally, the information captured in clinical notes is in non-coded text format. Our goal is to discover patterns within the coded and non-coded fields and thereby reveal complex relationships between clinical characteristics across different patients, relationships that would be difficult to uncover manually. Toward this goal, we have applied machine learning techniques such as artificial neural networks and decision trees to identify patients sharing similar characteristics in available medical records. For this proof-of-concept study, we used coded and non-coded (i.e., clinical notes) patient data from a clinical database. Coded clinical information such as diagnoses, labs, medications and demographics recorded within the database was pooled with non-coded information derived from clinical notes, including smoking status and lifestyle (active / inactive) status. The non-coded textual information was identified and interpreted using I2E, a Natural Language Processing (NLP) tool from Linguamatics.

Wed, Aug 14, 2013
Delegate Talk: Proteomic profiling of gallbladder cancer secretome – a source for circulatory biomarker discovery @ Amriteshwari Hall, Vallikavu, Kerala, India
Aug 14 @ 12:55 pm – 1:06 pm

Tejaswini Subbannayya, Nandini A. Sahasrabuddhe, Arivusudar Marimuthu, Santosh Renuse, Gajanan Sathe, Srinivas M. Srikanth, Mustafa A. Barbhuiya, Bipin Nair, Juan Carlos Roa, Rafael Guerrero-Preston, H. C. Harsha, David Sidransky, Akhilesh Pandey, T. S. Keshava Prasad and Aditi Chatterjee


Proteomic profiling of gallbladder cancer secretome – a source for circulatory biomarker discovery

Gallbladder cancer (GBC) is the fifth most common cancer of the gastrointestinal tract and one of the common malignancies of the biliary tract (Misra et al. 2006; Lazcano-Ponce et al. 2001). It has a poor prognosis, with survival of less than 5 years in 90% of cases (Misra et al. 2003). The etiology is ill-defined. Several risk factors have been reported, including cholelithiasis, obesity, female gender and exposure to carcinogens (Eslick 2010; Kumar et al. 2006). The poor prognosis in GBC is mainly due to late presentation of the disease and the lack of reliable biomarkers for early diagnosis. This emphasizes the need to identify and characterize cancer biomarkers to aid in the diagnosis and prognosis of GBC. Secreted proteins are an important class of molecules that can be detected in body fluids and have been targeted for biomarker discovery. Proteomic interrogation of body fluids, especially plasma, faces challenges such as the low abundance of tumor-secreted proteins, high sample complexity, and the high abundance of other proteins that are not released by the tumor cells (Tonack et al. 2009). Profiling of conditioned media from cancer cell lines can be used as an alternative means to identify proteins secreted by tumor cells (Kashyap et al. 2010; Marimuthu et al. 2012). We analyzed the invasive property of seven GBC cell lines (SNU-308, G-415, GB-d1, TGBC2TKB, TGBC24TKB, OCUG-1 and NOZ). Four cell lines were selected for analysis of the cancer secretome based on the invasive property of the cells. We employed isobaric tags for relative and absolute quantitation (iTRAQ) labeling technology coupled with high-resolution mass spectrometry to identify and characterize the secretome of the four selected GBC cell lines. In total, we identified around 2,000 proteins, of which 175 were secreted at differential abundance across all four cell lines. This secretome analysis will act as a reservoir of candidate biomarkers.
Currently, we are investigating and validating these candidate markers from the GBC cell secretome. Through this study, we have shown that mass spectrometry-based quantitative proteomic analysis is a robust approach for investigating secreted proteins in cancer cells.