Task Website: https://n2c2.dbmi.hms.harvard.edu/2022-track-3
Registration Period: March 14 - April 25, 2022
Training Data Release: May 2, 2022
Test Data Release: July 5, 2022
System Output Due: July 8, 2022
Healthcare providers generate notes in the electronic health records (EHRs) to document the daily progress and plan. The Subjective, Objective, Assessment and Plan (SOAP) format is ubiquitous and an essential component in medical training to properly document care in the daily progress notes. The Assessment and Plan sections of the progress notes are the free-text fields where healthcare providers identify patients’ problems/diseases and treatment plans. The Assessment section summarizes the patients’ active health problems or diseases for that day. The Plan section consists of multiple subsections, each addressing a specific problem or diagnosis followed by a detailed treatment plan (e.g., Respiratory failure with MRSA pneumonia: continue seven days of vancomycin, continue mechanical ventilation with lung-protective strategy, wean oxygen). Patients typically have a main diagnosis or problem with associated conditions or downstream health effects. Healthcare providers can implicitly understand what parts of a treatment plan directly target the primary concerns of a series of health problems based on their background knowledge and medical reasoning, but this information is not easily available for downstream analysis. Automatic extraction of this kind of information can potentially be useful for downstream use cases such as problem list generation.
We formulate the physicians’ reasoning process as a *relation prediction task* between assessment and every plan subsection. Specifically, each plan subsection will be labeled with one of the four relations indicating the nature of its association with the assessment:
1. *Direct*, if the plan subsection targets the primary diagnoses or problem
2. *Indirect*, if the plan subsection lists problems/diagnoses (e.g., organ failure) associated with the primary diagnosis/problem, including other diagnoses/problems that are not part of the main diagnosis.
3. *Neither*, if the plan subsection mentions a problem/diagnosis that is not mentioned in that day’s progress note.
4. *Not Relevant,* if the plan subsection does not include a problem/diagnosis
This task utilizes progress notes from MIMIC-III Dataset sampled from all the intensive care units (https://physionet.org/content/mimiciii/1.4/). A team of medical knowledge experts annotated the progress notes for this task. Sample input and output data may be found at https://git.doit.wisc.edu/YGAO/N2C2-AP-Reasoning.
The data must be obtained through N2C2. All participants for this track must agree to MIMIC’s data usage agreement.
Participants are asked to submit a 500-word long abstract describing their methodologies. Abstracts may also have a graphical summary of the proposed architecture. The document should not exceed 2 pages (i.e., 1.5 line spacing, 12pt-font size, excluding spaces for figures). The authors of either top-performing systems or particularly novel approaches will be invited to present or demonstrate their systems at the workshop. A journal venue will be organized following the workshop.
Please join the discussion group below for announcements. Questions about the challenge can be addressed to the organizers by posting to the group (New Topic button) or sending an email to the address below. Google Group: N2C2_UW_ICU_DataSci_2022_AP_Task at googlegroups.com
-- Dr. Yanjun Gao, Postdoc Research Associate in Dept. of Medicine, SMPH, UW-Madison Ph.D. in Computer Science and Engineering, CSE, EECS, Penn State University UW-Madison Email: ygao at medicine.wisc.edu http://www.cse.psu.edu/~yug125/ -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 14268 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20220322/88a4638a/attachment.txt>