The corpus contains approximately 21 million RTF documents (transcripts and letters) drawn from a mix of hospitals, clinics, and private practices across multiple specialties. The data comes exclusively from the US, including some overseas US territories and holdings.
The data can be segmented by region, care setting (ambulatory vs. acute care), specialty, gender, and age.
Sections and subsections are inserted as needed to further structure the narrative (e.g., VITAL SIGNS, HISTORY).
The documents often contain lists that enumerate related items (e.g., medications, problems, diagnoses).
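The section and list structure described above can be illustrated with a minimal sketch. It assumes the RTF has already been converted to plain text, that a section heading is an all-uppercase line (optionally ending in a colon), and that list items start with a number or a dash. None of these conventions is confirmed by the corpus description; the function names and the sample note are hypothetical and made up for illustration.

```python
import re
from typing import Dict, List

# Assumption: a heading is an all-uppercase line such as "VITAL SIGNS" or "HISTORY:".
HEADING_RE = re.compile(r"^([A-Z][A-Z /&-]*[A-Z]):?\s*$")
# Assumption: list items begin with "1.", "1)", "-", or "*".
LIST_ITEM_RE = re.compile(r"^\s*(?:\d+[.)]|[-*])\s+(.*\S)\s*$")


def split_sections(text: str) -> Dict[str, List[str]]:
    """Group non-empty lines under the most recent uppercase heading."""
    sections: Dict[str, List[str]] = {"PREAMBLE": []}
    current = "PREAMBLE"  # holds text that appears before any heading
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        match = HEADING_RE.match(stripped)
        if match:
            current = match.group(1)
            sections.setdefault(current, [])
        else:
            sections[current].append(stripped)
    return sections


def list_items(lines: List[str]) -> List[str]:
    """Return the text of lines that look like list items, markers removed."""
    items = []
    for line in lines:
        match = LIST_ITEM_RE.match(line)
        if match:
            items.append(match.group(1))
    return items


# Made-up sample text, not taken from the corpus.
sample = """Patient seen in clinic.
VITAL SIGNS
BP 120/80, HR 72
MEDICATIONS:
1. Lisinopril 10 mg daily
2. Metformin 500 mg twice daily
"""

sections = split_sections(sample)
medications = list_items(sections["MEDICATIONS"])
```

Real documents will likely need looser heading rules (mixed case, inline headings, nested subsections), so the two regular expressions should be treated as starting points to tune against actual samples.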