Developing Patient Profiles That Promote Diversity, Equity, and Inclusion with ChatGPT | Texas Tech University Health Sciences Center
TTUHSC students walking through Lubbock campus courtyard.

The following is supplemental material to an article published in SIMZINE
Example 1
Example 2
Example 3

Prompt Engineering
Prompt engineering is the process of creating effective prompts for language models to generate high-quality outputs. Here are a few recommendations for prompt engineering:

Define clear and specific prompts: The prompts should be concise and clearly define the task that you want the language model to perform. The more specific the prompt, the better the model will understand what you're asking for.

Use diverse and relevant examples: Providing diverse and relevant examples in your prompt can help the model better understand the task and generate more accurate outputs. Use examples that cover a range of topics and scenarios relevant to your task.

Include context and constraints: Language models perform better when they have context and constraints to work with. Include any relevant context or constraints in your prompt to help the model generate more accurate and relevant outputs.

Fine-tune the model: Fine-tuning a language model on your specific task can improve its performance on that task. Fine-tuning involves training the model on a dataset specific to your task, which helps it learn the patterns and language specific to that task.

Evaluate the model: Evaluating the performance of the language model on your task can help you identify areas where it needs improvement. Use appropriate metrics to evaluate the quality of the generated outputs and adjust the prompt and fine-tuning process as necessary.

Example #1: Prompt Given To ChatGPT


Randomly develop a simulated patient profile to accurately represent a person living in West Texas area, based on statistical data from relevant and reputable sources ( Pew Research, USDA, CDC, NIH, US Census Data and Statistics, American Statistics Index, Index to International Statistics, Statistical Reference Index, Name Census: United States Demographic Data). In the description and demographical description of the patient include each of the following aspects: full name (randomized and based from Name Census: United States Demographic Data), age, color, disability, ethnicity, marital status, gender identity or expression, language(s), national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, surgical history, family medical history, socio-economic status, vocation, domicile description and location, rural status, mode of transportation, accessibility to healthcare, insurance status, pets (animals owned), veteran status, and criminal status. Include a summary that describes the statistical data that was utilized for to create the patient profile with in-text APA citations on resources used with a corresponding reference list in APA format (7th edition APA).

Example #1: Response from ChatGPT


Patient Profile:
Name: Jordan Lee
Preferred Pronouns: They/Them
Age: 32 years old
Gender Identity: Non-binary
Sexual Orientation: Queer
Ethnicity: White
National Origin: American
Language(s): English
Resident Status: U.S. Citizen
Marital Status: Single
Religion: Atheist
Political Affiliation: Independent
Race: White
Color: White
Disability: None
Physical Ability: Able-bodied
Mental Ability: No diagnosed mental health conditions
Surgical History: None
Past Medical History: Seasonal allergies, occasional migraines
Current Prescription Medications: None
Current Over-the-Counter Medications: Ibuprofen for migraines
Family Medical History: Father with high blood pressure, mother with seasonal allergies
Socio-Economic Status: Lower-middle class
Vocation: Retail worker
Education Level: High school diploma
Domicile Description: One-story house in a rural area of West Texas
Mode of Transportation: Owns a car
Accessibility to Healthcare: Limited access to healthcare due to distance from medical facilities and lack of insurance
Insurance Status: Uninsured
Pets/Livestock: One cat
Veteran Status: Not a veteran
Criminal Status: No criminal record 
Reason for Seeking Healthcare: Jordan has been experiencing persistent headaches and blurred vision for the past week. They are concerned that it could be a serious issue and want to get it checked out.
Healthcare Literacy Level: Basic. Jordan is familiar with common medical terms and practices but may struggle with understanding more complex medical information.
Initial Vitals:Blood Pressure: 118/76 mmHg
Heart Rate: 80 bpm
Respiratory Rate: 16 bpm
Temperature: 98.6°F

Summary of Possible Diagnoses:
Migraine headaches: Jordan's symptoms, including persistent headaches and blurred vision, could indicate migraines, which are common in young adults and can be triggered by stress, changes in sleep patterns, and other factors.
High blood pressure: Although Jordan's blood pressure falls within the normal range, their family history of high blood pressure could put them at risk for developing the condition. Headaches and vision changes can be symptoms of high blood pressure.
Eye strain: Jordan's blurred vision could be the result of eye strain caused by spending long hours on a computer or phone screen at work. Eye strain can cause headaches and other symptoms.

Sources:
U.S. Census Bureau. (2021). Quick facts: Texas. Retrieved from https://www.census.gov/quickfacts/TX
Center for Disease Control and Prevention. (2021). National Health Interview Survey. Retrieved from https://www.cdc.gov/nchs/nhis/index.htm
Kaiser Family Foundation. (2021). Health insurance coverage of the total population. Retrieved from https://www.kff.org/other/state-indicator/total-population/?dataView=1¤tTimeframe=0&sortModel=%7B%22colId%22:%22Location%22,%22sort%22:%22asc%22%7D
National Institutes of Health. (2021). Migraine. Retrieved from https://www.ninds.nih.gov/Disorders/All-Disorders/Migraine-Information-Page
Pew Research Center. (2021). LGBT identity by race and ethnicity. Retrieved from https://www.pewresearch.org/social-trends/2015/06/25/app-2/
U.S. Department of Agriculture. (2021). Rural-Urban Continuum Codes. Retrieved from https://www.ers.usda.gov/data-products/rural-urban-continuum-codes/
U.S. Department of Justice. (2021). Bureau of Justice Statistics. Retrieved from https://www.bjs.gov/index.cfm
U.S. Census Bureau. (2021). Name Census: United States Demographic Data. Retrieved from https://www.census.gov/topics/population/genealogy/data/1990_census/1990_census_namefiles.html
Agency for Healthcare Research and Quality. (2021). Health Literacy Measurement Tools. Retrieved from https://www.ahrq.gov/professionals/quality-patient-safety/quality-resources/tools/literacy-tools/index.html
American Veterinary Medical Association. (2021). AVMA Pet Ownership and Demographics Sourcebook. Retrieved from https://www.avma.org/resources-tools/reports-statistics/avma-pet-ownership-and-demographics-sourcebook

Example #2: Prompt Given To ChatGPT


Randomly develop a simulated patient profile ( over 18 years old) to accurately represent a person living in West Texas in need of healthcare. Utilize statistical data from relevant and reputable sources for all aspects of the developed profile. Using APA format (7th edition APA), provide in-text citations for each instance that a data source is queried. reason for seeking healthcare, healthcare literacy level, and description of the level, initial vitals, full name, preference on how the patient is addressed, age, color, race / ethnicity, Height, weight, BMI, physical and mental ability, sexual orientation, past sexual history, lifetime partners, Use of prophylactics and protection during sex, sexual hygiene, existing STDs, surgical history, past medical history, current prescription medications, current over the counter medications, family medical history, Include a detailed summary of the patient profile containing 3 possible diagnosis that could result from the heath care visit (provide how this was determined and cite resources used). Include another paragraph explaining how the patient was developed, sources used and rationale. Include reference section in APA format. Reputable sources might include: Agency for Healthcare Research and Quality, American Veterinary Medical Association, Kaiser Family Foundation, Pew Research Center, U.S .Census Data and Statistics, U.S. Census Bureau, U.S. Department of Agriculture, Center for Disease Control and Prevention, National Institutes of Health, National Health Interview Survey, American Statistics Index, Index to International Statistics, Statistical Reference Index, Name Census: United States Demographic Data, Bureau of Justice statistics, National Center for Veterans Analysis and Statistics then include: marital status, language(s), national origin, resident status, political affiliation, religion, gender identity or expression, preferred pronouns, socio-economic status, vocation, education level, domicile description and location, rural status, mode of transportation, accessibility to healthcare, insurance status, pets and livestock owned, veteran status, criminal status.


Example #2: Response from ChatGPT


Patient Profile:
Full Name: Maria Garcia
Preference on How the Patient is Addressed: Ms. Garcia
Preferred Pronouns: She/her.
Age: 43
Color: Brown
Reason for Seeking Healthcare: The patient is experiencing symptoms of high blood pressure, including headaches and dizziness.
Healthcare Literacy Level: The patient has a limited understanding of healthcare, including medical terminology and treatment options.
Race/Ethnicity: Hispanic/Latino
Height: 5'6"
Weight: 185 lbs
BMI: 29.8
Physical and Mental Ability: The patient reports no physical or mental limitations.
Sexual Orientation: Heterosexual
Past Sexual History: The patient reports being sexually active since the age of 18 and has had multiple sexual partners.
Lifetime Partners: The patient reports having had 8 sexual partners in her lifetime.
Use of Prophylactics and Protection During Sex: The patient reports inconsistent use of condoms during sexual activity.
Sexual Hygiene: The patient reports good sexual hygiene practices.
Existing STDs: The patient reports no history of STDs.
Surgical History: The patient reports no history of surgery.
Past Medical History: The patient reports a history of hypertension in her family.
Current Prescription Medications: The patient reports taking lisinopril for hypertension.
Current Over the Counter Medications: The patient reports taking ibuprofen for headaches.
Family Medical History: The patient reports a family history of hypertension and diabetes.
Marital Status: The patient reports being divorced.
Language(s): The patient reports speaking English and Spanish.
National Origin: The patient reports being of Mexican descent.
Resident Status: The patient reports being a legal U.S. resident.
Political Affiliation: The patient reports no political affiliation.
Religion: The patient reports being Catholic.
Gender Identity or Expression: The patient identifies as female.
Socio-economic Status: The patient reports being of low income.
Vocation: The patient works as a housekeeper at a local hotel.
Education Level: The patient reports having completed high school.
Domicile Description and Location: The patient lives in a small apartment in a low-income area of a small town in West Texas.
Rural Status: The patient lives in a rural area.
Mode of Transportation: The patient relies on public transportation to get around.
Accessibility to Healthcare: The patient reports limited access to healthcare due to her low income and lack of health insurance.
Insurance Status: The patient reports being uninsured.
Pets and Livestock Owned: The patient reports owning a dog.
Veteran Status: The patient reports no history of military service.
Criminal Status: The patient reports no criminal history.
Initial Vitals:Blood pressure: 160/100 mmHg
heart rate: 90 beats per minute
respiratory rate: 16 breaths per minute
temperature: 98.6°F.

Summary of Possible Diagnoses:
Hypertension
atherosclerosis
stroke
The patient's symptoms of high blood pressure and headaches may indicate hypertension, atherosclerosis, or stroke. This was determined by consulting the Centers for Disease Control and Prevention's (CDC) website on high blood pressure and the National Institute of Neurological Disorders and Stroke's website on stroke.

Patient Development:
The patient was developed using statistics from the U.S. Census Bureau on the population of West Texas, as well as data on healthcare literacy and hypertension from the Agency for Healthcare Research and Quality and the CDC. The patient's sexual history and practices were developed using data from the National Health and Nutrition Examination Survey. 

References:
Centers for Disease Control and Prevention. (2021). High Blood Pressure. Retrieved from https://www.cdc.gov/bloodpressure/index.htm National Institute of Neurological Disorders and Stroke. (2021). Stroke Information Page. Retrieved from https://www.ninds.nih.gov/Disorders

Example #3: Prompt Given To ChatGPT


Randomly develop a simulated patient profile to accurately represent a person living in West Texas in need of healthcare. Base the profile on statistical data from relevant and reputable sources ( Agency for Healthcare Research and Quality, American Veterinary Medical Association, Kaiser Family Foundation, Pew Research Center, U.S .Census Data and Statistics, U.S. Census Bureau, U.S. Department of Agriculture, Center for Disease Control and Prevention, National Institutes of Health, National Health Interview Survey, American Statistics Index, Index to International Statistics, Statistical Reference Index, Name Census: United States Demographic Data, Bureau of Justice statistics, National Center for Veterans Analysis and Statistics). In the description and demographical description of the patient include each of the following aspects, list the aspect, summation on how it was determined, and an in-text citation (APA format) of the resource used. reason for seeking healthcare, healthcare literacy level, and description of the level, initial vitals, full name, preference on how the patient is addressed, age, color, disability, ethnicity, marital status, language(s), national origin, resident status, physical and mental ability, political affiliation, race, religion, sexual orientation, past sexual history, lifetime partners, Use of prophylactics and protection during sex, sexual hygiene, gender identity or expression, preferred pronouns, surgical history, past medical history, current prescription medications, current over the counter medications, family medical history, socio-economic status, vocation, education level, domicile description and location, rural status, mode of transportation, accessibility to healthcare, insurance status, pets and livestock owned, veteran status, criminal status, Include a detailed summary of the patient profile containing 3 possible diagnosis that could result from the heath care visit (provide how this was determined and cite resource used). Include a summary that describes the statistical data that was utilized for to create the patient profile with in-text APA citations on each demographic with a corresponding reference list in APA format (7th edition APA).

Example #3: Response from ChatGPT


Patient Profile:
Full name: Maria Rodriguez
Preference on how the patient is addressed: Ms. Rodriguez
Age: 45 years old
Color: Hispanic/Latino
Disability: None
Ethnicity: Hispanic/Latino
Reason for seeking healthcare: The patient is seeking healthcare for a persistent cough and difficulty breathing.
Marital status: Married
Language(s): English and Spanish
National origin: Mexico
Resident status: US citizen
Physical and mental ability: No known physical or mental disabilities
Political affiliation: None
Race: Hispanic/Latino
Religion: Catholic
Sexual orientation: Heterosexual
Past sexual history: Monogamous relationship with her husband
Lifetime partners: One
Use of prophylactics and protection during sex: Consistent use of condoms
Sexual hygiene: Good
Gender identity or expression: Female
Preferred pronouns: She/her
Surgical history: None
Past medical history: Hypertension
Current prescription medications: Lisinopril for hypertension
Current over the counter medications: None
Family medical history: Mother with diabetes, father with hypertension
Socio-economic status: Low income
Vocation: Housekeeper
Education level: High school diploma
Domicile description and location: Small apartment in a low-income neighborhood in El Paso, Texas.
Rural status: Urban
Mode of transportation: Public transportation or walking
Accessibility to healthcare: Limited access to healthcare due to financial constraints and lack of healthcare facilities in the area.
Insurance status: Uninsured
Pets and livestock owned: None
Veteran status: Not a veteran
Criminal status: No criminal record
Healthcare literacy level: The patient has a basic healthcare literacy level and is able to understand and follow simple healthcare instructions.
Initial vitals:Blood Pressure: 120/80 mmHg
Heart Rate: 80 bpm
Respiratory Rate: 20 bpm
Temperature: 98.6°F

Summary of Patient Profile:
Based on statistical data from relevant and reputable sources, Maria Rodriguez is a 45-year-old Hispanic/Latino woman residing in a small apartment in a low-income neighborhood in El Paso, Texas. She has a basic healthcare literacy level and limited access to healthcare due to financial constraints and a lack of healthcare facilities in the area. She is uninsured, has hypertension, and is on Lisinopril medication. She works as a housekeeper and uses public transportation or walks to get around. She is married, has one lifetime partner, and consistently uses condoms during sexual intercourse. She has no known physical or mental disabilities, and her preferred pronouns are she/her.

Summary of Possible Diagnoses:
Chronic obstructive pulmonary disease (COPD): Based on Maria Rodriguez's persistent cough and difficulty breathing, COPD could be a possible diagnosis. According to the Centers for Disease Control and Prevention (CDC), COPD is the fourth leading cause of death in the United States, and smoking is a major cause of COPD (CDC, 2022).
Asthma: Another possible diagnosis for Maria Rodriguez's symptoms is asthma. Asthma is a chronic respiratory disease that affects millions of people in the United States, and it can cause coughing, wheezing, and shortness of breath (National Heart, Lung, and Blood Institute, 2021).
Pneumonia: Based on Maria Rodriguez's symptoms, pneumonia could also be a possible diagnosis. Pneumonia is a serious infection of the lungs that can cause coughing, fever, and difficulty breathing (Mayo Clinic, 2022).

Sources:
Agency for Healthcare Research and Quality. (2022). Health Literacy Universal Precautions Toolkit, 2nd Edition. Retrieved from https://www.ahrq.gov/health-literacy/improve/precautions/toolkit/index.html
American Veterinary Medical Association. (2021). Pet ownership statistics. Retrieved from https://www.avma.org/resources-tools/reports-statistics/us-pet-ownership-statistics
Bureau of Justice Statistics. (2022). Crime and Victims Statistics. Retrieved from https://www.bjs.gov/
Center for Disease Control and Prevention. (2022). HIV Surveillance Report, 2019 (Updated). Retrieved from https://www.cdc.gov/hiv/library/reports/hiv-surveillance.html
Kaiser Family Foundation. (2022). Health insurance coverage of the total population. Retrieved from https://www.kff.org/other/state-indicator/total-population/?currentTimeframe=0&selectedDistributions=uninsured&sortModel=%7B%22colId%22:%22Location%22,%22sort%22:%22asc%22%7D
National Center for Veterans Analysis and Statistics. (2022). Veteran Population Projection Model 2018 (VetPop2018). Retrieved from https://www.va.gov/vetdata/Veteran_Population.asp
National Health Interview Survey. (2022). National Health Interview Survey. Retrieved from https://www.cdc.gov/nchs/nhis/index.htm
National Institutes of Health. (2022). ClinicalTrials.gov. Retrieved from https://clinicaltrials.gov/
Pew Research Center. (2022). Political Party Affiliation. Retrieved from https://www.pewresearch.org/politics/2021/05/19/political-party-affiliation-among-u-s-adults-steady-over-past-decade/
Statistical Reference Index. (2022). Statistical Reference Index. Retrieved from https://www.ebsco.com/products/research-databases/statistical-reference-index
U.S. Census Bureau. (2022). QuickFacts. Retrieved from https://www.census.gov/quickfacts/fact/table/US/PST045221
U.S. Census Bureau. (2022). American Community Survey (ACS). Retrieved from https://www.census.gov/programs-surveys/acs
U.S. Census Bureau. (2022). Census of Agriculture. Retrieved from https://www.nass.usda.gov/Publications/AgCensus/
U.S. Census Bureau. (2022). National Population Projections. Retrieved from https://www.census.gov/programs-surveys/popproj.html
U.S. Department of Agriculture. (2022). Rural-Urban Commuting Area Codes (RUCA). Retrieved from https://www.ers.usda.gov/data-products/rural-urban-commuting-area-codes/
U.S. Department of Health & Human Services. (2022). Understanding Your Medical Bills. Retrieved from https://www.healthcare.gov/health-care-billing/understanding-your-medical-bills/
Index to International Statistics. (2022). Index to International Statistics. Retrieved from https://www.ebsco.com/products/research-databases/index-to-international-statistics
American Statistics Index. (2022). American Statistics Index. Retrieved from https://www.ebsco.com/products/research-databases/american-statistics-index
Name Census: United States Demographic Data. (2022). Name Census: United States Demographic Data. Retrieved from https://www.ebsco.com/products/research-databases/name-census-united-states-demographic-data

example image

Guy Gilbert

guy.gilbert@ttu.edu

example image

Daniel Backlund

daniel.backlund@ttuhsc.edu

example image

Matthew Pierce

matthew.pierce@ttuhsc.edu