TY - JOUR
T1 - Construction of a Social-Media Based Clinical Database - Roadmap, Challenges, and Feasibility for ADHD Recognition
AU - Gelashvili, Anton
AU - Resheff, Yehezkel S.
AU - Blumrosen, Gaddi
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024
Y1 - 2024
N2 - The shortage of available high-quality clinical databases restricts medical diagnostics downstream. Clinical databases are often limited to controlled non-natural environments, they are restricted due to privacy limitations and require complex scoring procedures that ultimately result in rater bias. Social media includes massive amounts of information on subjects through streams of text, audio, and video data that is accessible and currently underutilized for medical research. In this work we suggest a method for utilizing this information, by constructing databases for medical condition assessment. To this end we have created SMDC (Social Medical Data Constructor), a utility based on medical expert requirements. Data Features and non-confidential demographic information are extracted online, and labels are derived using data mining techniques. We examine the feasibility of the suggested technology with ADHD recognition from a database extracted from YouTube clips using the self-tagging as ADHD labels. The database maintain privacy and copywrite limitations and no personal identification is provided. To validate the database, we show a high correlation of the model labels with expert labeling (r = 0.68) and compatibility of six known ADHD motor biomarker features of hyperactivity to the ones derived using our database. We extracted from the video clips kinematics features and reached ADHD recognition accuracy of 83%, and 81%, for female sand males respectively. The suggested technology has a potential to assess natural real-life behavior properties of the medical condition and be further used for pre-training the medical condition prediction model, and consequently reduced the required clinical dataset size that can be used efficiently for model fine-tuning and clinical verification.
AB - The shortage of available high-quality clinical databases restricts medical diagnostics downstream. Clinical databases are often limited to controlled non-natural environments, they are restricted due to privacy limitations and require complex scoring procedures that ultimately result in rater bias. Social media includes massive amounts of information on subjects through streams of text, audio, and video data that is accessible and currently underutilized for medical research. In this work we suggest a method for utilizing this information, by constructing databases for medical condition assessment. To this end we have created SMDC (Social Medical Data Constructor), a utility based on medical expert requirements. Data Features and non-confidential demographic information are extracted online, and labels are derived using data mining techniques. We examine the feasibility of the suggested technology with ADHD recognition from a database extracted from YouTube clips using the self-tagging as ADHD labels. The database maintain privacy and copywrite limitations and no personal identification is provided. To validate the database, we show a high correlation of the model labels with expert labeling (r = 0.68) and compatibility of six known ADHD motor biomarker features of hyperactivity to the ones derived using our database. We extracted from the video clips kinematics features and reached ADHD recognition accuracy of 83%, and 81%, for female sand males respectively. The suggested technology has a potential to assess natural real-life behavior properties of the medical condition and be further used for pre-training the medical condition prediction model, and consequently reduced the required clinical dataset size that can be used efficiently for model fine-tuning and clinical verification.
KW - ADHD
KW - databases
KW - machine learning
KW - medical diagnosis
KW - social networks
UR - http://www.scopus.com/inward/record.url?scp=85209138059&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3483311
DO - 10.1109/ACCESS.2024.3483311
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85209138059
SN - 2169-3536
JO - IEEE Access
JF - IEEE Access
ER -