{"id":9541,"date":"2022-01-08T18:04:35","date_gmt":"2022-01-09T02:04:35","guid":{"rendered":"https:\/\/www.hmc.edu\/about-hmc\/?p=9541"},"modified":"2022-01-08T18:06:28","modified_gmt":"2022-01-09T02:06:28","slug":"harvey-mudd-college-wins-citadel-west-coast-data-open","status":"publish","type":"post","link":"https:\/\/www.hmc.edu\/about\/2022\/01\/08\/harvey-mudd-college-wins-citadel-west-coast-data-open\/","title":{"rendered":"Harvey Mudd College Wins Citadel West Coast Data Open"},"content":{"rendered":"<p>Defeating teams of graduate students, two groups of Harvey Mudd College first years placed first and third in Citadel\u2019s West Coast Data Open.<\/p>\n<p>Members of Harvey Mudd&#8217;s first-place team\u2014Milo Knell \u201925 (CS and math), Alan Wu \u201925 (CS and math), David Chen \u201925 (CS) and Forrest Bicker \u201925 (CS and math)\u2014received a $10,000 cash prize and interview offers at Citadel, a leading alternative investment manager. As winners of the West Coast regional, they qualified for the Datathon Global Championship and the opportunity to compete against other top regional teams for a $100,000 cash prize.<\/p>\n<p>The third-place finishers were Sahil Rane \u201825, Baltazar Zuniga-Ruiz \u201925, Karina Walker \u201925 and Shahnawaz Mogal \u201925 (University of Arizona). They received a $2,500 prize.<\/p>\n<p>At the competition, participants work in teams on large and complex dataset challenges impacting the global markets then present their findings to a panel of judges. Both teams were given a dataset from the research archive of Upworthy, a digital media platform often credited for the rise of overly dramatic clickbait headlines, due in large part to a series of A\/B tests they conducted from 2013 to 2015. The teams analyzed and reported on findings of a dataset of Upworthy\u2019s A\/B tests consisting of 150,817 different article packages and the respective number of clicks each received.<\/p>\n<p>\u201cGiven Upworthy\u2019s interesting reputation for clickbait, we wanted to build a machine learning model to measure whether an article is clickbait and see what it said about Upworthy\u2019s headlines,\u201d said Bicker, a member of HMC&#8217;s first-place team, whose members all share a love for computer science and machine learning. \u201cTo do this, we theorized that fake news tends to look very similar to clickbait because both aim to pull in viewers, so we trained an AI classifier on an external dataset of fake news.<\/p>\n<p>\u201cApplying the classifier on Upworthy\u2019s dataset of headlines, we found that fake news predicted clickbait more accurately than click rate alone,\u201d he said. \u201cWe found that predicted fake news is a good proxy to examine clickbait that avoids the influence of confounding variables like overall business performance and external factors that are not accounted for in the Upworthy data. Using a variety of Natural Language Processing techniques, we also found that clickbait tends to use more extreme emotional language (very positive or negative) that is potentially harmful to the public\u2019s mental health and emotional wellbeing.\u201d<\/p>\n<p>Bicker said the team took a learning-focused approach to the competition, using it as an opportunity to explore new analytical techniques. \u201cWe wanted to push ourselves to think of novel, creative solutions to the problem, so we experimented with a number of distinct approaches. It was also our priority to bring a high standard of rigor to our work, making sure not to cut corners on our analysis and budget time appropriately for quality checks,\u201d he said.<\/p>\n<p>Read the<a href=\"https:\/\/github.com\/miloKnell\/The-Data-Open-Fall-2021\/blob\/main\/Team_10_report.pdf\"> first-place team\u2019s full report<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Defeating teams of graduate students, two groups of Harvey Mudd College first years placed first and third in Citadel\u2019s West [&hellip;]<\/p>\n","protected":false},"author":47,"featured_media":9323,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[3,11,22,30],"class_list":["post-9541","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-award","category-computer-science","category-mathematics","category-students"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/posts\/9541","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/users\/47"}],"replies":[{"embeddable":true,"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/comments?post=9541"}],"version-history":[{"count":0,"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/posts\/9541\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/media\/9323"}],"wp:attachment":[{"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/media?parent=9541"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.hmc.edu\/about\/wp-json\/wp\/v2\/categories?post=9541"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}