College of San Francisco Information Science Convention 2023 Datathon in partnership with AWS and Amazon SageMaker Studio Lab


As a part of the 2023 Information Science Convention (DSCO 23), AWS partnered with the Information Institute on the College of San Francisco (USF) to conduct a datathon. Contributors, each highschool and undergraduate college students, competed on an information science undertaking that centered on air high quality and sustainability. The Information Institute on the USF goals to help cross-disciplinary analysis and schooling within the area of knowledge science. The Information Institute and the Information Science Convention present a particular fusion of cutting-edge tutorial analysis and the entrepreneurial tradition of the know-how trade within the San Francisco Bay Space.

The scholars used Amazon SageMaker Studio Lab, which is a free platform that gives a JupyterLab atmosphere with compute (CPU and GPU) and storage (as much as 15GB). As a result of a lot of the college students have been unfamiliar with machine studying (ML), they got a quick tutorial illustrating easy methods to arrange an ML pipeline: easy methods to conduct exploratory information evaluation, function engineering, mannequin constructing, and mannequin analysis, and easy methods to arrange inference and monitoring. The tutorial referenced Amazon Sustainability Data Initiative (ASDI) datasets from the Nationwide Oceanic and Atmospheric Administration (NOAA) and OpenAQ to construct an ML mannequin to foretell air high quality ranges utilizing climate information through a binary classification AutoGluon mannequin. Subsequent, the scholars have been turned free to work on their very own initiatives of their groups. The profitable groups have been led by Peter Ma, Ben Welner, and Ei Coltin, who have been all awarded prizes on the opening ceremony of the Information Science Convention at USF.

Response from the occasion

“This was a enjoyable occasion, and a good way to work with others. I realized some Python coding in school however this helped make it actual. Throughout the datathon, my workforce member and I carried out analysis on completely different ML fashions (LightGBM, logistic regression, SVM fashions, Random Forest Classifier, and so on.) and their efficiency on an AQI dataset from NOAA geared toward detecting the toxicity of the ambiance below particular climate situations. We constructed a gradient boosting classifier to foretell air high quality from climate statistics.”

– Anay Pant, a junior on the Athenian Faculty, Danville, California, and one of many winners of the datathon.

“AI is turning into more and more necessary within the office, and 82% of corporations want staff with machine studying abilities. It’s important that we develop the expertise wanted to construct merchandise and experiences that we are going to all profit from, this contains software program engineering, information science, area data, and extra. We have been thrilled to assist the following era of builders discover machine studying and experiment with its capabilities. Our hope is that they take this ahead and increase their ML data. I personally hope to in the future use an app constructed by one of many college students at this datathon!”

– Sherry Marcus, Director of AWS ML Options Lab.

“That is the primary yr we used SageMaker Studio Lab. We have been happy by how shortly highschool/undergraduate college students and our graduate pupil mentors may begin their initiatives and collaborate utilizing SageMaker Studio.”

– Diane Woodbridge from the Information Institute of the College of San Francisco.

Get began with Studio Lab

In the event you missed this datathon, you’ll be able to nonetheless register for your own Studio Lab account and work by yourself undertaking. In the event you’re considering working your personal hackathon, attain out to your AWS consultant for a Studio Lab referral code, which can give your contributors rapid entry to the service. Lastly, you’ll be able to search for next year’s challenge on the USF Information Institute.


Concerning the Authors

Neha Narwal is a Machine Studying Engineer at AWS Bedrock the place she contributes to improvement of huge language fashions for generative AI purposes. Her focus lies on the intersection of science and engineering to affect analysis in Pure Language Processing area.

Vidya Sagar Ravipati is a Utilized Science Supervisor on the Generative AI Innovation Middle, the place he leverages his huge expertise in large-scale distributed methods and his ardour for machine studying to assist AWS clients throughout completely different trade verticals speed up their AI and cloud adoption.

Leave a Reply

Your email address will not be published. Required fields are marked *