Data Analysis

Predicting Stock Prices Using Social Network Sentiment
February 20, 2023

The emergence of social media has provided a vast source of data that can be used to predict market trends. In this project, we explore the use of sentiment analysis on tweets to predict future stock prices. By combining stock price data with sentiment scores from Twitter, we employ machine learning models such as
long short term memory and convolutional neural networks to create a powerful predictive tool for stock price forecasting. Our aim is to contribute to the field of quantitative finance by demonstrating the potential of social media data as a valuable resource for stock market prediction.

Image via www.vpnsrus.com

Age Friendly Northfield

June 4, 2022

In this statistical analysis, we set out to understand the in-home services and supports that are most important to the elder community in Northfield, MN, based on the results of a
survey conducted by Age Friendly Northfield Health and Wellness Domain Team. By gathering data on the responses to the survey, we used statistical techniques to identify patterns and trends in the data. Our analysis aims to provide a quantitative assessment of the services that are most valued by the elder community, and to explore any service gaps that may exist in each subgroup.


Understanding Alcohol Consumption

March 16, 2022

This is a Shiny app that allows users to explore the complex relationship between alcohol consumption and a variety of socioeconomic factors such as GDP, the Happiness Index, literacy rate and homicide rate in countries around the world. The app not only provides basic data visualization but also regression fitting and machine learning model fitting to further help users understand how alcohol consumption may be affected by socioeconomic factors.


COVID-19 and Vaccination Tracker
March 3, 2022

This is a Shiny app that allows users to track the latest COVID-19 cases and vaccination rates in Asian countries using time series plots and maps. By gathering data from reliable sources and visualizing it in an interactive and user-friendly interface, the app provides an up-to-date and comprehensive overview of the global pandemic.


Does Vaccination Rate Impact Coronavirus Cases
February 10, 2022

Do vaccines actually work to reduce the spread of coronavirus? This report aims to answer this question using
statistical analysis and data visualization. By gathering data on vaccination rates and COVID-19 case counts in Canada, Mexico, and USA, we used statistical techniques to determine whether there is a relationship between the two variables.


Analysis of Airbnb Prices in Amsterdam
November 24, 2021

To combat over-tourism, the Amsterdam city council introduced a ban against short term Airbnb rentals in its city center. This statistical analysis aims to explore one aspect of this issue by investigating the factors that impact the price of rental bookings in Amsterdam. By using a multivariate regression model, we sought to uncover which variables have a significant effect on the cost of Airbnb rentals in Amsterdam. The results of our analysis provide valuable insight into the ongoing debate about the ban and the future of tourism policy in the city.


The Dramatic US Presidential Election of 2000
October 7, 2021

The 2000 US presidential election was one of the most controversial in history, with disputes over ballot design in the pivotal state of Florida leading to George Bush narrowly defeating Al Gore. Voters claimed the Reform Party candidate, Pat Buchanan, received an unexpectedly high number of votes that may have otherwise gone to Gore. This statistical analysis examines Buchanan’s vote share and compares it to expectations using a
simple linear regression model to see if he really did unexpectedly snag a significant number of votes.


Modeling PPE Consumption and Shortage
August 26, 2020

In the midst of the COVID-19 pandemic, personal protective equipment (PPE) has become a critical resource for medical professionals and front-line workers. With the sudden surge in coronavirus cases, many countries are facing a shortage of PPE, making it important to be able to model and predict its consumption and shortage. This statistical analysis report sets out to explore the relationships between COVID-19 cases and PPE consumption by applying and comparing multiple regression models including KNN regression, multiple linear regression, and SVM regression.


How Do Family Factors Affect Student Performance
March 16, 2020

Do family life and background contribute to students’ failure in school? This report delves into the relationship between family life and background and academic performance among secondary school students in Portugal. Using robust statistical methods, including multivariable regression and permutation tests, we aim to shed light on the impact of social factors on student achievement. This analysis will provide valuable insights for educators, policymakers, and families, as they work to support the academic success of students.

css.php