Posts listed under tag: statistics
-
May 20, 2024
End to end recommendation system project covering data collection to web application deployment. The aim is to create personalized recommendations based on their preferences and viewing habits. This system also considers multiple users simultaneously for group-based suggestions.
-
Apr 28, 2024
In this notebook we will explore the datasets scraped by our webscraping scripts. Exploration in this notebook will be guided by some key questions within each section.
-
Apr 1, 2024
In this notebook we explore a few HDB Resale Prices datasets fgrom Jan1990 to Mar2024, analysing the data to answer a few common questions homebuyers have in recent times.
-
Mar 19, 2024
This notebook explores a dataset containing the top 50 bestselling books on Amazon from the years 2010 to 2020 inclusive. Data was scraped from Amazon webpages and additional information was obtained from Google Books API.
-
Jan 1, 2024
In this notebook we will be exploring rainfall patterns in Singapore, showing the seasonal patterns of rainfall and how some areas of the island receive more rainfall than others. Models will also be built and tested to forecast monthly rainfall on the island.
-
Jan 1, 2024
In this notebook we will be using an autoencoder on the fraud dataset used in a previous notebook for novelty detection. Novelty detection refers to the identification of new or unknown signals not available to a machine learning system during training. In this case it refers to training a machine learning model only on normal(non-fradulent) transactions data but the resultant model has the ability to recognise fraudulent transactions.
-
Jan 1, 2024
In this notebook we explore and analyse chat data of a controversial chat group where users discussed issues relating to the COVID-19 pandemic.
-
Jan 1, 2024
In this notebook we will be analysing smartphone sensors data collected from an experiment and analysing the information retrieved from the data, studying the extent that these data can be used to identify the user.
-
Jan 1, 2024
This notebook explores a dataset of credit card transactions over a span of two days, analysing the data and tackling the extremely imbalanced classification problem of fraud detection.
-
Jan 1, 2024
In this notebook we will be analysing and discussing Covid-19 related data from all around the world, looking at how the pandemic hits different places differently and how to understand some statistics commonly quoted on mainstream/social media.
1
numpy
pandas
matplotlib
seaborn
scikit-learn
classification
statistics
nlp
fun
scipy
dimensionality_reduction
webscrape
tensorflow
computer_vision
requests
html
bs4
transfer_learning
regression
pytorch
nltk
multiprocessing
kaggle
generative_ai
featured
competition
transformers
statsmodels
statsmodel
sql
recommendation
ollama
object_detection
langchain
forecast
flask
embedding
database
cv2
automation
api
tkinter
statistics
math
gradio