type: Post
status: Published
slug: weather_prediction
summary: This is to create the best model that can predict if it is going to be cloudy tomorrow for 10 locations in Australia.
category: Artificial Intelligence
tags: R, Data Analysis, Data Modelling
date: Jun 28, 2023 02:34 AM
URL: weather_prediction
jasonxsiu, updated Aug 8, 2021

Cloudy or not in Australia?

What is this?

The goal is to build the best model for predicting whether it will be cloudy tomorrow at 10 locations in Australia.

Why am I doing it? (Motivation)

To apply the concepts learnt at school, and to compare different machine learning models in R and find the best one.

The modelling process

  1. Data exploration
  1. Data pre-processing
  1. Split the data into training and testing sets (70% and 30%)
  1. Implement classification models to predict cloudiness: Decision Tree, Naïve Bayes, Bagging, Boosting, Random Forest
  1. Compare the models based on their confusion matrices and report the accuracy of each
  1. Compare the models based on AUC
  1. Choose the best model
  1. Implement an Artificial Neural Network classifier and report its performance
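
The split, fit and compare steps above can be sketched in R roughly as follows. This is only a minimal illustration, not the code used in this project: the data is synthetic, the columns `Humidity` and `Pressure` are hypothetical stand-ins, the 0/1 coding of `CloudTomorrow` is assumed, and only a decision tree (via `rpart`) is fitted. The AUC is computed with a small hypothetical helper, `auc_rank`, based on the rank-sum (Mann-Whitney) formula, so no extra package is needed.

```r
library(rpart)  # recommended package shipped with standard R installations

set.seed(42)  # arbitrary seed, for reproducibility only

# Synthetic stand-in data (assumption): the real data set is not reproduced here.
n <- 500
humidity <- runif(n, 20, 100)
pressure <- rnorm(n, 1015, 7)
# Make cloudiness more likely at high humidity and low pressure.
p_cloudy <- plogis(0.06 * (humidity - 60) - 0.15 * (pressure - 1015))
weather <- data.frame(
  Humidity = humidity,
  Pressure = pressure,
  CloudTomorrow = factor(rbinom(n, 1, p_cloudy), levels = c(0, 1))
)

# Step 3: 70% / 30% train-test split
train_rows <- sample(seq_len(n), size = floor(0.7 * n))
train <- weather[train_rows, ]
test <- weather[-train_rows, ]

# Step 4 (one model only): decision tree classifier
tree_fit <- rpart(CloudTomorrow ~ ., data = train, method = "class")

# Step 5: confusion matrix and accuracy on the test set
pred_class <- predict(tree_fit, newdata = test, type = "class")
conf_mat <- table(Predicted = pred_class, Actual = test$CloudTomorrow)
accuracy <- sum(diag(conf_mat)) / sum(conf_mat)

# Step 6: AUC from the rank-sum statistic (hypothetical helper)
auc_rank <- function(score, label) {
  r <- rank(score)  # average ranks, so ties are handled
  n_pos <- sum(label == "1")
  n_neg <- sum(label == "0")
  (sum(r[label == "1"]) - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
}
pred_prob <- predict(tree_fit, newdata = test, type = "prob")[, "1"]
auc <- auc_rank(pred_prob, test$CloudTomorrow)

print(conf_mat)
cat("Accuracy:", round(accuracy, 3), "AUC:", round(auc, 3), "\n")
```

The other classifiers in the list would slot into the same pattern: fit on `train`, predict on `test`, then compare accuracy and AUC side by side.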

Data source

This dataset is an edited version of the Kaggle competition data "Predict next-day rain in Australia" (Rain in Australia | Kaggle); here, however, the task is to predict whether the following day will be cloudy. The data contains a number of meteorological observations as attributes, plus the class attribute “CloudTomorrow”. For details of the decision attributes, please refer to the image "DataDesc.png".
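
As a rough sketch of how such a file might be loaded and given a first check in R: the file name, the `Location` and `Humidity` columns and the 0/1 coding of `CloudTomorrow` below are all assumptions for illustration, and a tiny stand-in CSV is written to a temporary file so the snippet runs on its own. Dropping rows with a missing class label is one possible pre-processing choice, not necessarily the one taken in this project.

```r
# Write a tiny stand-in CSV (hypothetical columns and values) to a temp file.
csv_path <- tempfile(fileext = ".csv")
writeLines(c(
  "Location,Humidity,CloudTomorrow",
  "1,55,0",
  "2,NA,1",
  "3,80,NA",
  "4,91,1"
), csv_path)

# Load the data; read.csv treats the string "NA" as missing by default.
weather <- read.csv(csv_path)

# Quick exploration: column types and missing values per column
str(weather)
missing_per_column <- colSums(is.na(weather))
print(missing_per_column)

# Pre-processing (assumed): drop rows with no class label,
# then treat the class attribute as a two-level factor.
weather <- weather[!is.na(weather$CloudTomorrow), ]
weather$CloudTomorrow <- factor(weather$CloudTomorrow, levels = c(0, 1))

# Class balance after cleaning
print(table(weather$CloudTomorrow))
```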
 