Skip to content

This project explores a Kaggle dataset of 16,000+ used and new cars in Australia. The goal is to clean, analyze, and build predictive models for car prices.

Notifications You must be signed in to change notification settings

echandragrg/PortfolioProjects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

# πŸš— Car Price Data Science Project

## πŸ“Œ Project Overview
This project explores a Kaggle dataset of 16,000+ used and new cars in Australia.  
The goal is to clean, analyze, and build predictive models for **car prices**.

## πŸ“Š Key Steps
1. Data Cleaning:
   - Converted `Price`, `Kilometres`, `FuelConsumption`, etc. into numeric values
   - Extracted `Engine Capacity` and `Cylinders`
   - Created new features like `CarAge`, `PricePerKm`, and `LuxuryBrand`
   
2. Exploratory Data Analysis (EDA):
   - Distribution of car prices
   - Price vs Kilometres scatter plot
   - Average prices by brand
   - Correlation heatmap

3. Feature Engineering:
   - Added `LuxuryBrand` flag
   - Created `PricePerKm`
   - Derived `CarAge`

4. Predictive Modeling:
   - Built regression models to predict car price
   - Evaluated with RΒ² and MAE

5. Clustering (optional):
   - Used KMeans to group cars into clusters (e.g., economy, mid-range, luxury)

## πŸ“ˆ Tools & Libraries
- Python
- Pandas, NumPy
- Matplotlib, Seaborn
- Scikit-learn

## πŸš€ Results
- Cleaned dataset of **16,000+ cars**
- Clear insights into pricing trends
- Predictive model to estimate car price
- Visualizations to support analysis

---

About

This project explores a Kaggle dataset of 16,000+ used and new cars in Australia. The goal is to clean, analyze, and build predictive models for car prices.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published