Selenium_Webscraping_Kalibrr

Description

This project showcases web scraping of job postings from the Kalibrr website using a combination of BeautifulSoup and Selenium. It provides a deep dive into extracting relevant job data and serves as a guide on using these tools effectively together and deploying it on Flask app.

Project milestones:

Setting Up the Web Scraping Environment: Preparing your system for scraping with Selenium and BeautifulSoup.
Data Extraction: Accessing and extracting relevant job posting data from Kalibrr.
Data Visualization: Creating plots based on the extracted data
Data Presentation: Using Flask to visually present scraped data.

Getting Started

Prerequisites

Python 3.10
A compatible web browser (e.g., Chrome, if using ChromeDriver with Selenium)
Jupyter Notebook (optional)

Installation

To set up the environment:

Clone this repository:
git clone https://github.com/audichandra/Selenium_Webscraping_Kalibrr.git
Navigate to the directory:
cd Selenium_Webscraping_Kalibrr
Install the required packages:
pip install -r requirements.txt

File Structure

img/ : Contains the image file for the example that are used in readme.md
templates/: Contains HTML files for Flask visualization.
static/: Contains static resources like CSS and JavaScript for Flask.
Selenium web scraping Kalibrr.py: Python script detailing the web scraping process.
app.py: Flask application script to showcase the results.
README.md: The file you're currently reading.

Usage

After you installed the required packages, you can navigate into app.py file manually and run it. Then, open your browser and navigate to http://127.0.0.1:5000 to see the visualized job listing data. For a detailed explanation and code walkthrough, please refer to Selenium Webscraping Kalibrr Notebook.

Results

Below are some visual results obtained from the scraped data:

This graph shows the distribution of job postings by Indonesia top 10 areas, indicating the cities with the highest demand

The above visualization gives insights into how long the companies will open their job postings

Acknowledgements

Authors: Audi Chandra
License: MIT License
A nod to Kalibrr for providing a platform filled with rich job posting data.
Heartfelt gratitude to Algoritma Data Science School for making available the base example of the project and providing a learning opportunity.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Selenium_Webscraping_Kalibrr

Table of Contents

Description

Getting Started

Prerequisites

Installation

File Structure

Usage

Results

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
img		img
static		static
templates		templates
LICENSE		LICENSE
README.md		README.md
Selenium web scraping Kalibrr.ipynb		Selenium web scraping Kalibrr.ipynb
app.py		app.py
requirements.txt		requirements.txt

License

audichandra/Selenium_Webscraping_Kalibrr

Folders and files

Latest commit

History

Repository files navigation

Selenium_Webscraping_Kalibrr

Table of Contents

Description

Getting Started

Prerequisites

Installation

File Structure

Usage

Results

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages