Job Data Scraper

This project is a job data scraper that extracts job applicant details from the German Federal Employment Agency's website. The data is fetched, parsed, and saved in both JSON and Excel formats.

Prerequisites

Python 3.8 or higher
pip (Python package installer)
Google Chrome browser

Installation

Clone the repository:

git clone https://github.com/faisal-fida/job-data-scraper.git
cd job-data-scraper

Create a virtual environment:
```
python -m venv venv
```
Activate the virtual environment:
- On Windows:
```
venv\Scripts\activate
```
- On macOS/Linux:
```
source venv/bin/activate
```
Install the required packages:
```
pip install -r requirements.txt
```
Download the Playwright browser driver:
```
python -m playwright install chrome
```

Configuration

Copy the config.py file to the same directory as app.py and url_fetcher.py.

Usage

Fetch URLs and extract data:
```
python app.py
```
The extracted data will be saved in the

output

directory as data.json and parsed_data.xlsx.

Logging

The application logs its activities, which can be helpful for debugging and monitoring. The logs are printed to the console.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitignore		.gitignore
README.md		README.md
app.py		app.py
data_parser.py		data_parser.py
requirements.txt		requirements.txt
url_fetcher.py		url_fetcher.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Job Data Scraper

Table of Contents

Prerequisites

Installation

Configuration

Usage

Logging

About

Releases

Packages

Languages

faisal-fida/Job-Data-Scraper

Folders and files

Latest commit

History

Repository files navigation

Job Data Scraper

Table of Contents

Prerequisites

Installation

Configuration

Usage

Logging

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages