🚀 Log User Extractor

A Python script that processes log files, extracts unique user identifiers (userCode and userId), and saves them to a CSV file. It is intended for environments where extensive log files are spread across multiple directories.

✨ Features

📂 Directory Scanning: Scans specified directories for log files.
🔍 Identifier Extraction: Extracts user identifiers (userCode and userId) from log files.
💾 CSV Output: Saves unique user identifiers to a CSV file.
⚡ Parallel Processing: Handles multiple log files simultaneously, significantly reducing processing time.
🔧 Robust Logging: Enhanced error handling and logging for improved monitoring and debugging.
⚙️ Configurable Parameters: Easily specify file patterns, output file names, and log levels via a configuration file.

🛠 Requirements

🐍 Python 3.x
🐼 pandas

📥 Installation

Clone the repository:

git clone https://github.com/PKHarsimran/LogUserExtractor.git

Navigate to the project directory:
```
cd LogUserExtractor
```
Install the required Python packages:
```
pip install pandas
```

🚀 Usage

Configure the script:

Edit the config.ini file to specify your directories, file pattern, output file name, and log level:

[Paths]
log_directories = test
output_csv = extracted_user_codes.csv

[Settings]
file_pattern = .*\.log$

[Logging]
log_filename = log_user_extractor.log
log_level = INFO
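
As an illustration, a configuration like the one above could be read with Python's standard configparser module. This is a minimal sketch, not the script's confirmed implementation: the helper name load_settings and the comma-separated handling of log_directories are assumptions.

```python
import configparser
import re


def load_settings(config_text):
    """Parse config.ini-style text into a plain dict (hypothetical helper)."""
    # Disable interpolation so a literal '%' in a pattern is not misread.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(config_text)
    return {
        # Assumption: multiple directories are separated by commas.
        "log_directories": [
            d.strip()
            for d in parser.get("Paths", "log_directories").split(",")
            if d.strip()
        ],
        "output_csv": parser.get("Paths", "output_csv"),
        # Compile once so an invalid pattern fails early.
        "file_pattern": re.compile(parser.get("Settings", "file_pattern")),
        "log_filename": parser.get("Logging", "log_filename"),
        "log_level": parser.get("Logging", "log_level", fallback="INFO"),
    }


EXAMPLE_CONFIG = """
[Paths]
log_directories = test
output_csv = extracted_user_codes.csv

[Settings]
file_pattern = .*\\.log$

[Logging]
log_filename = log_user_extractor.log
log_level = INFO
"""

settings = load_settings(EXAMPLE_CONFIG)
```

Compiling file_pattern while loading means a malformed regular expression is reported before any log file is opened.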

Run the script:
```
python log_user_extractor.py
```
Check the output:

The script writes the unique user identifiers to the CSV file named by output_csv in config.ini (extracted_user_codes.csv in the example above).

📊 Workflow

Start
Load Configuration from config.ini
Initialize LogProcessor with directories and file pattern
- Input: List of directories containing log files and the file pattern to match.
Process log files
- For each directory:
  - List files in the directory.
  - For each file:
    - Check if the file matches the pattern.
    - If true, process the file.
Process the file
- Read the file line by line.
- For each line:
  - Extract userCode.
  - Extract userId.
  - Add identifiers to a set to ensure uniqueness.
Save identifiers to CSV
- Convert the set of identifiers to a DataFrame.
- Save the DataFrame to a CSV file.
End
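
The workflow above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the actual LogProcessor code: the log line format (key=value or key: value pairs), the function names, and the CSV column names are hypothetical, and the standard csv module stands in for the pandas DataFrame step so the sketch runs without extra dependencies.

```python
import csv
import os
import re

# Assumption: identifiers appear as "userCode=ABC123" or "userId: 42".
# The real log format may differ.
USER_CODE_RE = re.compile(r"userCode\s*[=:]\s*([\w-]+)")
USER_ID_RE = re.compile(r"userId\s*[=:]\s*([\w-]+)")


def extract_identifiers(directories, file_pattern):
    """Collect unique (type, value) pairs from files matching the pattern."""
    pattern = re.compile(file_pattern)
    found = set()  # a set keeps each identifier only once
    for directory in directories:
        for name in os.listdir(directory):
            path = os.path.join(directory, name)
            if not (os.path.isfile(path) and pattern.match(name)):
                continue
            # Read line by line so large files are never loaded whole.
            with open(path, encoding="utf-8", errors="replace") as handle:
                for line in handle:
                    for value in USER_CODE_RE.findall(line):
                        found.add(("userCode", value))
                    for value in USER_ID_RE.findall(line):
                        found.add(("userId", value))
    return found


def save_to_csv(identifiers, output_csv):
    """Write the identifiers, sorted, with a header row."""
    with open(output_csv, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["type", "value"])
        writer.writerows(sorted(identifiers))
```

Calling extract_identifiers(["test"], r".*\.log$") and passing the result to save_to_csv mirrors the "Process log files" and "Save identifiers to CSV" steps.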

🚀 Recent and Planned Improvements

This section tracks recent changes to the Log User Extractor script. Every item listed below is already implemented.

🎉 Recently Implemented

⚡ Parallel Processing

Status: Implemented
Details: We've introduced parallel processing to handle multiple log files simultaneously. This enhancement significantly reduces the time required to process large datasets, making the script more efficient and scalable.
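
The README does not say which mechanism the script uses for parallelism. The sketch below shows one common approach, a thread pool from the standard concurrent.futures module; the function names and the assumed log line format are hypothetical.

```python
import re
from concurrent.futures import ThreadPoolExecutor

# Assumed line format (e.g. "userCode=ABC123"); the real logs may differ.
IDENTIFIER_RE = re.compile(r"(userCode|userId)\s*[=:]\s*([\w-]+)")


def extract_from_file(path):
    """Return the set of (key, value) identifiers found in one file."""
    found = set()
    with open(path, encoding="utf-8", errors="replace") as handle:
        for line in handle:
            # findall yields (key, value) tuples because of the two groups.
            found.update(IDENTIFIER_RE.findall(line))
    return found


def extract_parallel(paths, max_workers=4):
    """Process files concurrently and merge the per-file sets."""
    merged = set()
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        for result in pool.map(extract_from_file, paths):
            merged |= result
    return merged
```

Threads suit this workload when file reading dominates. If regular-expression matching becomes the bottleneck, ProcessPoolExecutor offers the same map interface and runs the workers in separate processes.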

🔧 Enhanced Error Handling and Logging

Status: Implemented
Details: We've added robust error handling and logging mechanisms to track processing status and any issues that arise. This improvement enhances monitoring, debugging, and the overall reliability of the script.
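
As a rough illustration of what such error handling and logging might look like, the sketch below sets up a file logger from the log_filename and log_level settings and skips unreadable files instead of aborting. The helper names, the logger name, and the message format are assumptions, not the script's actual code.

```python
import logging


def configure_logging(log_filename, log_level="INFO"):
    """Send records to a file at the configured level (hypothetical helper)."""
    level = getattr(logging, log_level.upper(), None)
    if not isinstance(level, int):
        raise ValueError(f"Unknown log level: {log_level!r}")
    logger = logging.getLogger("log_user_extractor")
    logger.setLevel(level)
    handler = logging.FileHandler(log_filename, encoding="utf-8")
    handler.setFormatter(
        logging.Formatter("%(asctime)s %(levelname)s %(message)s")
    )
    logger.addHandler(handler)
    return logger


def safe_read_lines(path, logger):
    """Read a file, logging and skipping it if it cannot be opened."""
    try:
        with open(path, encoding="utf-8", errors="replace") as handle:
            return handle.readlines()
    except OSError as exc:
        # One unreadable file should not stop the whole run.
        logger.error("Could not read %s: %s", path, exc)
        return []
```

Returning an empty list on failure lets the caller keep a single code path for readable and unreadable files, while the log file records which ones were skipped.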

⚙️ Configurable Parameters

Status: Implemented
Details: Users can now specify options such as file patterns, output file names, and log levels through a configuration file. This provides greater flexibility and customization.

📈 Progress Tracking

We believe in transparency and continuous improvement. Here's a snapshot of our progress:

Parallel Processing: ✅ Completed
Enhanced Error Handling and Logging: ✅ Completed
Configurable Parameters: ✅ Completed

Stay tuned for more updates as we continue to enhance the Log User Extractor. Your feedback and contributions are always welcome!

🤝 Contributing

We welcome contributions to enhance Log User Extractor. To contribute:

🍴 Fork the repository.
🌿 Create a new branch.
💾 Make your changes and commit them.
🚀 Push to the branch.
🔄 Create a new Pull Request.

We appreciate your help in making this project better for everyone!

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📝 Acknowledgments

Special thanks to all the contributors who have helped in improving this project. Your efforts are highly valued!
