Skip to content

DaaanielTV/web-scraping-chatbot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web Scraping Chatbot

A modern web application that allows users to scrape and analyze web content through a user-friendly interface.

Features

  • 🎨 Modern, responsive design
  • 📑 Tab navigation between Web Scraping and History
  • ⚡ Asynchronous web scraping
  • 📝 History tracking
  • 🌐 Simple and intuitive user interface
  • 🔒 Error handling and input validation

Installation

Prerequisites

  • Python 3.8 or higher
  • pip (Python package manager)

Setup

  1. Clone the repository:

git clone https://github.com/DaaanielTV/web-scraping-chatbot.git cd web-scraping-chatbot

  1. Install required packages:

pip install flask httpx beautifulsoup4

Running the Application

  1. Start the Flask server:

python app.py

  1. Open your browser and navigate to:

http://localhost:5000

Project Structure

project/ ├── static/ │ ├── css/ │ │ └── style.css │ └── js/ │ └── main.js ├── templates/ │ └── index.html ├── app.py └── README.md

Usage

  1. Navigate to the Web Scraping tab
  2. Enter a valid URL in the input field
  3. Click "Send" to scrape the website
  4. View the extracted information in the chat window
  5. Check the History tab to see previous scraping results

Planned Features

Short-term

  • 📊 Database integration for history storage
  • 📤 Export functionality for scraped data
  • 🎨 Additional styling options and themes

Long-term

  • 🔐 User authentication system
  • 📱 Mobile app version
  • 🤖 Advanced scraping options
  • 📈 Data visualization
  • 🔍 Advanced search functionality
  • 💾 Local storage support
  • 🌍 Multi-language support

Contributing

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

Technical Requirements

  • Python 3.8+
  • Flask
  • httpx
  • BeautifulSoup4
  • Modern web browser with JavaScript enabled

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments

  • Flask documentation
  • BeautifulSoup4 documentation
  • httpx documentation
  • Modern web development community

Support

For support, please open an issue in the GitHub repository or contact the maintainers.

Authors


Made with ❤️ by Daniel

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published