Web-Scraping-with-Beautiful-Soup-and-Selenium

This repository offers a guide to web scraping with Beautiful Soup and Selenium. It covers data extraction, HTML basics, XPath, and CSS selectors. Includes a project for scraping Trendyol. Ideal for beginners wanting to learn web scraping.

Stars
0
Committers
1

Web Scraping with Beautiful Soup and Selenium

Welcome to the Web Scraping with Beautiful Soup and Selenium repository! This repository provides a comprehensive guide to web scraping, from basic concepts to practical implementations.

Table of Contents

Introduction

This repository is designed to help beginners understand the basics of web scraping using Beautiful Soup and Selenium. You'll learn how to extract data from websites and perform various tasks such as navigating HTML structures, using XPath, and applying CSS selectors. The repository includes a practical project for scraping data from the Trendyol website.

Prerequisites

Before you begin, make sure you have the following installed:

  • Python 3.x
  • pip (Python package installer)

Installation

To install the required libraries

beautifulsoup4
selenium
requests

Usage

Here's a brief overview of how to use the scripts in this repository:

  1. Clone the repository:
    git clone https://github.com/Memirdogan/Web-Scraping-with-Beautiful-Soup-and-Selenium.git
    
  2. Navigate to the project directory:
    cd Web-Scraping-with-Beautiful-Soup-and-Selenium
    

Project Structure

Web-Scraping-with-Beautiful-Soup-and-Selenium/
├── .idea/
├── Beautiful Soup’a Giriş/
├── Bot Tespiti ve Proxy/
├── HTML Temelleri/
│   └── Uygulama
├── Selenium'a Giriş/
├── TrendyolPhoneData/
│   ├── TrendyolPhoneData.py
│   ├── main.py
│   ├── DataClear.py
│   ├── datasetanalyser.py
│   └── ...
└── XPath ve CSS Seçicilere Giriş/

Topics Covered

  • Introduction to Beautiful Soup
  • Bot Detection and Proxy
  • HTML Basics
  • Introduction to Selenium
  • TrendyolPhoneData Project
  • Introduction to XPath and CSS Selectors

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.