Fetch-News-with-Web-Scraping Readme

This Jupyter notebook scrapes news articles from a website and stores them in a pandas DataFrame. It uses the requests library to download web pages and the BeautifulSoup library to parse them.

Installation

Before running the notebook, install the required dependencies by executing the following commands in a code cell:

!pip install html5lib
!pip install beautifulsoup4
!pip install requests pandas
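
To confirm that the installation worked before running the rest of the notebook, a small check like the following can help. This is only a sketch: the helper name `missing_modules` is hypothetical, and the module list assumes the notebook needs `bs4`, `html5lib`, `requests`, and `pandas`, as the install and import cells suggest.

```python
from importlib.util import find_spec

# Import names the notebook appears to rely on (an assumption based on this README).
REQUIRED_MODULES = ["bs4", "html5lib", "requests", "pandas"]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
print("Missing modules:", missing or "none")
```

If the list is not empty, rerun the install commands for the modules it names.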

Usage

Import the necessary libraries by executing the code cell:

from bs4 import BeautifulSoup
import requests
import pandas as pd

Set up the configuration by defining the types of news and the base URL in the code cell:

types = {
    "politics": "سياسة",
    "ebusiness": "اقتصاد",
    "culture": "ثقافة",
    "sport": "رياضة",
    "arts": "فن",
    "tech": "تكنولوجيا",
    "turath": "تراث",
    "midan": "ميدان"
}

base_url = "https://1-a1072.azureedge.net"
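
As an illustration of how this configuration might be used, the sketch below builds one URL per news type. The URL layout (`base_url` followed by the dictionary key) is an assumption and is not confirmed by this README, and `section_url` is a hypothetical helper, not a function from the notebook.

```python
# Shortened copy of the configuration above, so the example runs on its own.
types = {"politics": "سياسة", "sport": "رياضة"}
base_url = "https://1-a1072.azureedge.net"


def section_url(base, key):
    """Join the base URL and a section key (assumed URL layout)."""
    return f"{base.rstrip('/')}/{key.strip('/')}"


# Map each section key to the page it is assumed to live at.
section_urls = {key: section_url(base_url, key) for key in types}
for key, url in section_urls.items():
    print(key, url)
```

The dictionary keys serve as URL path segments here, while the Arabic values are the labels that could be stored in the Type column.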

Download the HTML of the news pages by executing the code cell:

for key in types:
    downlaod_html_news(key)
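
The download helper itself is defined in the notebook and is not shown here. A minimal sketch of what such a helper might look like follows. The function name `download_section_html`, the `html` output folder, and the URL layout are all assumptions; the `session` argument is expected to behave like a `requests.Session`.

```python
from pathlib import Path


def download_section_html(session, base_url, key, out_dir="html"):
    """Fetch one section page and save it as <out_dir>/<key>.html; return the path."""
    url = f"{base_url.rstrip('/')}/{key}"  # assumed URL layout
    response = session.get(url, timeout=30)
    response.raise_for_status()  # stop on HTTP errors instead of saving an error page
    path = Path(out_dir) / f"{key}.html"
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(response.text, encoding="utf-8")
    return path


# Example (performs real network requests, so it is left commented out):
# import requests
# with requests.Session() as session:
#     for key in types:
#         download_section_html(session, base_url, key)
```

Passing the session in, rather than calling `requests.get` inside the function, reuses one connection across sections and makes the helper easy to exercise offline with a stand-in object.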

Extract the news articles for each type and append them to a Pandas DataFrame by executing the code cell:

def extract_news_with_type_and_append_to_dataframe(_type):
    # ...
    return data

# Collect one DataFrame per news type, then concatenate once.
# Each frame is expected to have the columns: Type, Header, Body, Date, URL.
frames = [extract_news_with_type_and_append_to_dataframe(key) for key in types]
data = pd.concat(frames, ignore_index=True)
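
The body of the extraction function is elided above. To give a feel for the parsing step, here is a sketch that runs BeautifulSoup over a small made-up HTML sample. The markup (`article`, `a`, `p`, `time` tags), the sample link, and the helper name `parse_articles` are all assumptions; the real site's structure and the notebook's selectors may differ. It uses Python's built-in `html.parser` backend so the example does not depend on html5lib.

```python
from bs4 import BeautifulSoup

# Made-up sample markup; the real pages will look different.
SAMPLE_HTML = """
<article>
  <h2><a href="/sport/2023/1/1/example">Example headline</a></h2>
  <p class="summary">Example body text.</p>
  <time datetime="2023-01-01">1 Jan 2023</time>
</article>
"""


def parse_articles(html, news_type, base_url):
    """Return one dict per <article>, keyed by the DataFrame column names."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for article in soup.find_all("article"):
        link = article.find("a", href=True)
        if link is None:
            continue  # skip blocks without a link to the full story
        summary = article.find("p")
        time_tag = article.find("time")
        rows.append({
            "Type": news_type,
            "Header": link.get_text(strip=True),
            "Body": summary.get_text(strip=True) if summary else "",
            "Date": time_tag.get("datetime", "") if time_tag else "",
            "URL": base_url.rstrip("/") + link["href"],
        })
    return rows


rows = parse_articles(SAMPLE_HTML, "sport", "https://1-a1072.azureedge.net")
print(rows)
```

A list of dicts like this can be turned into a frame with `pd.DataFrame(rows, columns=["Type", "Header", "Body", "Date", "URL"])`, which matches the column layout used in this README.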

View the extracted news articles by executing the code cell:

data

AbdelrhmanSror/Fetch-News-with-Web-Scraping
