Bing Scraper

The bingscraper is python3 package which extracts the text and images content on search engine bing.com.

It helps the user in a way that he/she will be getting only meaningful results and images for their search query. It does not download the ad content and hence saving data for the user.

The script working in background requests for a search term and creates directory (if not made previously) in the root directory of the script where all the content of the related particular search is stored. This script will be downloading the hypertext and hyperlink to that text and saving it to a .txt file within the directory made by itself. This directory saves the text content as well as the images downloaded using the script.

Requirements

Modules:

a. requests: For requesting content through two HTTPS Methods: GET and POST. Used GET Method.

b. BeautifulSoup: For creating JSON like dictionary using HTML Parser. Package uses bs4.

c. os: For checking and making directories.

d. PIL.Image: Pillow Module. For extracting image content.

e. io.ByteIO: For saving the extracted image using the PIL.Image.
Internet Connection: Continuous high speed internet connection is required for the proper function of the python package as it continuously creates the copy of the images into the local machine.
Python: Version 3.6.4 or above. This package is written in python 3.6.4

Installation

For python installation:

pip install bingscraper or python -m pip install bingscraper

For Anaconda installation:

conda install bingscraper

How to use

Install the above modules. Successful import of bingscraper depends only after the above imports.

Sample code in python:

import bingscraper as bs

search = str(input())

bs.scrape(search).text() #For Text Scraping.

bs.scrape(search).image() #For Image Scraping.

OR

from bingscraper import scrape

search = str(input())

scrape(search).text() #For Text Scraping.

scrape(search).image() #For Image Scraping.

`scrape()` takes a string argument and the `.text()` or `.image()` does the scraping work.

How to cite the project?

If the tool has been helpful to you and wish to cite it, you're requested to cite it as follows:

@misc{sachan2018bingscraper,
      title={bingscraper • pypi},
      author={Sachan, Anubhav},
      year={2018},
      url={https://pypi.org/project/bingscraper/}
}

For other formats, cite as per Google Scholar

Change Log

Version 2.0:

Separated .text() and .image(). Use as per requirement.

Version 3.0:

Minor Changes.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
bingscraper.egg-info		bingscraper.egg-info
bingscraper		bingscraper
build/lib/bingscraper		build/lib/bingscraper
dist		dist
LICENSE		LICENSE
README.md		README.md
README_DESC.md		README_DESC.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bingscraper.egg-info

bingscraper.egg-info

bingscraper

bingscraper

build/lib/bingscraper

build/lib/bingscraper

dist

dist

LICENSE

LICENSE

README.md

README.md

README_DESC.md

README_DESC.md

setup.py

setup.py

Repository files navigation

Bing Scraper

Requirements

Installation

How to use

`scrape()` takes a string argument and the `.text()` or `.image()` does the scraping work.

How to cite the project?

Change Log

Version 2.0:

Version 3.0:

About

Releases

Packages

Languages

License

anubhav4sachan/bing-scraper

Folders and files

Latest commit

History

Repository files navigation

Bing Scraper

Requirements

Installation

How to use

scrape() takes a string argument and the .text() or .image() does the scraping work.

How to cite the project?

Change Log

Version 2.0:

Version 3.0:

About

Topics

Resources

License

Stars

Watchers

Forks

Languages

`scrape()` takes a string argument and the `.text()` or `.image()` does the scraping work.