Skip to content

ValeryVerkhoturov/gemini-webscrapping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Gemini webscrapper

It`s required to extract info from BBB, YouTube and unstructured US news sites.

Example

Setup

Add GEMINI_API_KEY and YOUTUBE_API_KEY to /.env file.

Run benchmark

It is required to launch program with USA IP address.

go mod tidy
go run .

Architecture

@startuml
package "System" {

  component "URL Input" as urlInput
  component "URL Classifier" as urlClass
  component "YouTube API Client" as ytClient
  component "BBB API Client" as bbbClient
  component "Rate Limiter for Gemini" as geminiLimiter
  component "Gemini API Client" as geminiClient
  component "Output Formatter" as output

  urlInput -right-> urlClass
  urlClass ..> ytClient : <<youtube url>>
  urlClass ..> bbbClient : <<bbb.org url>> 
  urlClass ..> geminiLimiter : <<other url>>
  geminiLimiter -right-> geminiClient
  ytClient -right-> output
  bbbClient -right-> output
  geminiClient -right-> output
  
}
@enduml

About

Web scrapping reviews of contractors using Gemini

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages