Skip Navigation

Scout Archives

Home Projects Publications Archives About Sign Up or Log In

Browse Resources

Data mining -- Automation -- Software

Resources

Mechanical Soup

MechanicalSoup is a Python library that allows programs to interact with websites. Rather than running a full web browser, it combines the Requests library for http sessions and the BeautifulSoup library for HTML document parsing. This approach consumes fewer resources than a full browser would and does not require the installation of additional external tools. However, it also lacks a Javascript...

https://mechanicalsoup.readthedocs.io/en/stable/
Screenshot
Web Scraper

Web Scraper is a tool for extracting data from websites. Users provide a sitemap that specifies which pages on a site to visit and what elements on each page to extract data from. Web Scraper is able to extract data from dynamic and interactive pages that often confuse other tools. It can also simulate button clicks and scroll pages to make them load additional data. Detailed documentation on...

https://www.webscraper.io/