Category "web-scraping"

How to scrape data from Twitter without its API using BeautifulSoup

I'm currently trying to scrape some data from Twitter, like username, screen name, the content of the tweet etc. But I've run into some problems: I've been tryi

find_all() prints everything twice

I just started my first web scraping project, and for some reason when I try to run this simple code, it prints all of the headlines twice. I have no idea why

Scraping multiple sites in one Scrapy spider

I am scraping 6 sites in 6 different spiders. But now, I have to scrape these sites in one single spider. Is there a way of scraping multiple links in the same

What's the best way to manually save a webpage's HTML so I can practice my parsing skills? (Right-clicking in Chrome isn't working)

I am just getting started on my first web scraping project. Before I go and install a headless browser, etc., I thought I would just save the page manually and wo

How to scrape related searches on Google?

I'm trying to scrape Google for related searches when given a list of keywords, and then output these related searches into a CSV file. My problem is getting be

Set a page limit in Scrapy

I'm scraping https://myanimelist.net/anime.php#/, where you can see there is a genres section. I want to return only the first 18 pages as a CSV and stop before explicit

BS4 - 'NoneType' object has no attribute 'findAll' when scanning spans on an Amazon page

I'm following a Udemy course on learning BS4 and it seems to be a bit outdated so I'm having trouble with this part. The objective is to scrape the price of thi

Python Web Scrape Request resulting in a 406 Error

I am trying to scrape https://registry.verra.org/app/search/VCS/All%20Projects for a school project. I am trying to send a request to the "download excel" butto

Python & Selenium: How to get values generated by JavaScript

I use Selenium in Python for scraping. I can't get some values even though they are displayed in the browser. So I checked the HTML source code and found that

Scrape a website running Meteor using Python requests

You see, there is this website called Edabit. All you need to know is that I want to scrape the website without using Selenium. I want to learn how. What does s

Why does my Selenium script work only every other time?

I think I should start off by mentioning that I'm on macOS and using the Safari browser. This script is a web scraper that is supposed to search for a given type of

How to select and scrape specific text out of a bunch of <ul> and <li> elements?

I need to scrape "2015" and "09/09/2015" from the link below: lacentrale.fr/auto-occasion-annonce-87102353714.html But since there are many li and ul elements, I can't sc

Selenium won't load specific webpages on bet365

I am trying to scrape data from bet365 for basketball odds; however, I am encountering an issue where certain leagues' webpages won't load even when simply just loading the

How to scrape positions from the sofifa website: text inside a span with BeautifulSoup

So I am web scraping the sofifa website into a workable CSV. Each player gets a column. My main problem is the position section of the website is only exportin

Scrapyd-client schedule.json produces an AttributeError

I am trying to deploy my Scrapy spider using Scrapyd and Scrapyd-client. I have managed to successfully create a project containing my spider, but when I try to

Web-scrape a product price with JavaScript

I'm making a blog where I will show a product and all of its prices on all the websites that sell this product, but I want the prices to be up to date, so I deci

Selenium can't find an element unless the page is inspected

I am automating with Selenium and noticed something weird: on a specific webpage, there is a ul list containing some li roles that are buttons. For some reason,

Selenium webhook equivalent / listen to new data from Telegram without a bot

I have a web scraping project that requires the scraper to wait for when there is new data. I thought about reverse engineering the websocket but that might tak

Scraping a webpage with tabs that do not change the URL

I am trying to scrape a Nasdaq webpage and have an issue with locating elements. My code: from selenium import webdriver import time import pandas as pd driver

How to extract the price in Selenium if they have the same tag?

I would like to get the first price ($916.65) in Python, but I do not know which method I need to apply. Each price has the same tag (div class="price"). Shoul