All Questions
278 questions
1
vote
2
answers
64
views
Extracting text from Wikisource using BeautifulSoup returns empty result
I'm trying to extract the text of a book from a Wikisource page using BeautifulSoup, but the result is always empty. The page I'm working on is Le Père Goriot by Balzac.
Here's the code I'm using:
...
1
vote
1
answer
32
views
My Beautiful Soup library is not extracting out the all the anchor elements from a listed display
Hi so I am very new to web scraping and I am trying out the basics for it. Right now, I wanted to extract links from a root website (coventry.gov.uk). The problem was, however, I could not get the ...
-2
votes
1
answer
44
views
how can i get website html code using requests?
I repeated all the steps as in the tutorial but I don’t get the code. Although a couple of days ago everything worked
I write a simple code:
import requests
url = 'https://i-teka.kz/almaty/spisokaptek'...
-1
votes
1
answer
34
views
why part of content disappear magically [duplicate]
I try to scraping the site https://www.pik.ru/search/vangarden/storehouse, and I got html from web-site succesfully and write it in the file, but when I try to get html ater it lots of information ...
0
votes
1
answer
44
views
How can I get needs html from site?
Thank you for your attention,and sorry for my poor english.
I have been trying to get html from https://www.skiddle.com/festivals/dates.html without any success. I understand, that some parts download ...
1
vote
3
answers
69
views
Need Assistance with a regex pattern in Python – Parsing complex HTML structures
I'm trying to parse complex HTML structures using Python's re module, and I've run into a roadblock with my regex pattern. Here's what I'm trying to do:
I have HTML text that contains nested elements,...
-2
votes
1
answer
62
views
How to Extract Content Inside Each <a href> Tag?
I am working on a project that involves extracting some data from the website. Specifically, I am interested in pulling out the name of each category along with its description.
I have considered ...
0
votes
2
answers
90
views
Why is my code print out the same html link a lot of times?
I'm doing a following link activity on Python ( it's an assignment on Python Web Access Data - Coursera). Here is the problem:
In this assignment you will write a Python program that expands on http://...
1
vote
1
answer
105
views
Parsing a website with "load more"/"show more" buttons
For my project, I need to get all the reviews for a business on Capterra(should work for different businesses). I'm not that experienced in web scraping so I have trouble trying to do that). I tried ...
-2
votes
1
answer
102
views
'module' object is not callable in python
!pip install bs4
!pip install html5lib
import pandas as pd
!pip install lxml
import requests
import bs4 as BeautifulSoup
url = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/...
1
vote
1
answer
492
views
How to parse a specific part of html table data using pandas
I have been learning how to scrape a web page using Pandas and I have hit a bit of a wall where I cant extract a specific piece of data that inside the itself.
Here is the html which is being parsed ...
0
votes
1
answer
398
views
How to scrape WSJ headlines with requests and BeautifulSoup?
WSJ doesn't want to be parsed - I have this function:
def get_wsj_news():
global prev_news_wsj
url = "https://www.wsj.com/news/world"
news = []
news_to_post = []
try:
...
0
votes
1
answer
39
views
Python Web Scraper not populating .txt file with any scraped data
I'm new to python but I have a slight grasp on what's going on. I'm trying to write a web scraper with BeautifulSoup. I'm tyring to scrape a site for a string of numbers and then to write that string ...
0
votes
1
answer
39
views
Code is working problem is that the loop in card_elements is not working
url = "https://github.com/marketplace?category=project-management&type=actions"
driver.get(url)
Parse the HTML content of the page
soup = BeautifulSoup(driver.page_source, "html....
0
votes
2
answers
87
views
Python Selenium search for sibbling [object Text] that only have a text
I want to get the text of an expression in xpath that only has text in its sibling
html_code
I'm trying this way but it gives me an error and I don't know how to select the text, since it doesn't have ...