Skip to main content

All Questions

1 vote
2 answers
64 views

Extracting text from Wikisource using BeautifulSoup returns empty result

I'm trying to extract the text of a book from a Wikisource page using BeautifulSoup, but the result is always empty. The page I'm working on is Le Père Goriot by Balzac. Here's the code I'm using: ...
Hugo Durif's user avatar
1 vote
1 answer
32 views

My Beautiful Soup library is not extracting out the all the anchor elements from a listed display

Hi so I am very new to web scraping and I am trying out the basics for it. Right now, I wanted to extract links from a root website (coventry.gov.uk). The problem was, however, I could not get the ...
Gs can't's user avatar
-2 votes
1 answer
44 views

how can i get website html code using requests?

I repeated all the steps as in the tutorial but I don’t get the code. Although a couple of days ago everything worked I write a simple code: import requests url = 'https://i-teka.kz/almaty/spisokaptek'...
David Zakharyan's user avatar
-1 votes
1 answer
34 views

why part of content disappear magically [duplicate]

I try to scraping the site https://www.pik.ru/search/vangarden/storehouse, and I got html from web-site succesfully and write it in the file, but when I try to get html ater it lots of information ...
POPIF's user avatar
  • 11
0 votes
1 answer
44 views

How can I get needs html from site?

Thank you for your attention,and sorry for my poor english. I have been trying to get html from https://www.skiddle.com/festivals/dates.html without any success. I understand, that some parts download ...
POPIF's user avatar
  • 11
1 vote
3 answers
69 views

Need Assistance with a regex pattern in Python – Parsing complex HTML structures

I'm trying to parse complex HTML structures using Python's re module, and I've run into a roadblock with my regex pattern. Here's what I'm trying to do: I have HTML text that contains nested elements,...
prabu naresh's user avatar
-2 votes
1 answer
62 views

How to Extract Content Inside Each <a href> Tag?

I am working on a project that involves extracting some data from the website. Specifically, I am interested in pulling out the name of each category along with its description. I have considered ...
Alex VIJU's user avatar
0 votes
2 answers
90 views

Why is my code print out the same html link a lot of times?

I'm doing a following link activity on Python ( it's an assignment on Python Web Access Data - Coursera). Here is the problem: In this assignment you will write a Python program that expands on http://...
Vinh Nguyễn Thành's user avatar
1 vote
1 answer
105 views

Parsing a website with "load more"/"show more" buttons

For my project, I need to get all the reviews for a business on Capterra(should work for different businesses). I'm not that experienced in web scraping so I have trouble trying to do that). I tried ...
Rustam's user avatar
  • 21
-2 votes
1 answer
102 views

'module' object is not callable in python

!pip install bs4 !pip install html5lib import pandas as pd !pip install lxml import requests import bs4 as BeautifulSoup url = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/...
Inam Buneri's user avatar
1 vote
1 answer
492 views

How to parse a specific part of html table data using pandas

I have been learning how to scrape a web page using Pandas and I have hit a bit of a wall where I cant extract a specific piece of data that inside the itself. Here is the html which is being parsed ...
30ThreeDegrees's user avatar
0 votes
1 answer
398 views

How to scrape WSJ headlines with requests and BeautifulSoup?

WSJ doesn't want to be parsed - I have this function: def get_wsj_news(): global prev_news_wsj url = "https://www.wsj.com/news/world" news = [] news_to_post = [] try: ...
miraculous_guru's user avatar
0 votes
1 answer
39 views

Python Web Scraper not populating .txt file with any scraped data

I'm new to python but I have a slight grasp on what's going on. I'm trying to write a web scraper with BeautifulSoup. I'm tyring to scrape a site for a string of numbers and then to write that string ...
BlackHeart's user avatar
0 votes
1 answer
39 views

Code is working problem is that the loop in card_elements is not working

url = "https://github.com/marketplace?category=project-management&type=actions" driver.get(url) Parse the HTML content of the page soup = BeautifulSoup(driver.page_source, "html....
Yeuhan Shen's user avatar
0 votes
2 answers
87 views

Python Selenium search for sibbling [object Text] that only have a text

I want to get the text of an expression in xpath that only has text in its sibling html_code I'm trying this way but it gives me an error and I don't know how to select the text, since it doesn't have ...
Freddy André Cuellar López's user avatar

15 30 50 per page
1
2 3 4 5
19