Newest 'python+web-scraping+html-parsing' Questions

1 vote

2 answers

64 views

Extracting text from Wikisource using BeautifulSoup returns empty result

I'm trying to extract the text of a book from a Wikisource page using BeautifulSoup, but the result is always empty. The page I'm working on is Le Père Goriot by Balzac. Here's the code I'm using: ...

Hugo Durif

13

asked Jan 30 at 21:31

1 vote

1 answer

32 views

My Beautiful Soup library is not extracting out the all the anchor elements from a listed display

Hi so I am very new to web scraping and I am trying out the basics for it. Right now, I wanted to extract links from a root website (coventry.gov.uk). The problem was, however, I could not get the ...

Gs can't

23

asked Jul 7, 2024 at 18:02

-2 votes

1 answer

44 views

how can i get website html code using requests?

I repeated all the steps as in the tutorial but I don’t get the code. Although a couple of days ago everything worked I write a simple code: import requests url = 'https://i-teka.kz/almaty/spisokaptek'...

David Zakharyan

3

asked Mar 23, 2024 at 12:21

-1 votes

1 answer

34 views

why part of content disappear magically [duplicate]

I try to scraping the site https://www.pik.ru/search/vangarden/storehouse, and I got html from web-site succesfully and write it in the file, but when I try to get html ater it lots of information ...

POPIF

11

asked Feb 8, 2024 at 20:52

0 votes

1 answer

44 views

How can I get needs html from site?

Thank you for your attention,and sorry for my poor english. I have been trying to get html from https://www.skiddle.com/festivals/dates.html without any success. I understand, that some parts download ...

POPIF

11

asked Feb 6, 2024 at 18:55

1 vote

3 answers

69 views

Need Assistance with a regex pattern in Python – Parsing complex HTML structures

I'm trying to parse complex HTML structures using Python's re module, and I've run into a roadblock with my regex pattern. Here's what I'm trying to do: I have HTML text that contains nested elements,...

prabu naresh

200

asked Sep 2, 2023 at 12:54

-2 votes

1 answer

62 views

How to Extract Content Inside Each <a href> Tag?

I am working on a project that involves extracting some data from the website. Specifically, I am interested in pulling out the name of each category along with its description. I have considered ...

Alex VIJU

13

asked Aug 30, 2023 at 19:58

0 votes

2 answers

90 views

Why is my code print out the same html link a lot of times?

I'm doing a following link activity on Python ( it's an assignment on Python Web Access Data - Coursera). Here is the problem: In this assignment you will write a Python program that expands on http://...

Vinh Nguyễn Thành

1

asked Jul 27, 2023 at 4:44

1 vote

1 answer

105 views

Parsing a website with "load more"/"show more" buttons

For my project, I need to get all the reviews for a business on Capterra(should work for different businesses). I'm not that experienced in web scraping so I have trouble trying to do that). I tried ...

Rustam

21

asked Jul 12, 2023 at 20:08

-2 votes

1 answer

102 views

'module' object is not callable in python

!pip install bs4 !pip install html5lib import pandas as pd !pip install lxml import requests import bs4 as BeautifulSoup url = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/...

Inam Buneri

1

asked Jun 8, 2023 at 9:17

1 vote

1 answer

492 views

How to parse a specific part of html table data using pandas

I have been learning how to scrape a web page using Pandas and I have hit a bit of a wall where I cant extract a specific piece of data that inside the itself. Here is the html which is being parsed ...

30ThreeDegrees

41

asked May 27, 2023 at 22:58

0 votes

1 answer

398 views

How to scrape WSJ headlines with requests and BeautifulSoup?

WSJ doesn't want to be parsed - I have this function: def get_wsj_news(): global prev_news_wsj url = "https://www.wsj.com/news/world" news = [] news_to_post = [] try: ...

miraculous_guru

31

asked Apr 1, 2023 at 11:16

0 votes

1 answer

39 views

Python Web Scraper not populating .txt file with any scraped data

I'm new to python but I have a slight grasp on what's going on. I'm trying to write a web scraper with BeautifulSoup. I'm tyring to scrape a site for a string of numbers and then to write that string ...

BlackHeart

1

asked Apr 1, 2023 at 4:22

0 votes

1 answer

39 views

Code is working problem is that the loop in card_elements is not working

url = "https://github.com/marketplace?category=project-management&type=actions" driver.get(url) Parse the HTML content of the page soup = BeautifulSoup(driver.page_source, "html....

Yeuhan Shen

9

asked Feb 10, 2023 at 18:04

0 votes

2 answers

87 views

Python Selenium search for sibbling [object Text] that only have a text

I want to get the text of an expression in xpath that only has text in its sibling html_code I'm trying this way but it gives me an error and I don't know how to select the text, since it doesn't have ...

Freddy André Cuellar López

11

asked Jan 24, 2023 at 6:34

Collectives™ on Stack Overflow

All Questions

Extracting text from Wikisource using BeautifulSoup returns empty result

My Beautiful Soup library is not extracting out the all the anchor elements from a listed display

how can i get website html code using requests?

why part of content disappear magically [duplicate]

How can I get needs html from site?

Need Assistance with a regex pattern in Python – Parsing complex HTML structures

How to Extract Content Inside Each <a href> Tag?

Why is my code print out the same html link a lot of times?

Parsing a website with "load more"/"show more" buttons

'module' object is not callable in python

How to parse a specific part of html table data using pandas

How to scrape WSJ headlines with requests and BeautifulSoup?

Python Web Scraper not populating .txt file with any scraped data

Code is working problem is that the loop in card_elements is not working

Python Selenium search for sibbling [object Text] that only have a text

Hot Network Questions

Collectives™ on Stack Overflow

All Questions

Related Tags