I'm using BeautifulSoup and Requests to get data from an HTTPS website.

I know I need to provide authorisation details (I don't want to set verify=False).

Something like the solution in: How do I scrape an https page?

How do I ensure that the login and password are protected and not sent as plain text? (I'm a rookie.)

Is there anything I need to consider here?

Keagan

1 Answer

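I think you misunderstand how HTTP and HTTPS work. Over HTTPS, the whole request, including any login and password you send, is encrypted in transit by TLS. Requests also verifies the server's certificate by default, so as long as you leave verify alone, your credentials are not sent as plain text.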

Requests is what fetches the web page and returns the HTML; BeautifulSoup only parses that HTML afterwards. Some websites require you to authenticate so that private content remains protected.

A better question for you to ask would be:

"How do I use BeautifulSoup and Requests to scrape data from a website that requires authentication?"

And to that, I would answer: read the docs. Here is a link to get you started: Requests Sessions - http://docs.python-requests.org/en/v2.3.0/user/advanced/#session-objects
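
To give a rough idea of what the session docs describe, here is a minimal sketch. It assumes the site accepts HTTP Basic authentication, which may not be true for yours (many sites use a login form or a token instead). The helper names make_session, page_title and fetch_title are made up for illustration.

```python
import requests
from bs4 import BeautifulSoup


def make_session(username, password):
    """Return a Session that attaches credentials to every request."""
    session = requests.Session()
    # Assumption: the site uses HTTP Basic auth. Over HTTPS these
    # credentials travel inside the encrypted TLS connection.
    session.auth = (username, password)
    # session.verify is True by default, so certificates are checked.
    return session


def page_title(html):
    """Parse an HTML string and return the <title> text, or None."""
    soup = BeautifulSoup(html, "html.parser")
    return soup.title.string if soup.title else None


def fetch_title(session, url):
    """Fetch a page with the given session and return its title."""
    response = session.get(url, timeout=10)
    response.raise_for_status()  # fail loudly on 401/403 and other errors
    return page_title(response.text)
```

You would call it with something like fetch_title(make_session(user, password), "https://example.com/private"), where the URL is only a placeholder. Reading the password with getpass.getpass() or from an environment variable keeps it out of the script itself.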

Swift