How to Bypass 403 Forbidden Error When Web Scraping: Tutorial

Рет қаралды 36,815

Oxylabs

Күн бұрын

Пікірлер: 22

@oxylabs Жыл бұрын

Thanks for watching! We hope you enjoyed this video 💙 Find more content like this here: oxy.yt/jimW

@DjMakinetor 4 ай бұрын

Not work. Still Error 403 ;) or 502 and 504

@HelloHelloHell-o 25 күн бұрын

Does this also work to access a area that was built to give error 403? When there is able to enter password.

@odkdsjf Жыл бұрын

You explained it very well and produced a very high-quality video... which is extremely rare on KZbin. Good job. Thank you

@oxylabs Жыл бұрын

Thank you! We're really happy you enjoyed it! :)

@umair5807 Жыл бұрын

I solved the 403 error for a website, after watching this video. First I used User Agents, it didn't solve, then I used request headers, it solved.

@oxylabs Жыл бұрын

We're so happy it helped! Thanks for your feedback :D

@ritikshukla3855 3 ай бұрын

How did you solve it with request headers? I was following the video and I still got 403.

@MrRaveHaven 11 ай бұрын

Would be really cool if there was a Python library which created a full set of realistic headers for use with Requests/scraping.

@dantelangone4829 7 ай бұрын

It is a great video, thank you! One thing I did not understand is how do I select the headers to include, the resource you cite in description is really tough to understand.

@oxylabs 7 ай бұрын

Hello, thanks for your comment! Compiling header sets yourself could be tricky. A headless browser is probably the easiest way, as it will automatically use relevant headers. Alternatively, you could integrate a random header generator library into your code (e.g. Python has random-header-generator, but there are more out there). Hope that helps!

@dantelangone4829 7 ай бұрын

Thank you for redirecting me there. I was able to have a valid header myself by copying the entries of my browsers in different machines and matching them to the user agents. Yet, it would be nice to explore a library. Also, I believe that a headless browser has fingerprints of it being headless, and no normal user would navigate headless… What other change would I need? Thanks again for the video and precious info.

@ronnielipman9071 5 ай бұрын

I alwatch s tube…..today on all my devices I’m getting the 403 error on all my android boxes…..om you tube without ads etc etc….are u able to help

@Andrei-ds8qv Жыл бұрын

Interesting and educational video, but what you did there was not reading the answer from the server to which you have make a request, you just printed out your headers from the request itself. It is the same thing, but for the sake of the truth you should have read what came back in the data_request.text(), because that is where the server will put it's answer and will tell you what it sees.

@DevalPatel-zs2sp 2 ай бұрын

really really LOVE you My friend, Thank you for sharing the information :)

@DevalPatel-zs2sp 2 ай бұрын

in case if anyone is still suffering, i would like to tell you that from inspect > network > right click on the webpage request status> header > copy everything from headers> send it to chatgpt and tell them to create headerds of most of them and donot change the context... (chatgpt might change the safari version or something but tell to have the exact same context) i hpoe this work for you, please ignore spelling mistakes :)

@scoutgaming737 8 ай бұрын

I'm trying to make discord bot that just post e621 posts one by one and I'm just wondering why that website would be concerned with bots just looking aorund lol