Python 3.x - How to get data from WhoScored
I need data from whoscored.com. When I run this code:
import requests
from bs4 import BeautifulSoup as soup

url = "https://www.whoscored.com/statistics"
page_html = requests.get(url)
page_soup = soup(page_html.content, 'html.parser')
I am getting the following in the page_soup variable:
<html style="height:100%"> <head> <meta content="noindex, nofollow" name="robots"/> <meta content="telephone=no" name="format-detection"/> <meta content="initial-scale=1.0" name="viewport"/> <meta content="ie=edge,chrome=1" http-equiv="x-ua-compatible"/> <script src="/_incapsula_resource?swjiylwa=2977d8d74f63d7f8fedbea018b7a1d05" type="text/javascript"></script> </head> <body style="margin:0px;height:100%"><iframe frameborder="0" height="100%" marginheight="0px" marginwidth="0px" src="/_incapsula_resource?cwudnsai=18&xinfo=10-12988947-0%200nnn%20rt%281505336855621%20883%29%20q%280%20-1%20-1%2077%29%20r%280%20-1%29%20b15%284%2c200%2c0%29%20u2&incident_id=473000610021824233-81324132038935514&edet=15&cinfo=04000000" width="100%">Request unsuccessful. Incapsula incident ID: 473000610021824233-81324132038935514</iframe></body>
Is there a way to get the data in this situation?
whoscored.com uses Incapsula, a cloud-based application delivery platform that provides security features for web servers. Incapsula describes how it works as follows:
Through a simple change to the website's DNS records (and no hardware or software required), website traffic is seamlessly routed through Incapsula's globally distributed network of high-powered servers. Incoming traffic is intelligently profiled in real time, blocking the latest web threats (e.g., SQL injection attacks, scrapers, malicious bots, comment spammers) and thwarting triple-digit gigabit DDoS attacks. Meanwhile, outgoing traffic is accelerated and optimized with Incapsula's global CDN for faster load times, keeping welcome visitors speeding through.
Source: https://www.incapsula.com/faqs/
Your HTTP request is getting blocked, as you can see in the HTML body:
Request unsuccessful. Incapsula incident ID: 473000610021824233-81324132038935514
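Before parsing, a scraper can check whether it received this block page instead of the real content. The following is only a minimal sketch: the helper names is_incapsula_block and incident_id are made up for illustration, and the two markers it looks for are simply the ones visible in the response quoted above, so they are an assumption about what a block page contains rather than a documented Incapsula format.

```python
import re

# Markers taken from the block page quoted above (assumption: other
# block pages contain at least one of them). Matching ignores case.
_BLOCK_MARKERS = (
    re.compile(r"_incapsula_resource", re.IGNORECASE),
    re.compile(r"incapsula incident id", re.IGNORECASE),
)
_INCIDENT_ID = re.compile(r"incident id:\s*([\d-]+)", re.IGNORECASE)


def is_incapsula_block(html: str) -> bool:
    """Return True if the HTML looks like the Incapsula block page."""
    return any(marker.search(html) for marker in _BLOCK_MARKERS)


def incident_id(html: str):
    """Return the incident ID found in a block page, or None if absent."""
    match = _INCIDENT_ID.search(html)
    return match.group(1) if match else None
```

With the code from the question, this could be called as is_incapsula_block(page_html.text) before building page_soup, so that a blocked request is reported instead of silently parsed.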
What you want is to bypass Incapsula. There is a Python module for that: https://github.com/ziplokk1/incapsula-cracker-py3
I recommend taking a closer look at issue #4, because you may have some work to do before you are able to bypass Incapsula.