[Question] Scraper POST & headers (unhumanWu, PTT 批踢踢實業坊)

Author: unhumanWu (阿文) 2017-09-23 17:10:36

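Hi everyone, I'm new to web scraping.
I need to scrape store location information from
https://www.taiwanmobile.com/mobile/storelbs/lbs.html#
After reading earlier posts, I suspected the request headers were the problem,
so I copied every header from the browser into my request, but it doesn't seem to help...
How can I get past this? Thanks in advance.
My code is below: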
import requests
from bs4 import BeautifulSoup

# Form fields sent in the POST body
form_data = {
    "city": "台北市",
    "district": "松山區",
    "lat": "25.0464207",
    "lng": "121.5555859",
    "searchDistance": "-1",
}
# Request headers copied verbatim from the browser
headers = {
    "Accept": "application/json, text/javascript, */*; q=0.01",
    "Accept-Encoding": "gzip, deflate, br",
    "Accept-Language": "zh-TW,zh;q=0.8,en-US;q=0.6,en;q=0.4",
    "Connection": "keep-alive",
    "Content-Length": "127",
    "Content-Type": "application/x-www-form-urlencoded; charset=UTF-8",
    "Cookie": "_msuuid_558dza12683=D58CE660-772F-4345-A7C6-B1B732FB85F0; "
              "JSESSIONID=nt+dIYPkeJGODFLtAALzUKsu; _ga=GA1.2.142560397.1498299011; "
              "_gid=GA1.2.594796640.1506128988",
    "Host": "www.taiwanmobile.com",
    "Origin": "https://www.taiwanmobile.com",
    "Referer": "https://www.taiwanmobile.com/mobile/storelbs/lbs.html",
    "User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_5) "
                  "AppleWebKit/537.36 (KHTML, like Gecko) "
                  "Chrome/60.0.3112.113 Safari/537.36",
    "X-Requested-With": "XMLHttpRequest",
}
response_post = requests.post(
    "https://www.taiwanmobile.com/mobile/storelbs/lbs.html#",
    data=form_data,
    headers=headers,
)
response_post.encoding = 'utf-8'
soup_post = BeautifulSoup(response_post.text, "lxml")

Author: vi000246 (Vi) 2017-09-23 23:16:00

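Some header values that look like gibberish are generated at the moment the request is made. If you hard-code them, some sites will block you, because those header values were not generated for the current request.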
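
One way to act on this advice, as a rough sketch only: drop the hard-coded values that belong to a single request or session, load the page once so the server can issue fresh cookies, and then send the POST through the same session. The helper names (`strip_dynamic_headers`, `post_with_fresh_session`) and the choice of which headers count as "dynamic" are assumptions made for illustration; nothing in this thread confirms that this site checks cookies, or that the page URL is the endpoint the browser actually posts to (the Network tab in the browser's developer tools would show that).

```python
# Headers that are tied to one request or one session. Values copied from the
# browser go stale, so they are left for the HTTP client to generate.
# (Assumption: this set is illustrative, not confirmed for this site.)
DYNAMIC_HEADERS = {"cookie", "content-length", "host", "connection"}


def strip_dynamic_headers(headers):
    """Return a copy of `headers` without the per-request/per-session entries."""
    return {
        name: value
        for name, value in headers.items()
        if name.lower() not in DYNAMIC_HEADERS
    }


def post_with_fresh_session(session, page_url, post_url, form_data, headers):
    """GET the page first so the server can set fresh cookies, then POST.

    `session` is expected to be a requests.Session (or any object with the
    same get/post methods). A requests.Session stores the cookies it receives
    from the GET and sends them back automatically on the POST.
    """
    static_headers = strip_dynamic_headers(headers)
    session.get(page_url, headers=static_headers, timeout=10)
    return session.post(
        post_url, data=form_data, headers=static_headers, timeout=10
    )


# Usage sketch (needs network access and the requests package; form_data and
# headers are the dictionaries from the original post):
#
#   import requests
#   url = "https://www.taiwanmobile.com/mobile/storelbs/lbs.html"
#   with requests.Session() as session:
#       response = post_with_fresh_session(session, url, url, form_data, headers)
#       print(response.status_code)
```

Passing the session in as a parameter keeps the header filtering separate from the network calls, so the filtering can be checked without touching the site.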
