[問題] 用rvest抓取youtube資料

作者: lovedmagic (EricZou)   2017-05-25 18:14:27
[問題類型]:
意見調查(我對R 有個很棒的想法,想問問大家的意見)
[軟體熟悉度]:
入門(寫過其他程式,只是對語法不熟悉)
[問題敘述]:
小弟我最近想要抓YOUTUBE的人數與影片長短等等結構化資料來分析,
無奈用rvest只能夠抓前30筆資料,我目的是想抓取所有的影片資料
有試著用RCurl來抓,但是編碼的問題讓我非常困擾,請求大大指點迷津
[程式範例]:
pew.ytb <- read_html('https://www.youtube.com/user/PewDiePie/videos') #讀取
pewdiepie的影片
ytb.nodes <-
html_nodes(pew.ytb,"div.yt-lockup.clearfix.yt-lockup-video.yt-lockup-grid") #
截取影片觀看人數與發佈時間
[環境敘述]:
R version 3.3.2 (2016-10-31)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 7 x64 (build 7601) Service Pack 1
locale:
[1] LC_COLLATE=Chinese (Traditional)_Taiwan.950
[2] LC_CTYPE=Chinese (Traditional)_Taiwan.950
[3] LC_MONETARY=Chinese (Traditional)_Taiwan.950
[4] LC_NUMERIC=C
[5] LC_TIME=Chinese (Traditional)_Taiwan.950
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] XML_3.98-1.5 rvest_0.3.2 xml2_1.0.0 RCurl_1.95-4.8
[5] bitops_1.0-6
loaded via a namespace (and not attached):
[1] httr_1.2.1 selectr_0.3-0 magrittr_1.5 R6_2.2.0 tools_3.3.2
[6] curl_2.2 Rcpp_0.12.7 stringi_1.1.2 stringr_1.1.0
[關鍵字]:網路爬蟲、youtube
作者: lovedmagic (EricZou)   2017-06-07 16:33:00
已解決

Links booklink

Contact Us: admin [ a t ] ucptt.com