[問題] 文字探勘無法將中文的\n移除

作者: ardodo (米蟲)   2015-08-06 15:25:59
[問題類型]:
程式諮詢(我想用R 做某件事情,但是我不知道要怎麼用R 寫出來)
[軟體熟悉度]:
使用者(已經有用R 做過不少作品)
[問題敘述]:
text mining無法將\n移除
我參考了陳嘉葳的文章,照著他的作法作,但是我無法將中文的\n移除,利用
設定了stopwords後依然無法將\n給斷掉,請問該如何解決呢?
範例檔案在此↓
https://drive.google.com/open?id=0Bz0IlJks1nIiUVktajFlc29PODA
[程式範例]:
http://pastebin.com/gaeXZX6w
[sessionInfo]
R version 3.2.1 (2015-06-18)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 7 x64 (build 7601) Service Pack 1
locale:
[1] LC_COLLATE=Chinese (Traditional)_Taiwan.950 LC_CTYPE=Chinese
(Traditional)_Taiwan.950
[3] LC_MONETARY=Chinese (Traditional)_Taiwan.950
LC_NUMERIC=C
[5] LC_TIME=Chinese (Traditional)_Taiwan.950
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] cluster_2.0.1 fpc_2.1-9 wordcloud_2.5
RColorBrewer_1.1-2 Rwordseg_0.2-1
[6] rJava_0.9-6 tmcn_0.1-4 tm_0.6-2
NLP_0.1-8
loaded via a namespace (and not attached):
[1] flexmix_2.3-13 Rcpp_0.11.6 MASS_7.3-40 mclust_5.0.2
lattice_0.20-31
[6] prabclus_2.2-6 tools_3.2.1 nnet_7.3-9 parallel_3.2.1
grid_3.2.1
[11] modeltools_0.2-21 class_7.3-12 trimcluster_0.1-2 kernlab_0.9-20
robustbase_0.92-5
[16] slam_0.1-32 DEoptimR_1.0-3 diptest_0.75-7 stats4_3.2.1
mvtnorm_1.0-3
作者: psinqoo (零度空間)   2015-08-06 16:00:00
R的版本?在3.0X前不會出現
作者: ardodo (米蟲)   2015-08-07 11:54:00
是喔?稍後我用舊版本試試看

Links booklink

Contact Us: admin [ a t ] ucptt.com