Re: [問題] 讀取XML檔

作者: alanfengjkl (Alan)   2018-05-24 10:08:50
※ 引述《alanfengjkl (Alan)》之銘言:
: [問題類型]:
: 程式諮詢
: [軟體熟悉度]:
: 使用者
: [問題敘述]:
: 在使用R讀取XML檔案時
: 出現下列ERROR
: 想知道如何排除問題?
: 另外由於資料是BIG5
: 讀進來後若有亂碼該如何處理?
: [程式範例]:
: library(XML)
: xml.doc <- xmlParse(file,encoding = "BIG5")
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: xmlParseEntityRef: no name
: input conversion failed due to input error, bytes 0xF9 0xDC 0xBC 0x7A
: input conversion failed due to input error, bytes 0xF9 0xDC 0xBC 0x7A
: encoder errorPremature end of data in tag ?冽饿??line 168391
: Premature end of data in tag 撣喳腙?怿敦 line 168384
: Premature end of data in tag INVOICE line 2
: Error: 1: xmlParseEntityRef: no name
: 2: xmlParseEntityRef: no name
: 3: xmlParseEntityRef: no name
: 4: xmlParseEntityRef: no name
: 5: xmlParseEntityRef: no name
: 6: xmlParseEntityRef: no name
: 7: xmlParseEntityRef: no name
: 8: xmlParseEntityRef: no name
: 9: xmlParseEntityRef: no name
: 10: xmlParseEntityRef: no name
: 11: input conversion failed due to input error, bytes 0xF9 0xDC 0xBC 0x7A
: 12: input conversion failed due to input error, bytes 0xF9 0xDC 0xBC 0x7A
: 13: encoder error14: Premature end of data in tag ?冽饿??line 168391
: 15: Premature end of data in tag 撣喳腙?怿敦 line 168384
: 16: Premature end of data in tag INVOICE line 2
: [環境敘述]:
: 最新版R
:
可是我用xml2也有error如下
> rm(list=ls())
> file="C0001BILL9_EBill_20180505_20180505114335.xml"
> setwd("D:/")
> library(xml2)
> d=read_xml(file,encoding="BIG5")
Error in doc_parse_file(con, encoding = encoding, as_html = as_html, options =
options) :
xmlParseEntityRef: no name [68]
> d=read_html(file,encoding="BIG5")
Error in doc_parse_file(con, encoding = encoding, as_html = as_html, options =
options) :
input conversion failed due to input error, bytes 0xF9 0xDC 0xBC 0x7A [600
3]
作者: Wush978 (拒看低質媒體)   2018-05-24 11:33:00
可是你沒有給讓網友能重現錯誤的方式,所以我也幫不了
作者: obarisk (OSWALT)   2018-05-25 08:55:00
先把Windows換掉,大概就解決了
作者: yulunchu (mouseegg)   2018-05-31 05:57:00
encoding="UTF-8"

Links booklink

Contact Us: admin [ a t ] ucptt.com