[Question] LabelEncoder: removing data it was not trained on

Author: ctr1 (【積π】)   2020-05-12 16:43:54
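from sklearn import preprocessing

# Fit the encoder on the training labels
le = preprocessing.LabelEncoder()
le.fit(["paris", "paris", "tokyo", "amsterdam"])
print('class:{}'.format(list(le.classes_)))

# Every label here was seen during fit, so this works
data1 = ["tokyo", "tokyo", "paris"]
print(le.transform(data1))

# "USA" and "Taiwan" were never seen during fit, so this raises ValueError
data2 = ["tokyo", "tokyo", "paris", "USA", "Taiwan"]
print(le.transform(data2))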
Author: TitanEric (泰坦)   2020-05-12 19:04:00
Filter them out first, then transform?
Author: ctr1 (【積π】)   2020-05-13 13:14:00
I have no way of knowing which values were never seen.
Author: eooft   2020-05-13 13:21:00
data2 = list(filter(lambda d: d in le.classes_, data2))  # keep only labels seen during fit
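
On the point about not knowing which values were never seen: the fitted encoder keeps its known labels in `le.classes_`, so membership can be checked against that. A minimal sketch of the idea follows. `split_known` is a hypothetical helper, not part of scikit-learn, and the `classes` list is a stand-in for `list(le.classes_)` so the snippet runs on its own.

```python
def split_known(values, known_classes):
    """Split values into (known, unseen) relative to the fitted classes."""
    known_set = set(known_classes)  # set gives fast membership checks
    known = [v for v in values if v in known_set]
    unseen = [v for v in values if v not in known_set]
    return known, unseen


# Stand-in for list(le.classes_) after the fit in the original post.
classes = ["amsterdam", "paris", "tokyo"]
data2 = ["tokyo", "tokyo", "paris", "USA", "Taiwan"]

known, unseen = split_known(data2, classes)
print(known)   # ['tokyo', 'tokyo', 'paris']
print(unseen)  # ['USA', 'Taiwan']
```

With the encoder from the post, passing `le.classes_` as `known_classes` and then calling `le.transform(known)` should avoid the ValueError, and `unseen` shows which labels were dropped.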
