Document Classification in Weka

  Рет қаралды 11,643

jengolbeck

jengolbeck

Күн бұрын

Пікірлер: 25
@brylie
@brylie 6 жыл бұрын
Thanks for explaining this without much jargon. Your teaching style is friendly and accessible. Cheers 😀
@amine-us7hn
@amine-us7hn 4 жыл бұрын
How did you create this arff file, I tried many times but did not
@hyunjungkang8378
@hyunjungkang8378 6 жыл бұрын
Thanks a lot for the videos on Weka. I like the way you explain stuffs, they are very clear and easy to understand :)
@mjma1984
@mjma1984 6 жыл бұрын
So i have a csv file with two columns, first is text and second is class. When i use the apply the filter, i don't see the list of words in my fist column, i simply don't see the attributes the way shown in your video @4:22 ! Any ideas how to get that ? When i click on the Edit, i see that Weka is treating each row in the first column as a whole word, meaning it doesn't split words in the sentences. I tried using stemmer, tokenizer, etc etc but i am still getting the saem ieeuse !
@jengolbeck
@jengolbeck 6 жыл бұрын
When you apply the StringToWord vector, what does it show?
@mjma1984
@mjma1984 6 жыл бұрын
Nothing happens. I still see the window of Attributes with only the names of my attributes, on the selected attribute window, nothing changes !
@jengolbeck
@jengolbeck 6 жыл бұрын
If you want, drop me an email at jgolbeck@umd.edu with your file and I'd be happy to take a look
@martinl2603
@martinl2603 6 жыл бұрын
Probably that your string data get converted into a nominal type (instead of a string type) when loading it into WEKA. StringToWordVector doesn't support nominal data type and does not work though.
@Marie-fu1db
@Marie-fu1db 5 жыл бұрын
I get this same issue. What is the solution?
@martinl2603
@martinl2603 6 жыл бұрын
@jengolbeck, Well, as a member in WEKA community I can say that there are 2 main issues with your video: First: it is not bad way of using the "StringToWordVector" filter, but not absolutely the correct way, as this approach in the way you explained, brings some class information to the tokens, which provides clue to the machine learning algorithm later on about the class type and then provides an optimistic result such the one you had. Second: NaiveBayesMultinomialText can work with string data type, so the default StringToWordVector is not really necessary if you managed to use NaiveBayesMultinomialText classifier ;)
@JiminPark-ld2xx
@JiminPark-ld2xx 3 жыл бұрын
Really appreciate if you can do a video about how to convert CSV files or .txt files into ARFF files. Is there any cleaning process before we convert it to ARFF or anything. Because a lot of students are suffering including myself due to this issue. Thank You...
@solomonngare8382
@solomonngare8382 8 ай бұрын
Hello. What if the dataset is not labelled. it's just plain reviews with no label i.e. positive or negative. How do you go about labelling this
@jengolbeck
@jengolbeck 8 ай бұрын
In that scenario, it's not clear what you would be using Weka for. Weka allows you to build a model based on your data. If you don't have labels on the data, there is nothing to train a model from. From your comment mentioning "positive or negative", I feel like you might be interested in doing sentiment analysis? If that's the case, you would want to use an off-the-shelf sentiment analysis tool.
@matanonson4
@matanonson4 3 жыл бұрын
can you share with us your trump.arff file please?
@yarjung5332
@yarjung5332 5 жыл бұрын
tooo much nice explanation love your way of teaching..
@aseelh8123
@aseelh8123 6 жыл бұрын
Thanks for your explanation, I have a data set of Arabic tweets, when i try to open it using WEKA a question marks appear!, is there a way for defining the Arabic language in WEKA ? Regards,
@TheRegent
@TheRegent 5 жыл бұрын
Try to save the arff file using notepad as utf-8 format instead of ANSI. Then, sure it will read Arabic texts. Or use the CLI with updated package of languages fetched from java updates!
@Balawi28
@Balawi28 3 жыл бұрын
Great tutorial, straight to the point, thanks!
@mjma1984
@mjma1984 6 жыл бұрын
you are awesome ! Thank you for the very informative video
@SamuelEA1
@SamuelEA1 4 ай бұрын
Thanks alot ❤
@yuzhengfeng5701
@yuzhengfeng5701 3 жыл бұрын
Thanks for this very helpful vedio!
@BigAsciiHappyStar
@BigAsciiHappyStar Жыл бұрын
Does the word COVFEFE appear in the list? 😁
@mngugi7
@mngugi7 9 ай бұрын
😍
Data Mining Feature Selection Using WEKA
18:09
EzzaAk
Рет қаралды 16 М.
Interpreting Results and Accuracy in Weka
13:05
jengolbeck
Рет қаралды 45 М.
BAYGUYSTAN | 1 СЕРИЯ | bayGUYS
36:55
bayGUYS
Рет қаралды 1,9 МЛН
coco在求救? #小丑 #天使 #shorts
00:29
好人小丑
Рет қаралды 120 МЛН
The Weka Explorer Interface
17:42
jengolbeck
Рет қаралды 9 М.
Weka Tutorial 31: Document Classification 1 (Application)
12:04
Rushdi Shams
Рет қаралды 39 М.
pca weka
17:51
Chris Kimmer
Рет қаралды 46 М.
Data Mining with Weka (2.1: Be a classifier!)
11:19
WekaMOOC
Рет қаралды 85 М.
Class Balancing in Weka
8:06
jengolbeck
Рет қаралды 23 М.
Data Mining with Weka (1.5: Using a filter )
7:34
WekaMOOC
Рет қаралды 133 М.
Network Basics
37:34
jengolbeck
Рет қаралды 7 М.
Creating ARFF Files for Weka
5:15
jengolbeck
Рет қаралды 85 М.
BAYGUYSTAN | 1 СЕРИЯ | bayGUYS
36:55
bayGUYS
Рет қаралды 1,9 МЛН