Practical 5

Data Pre-processing and text analytics using Orange

What is the effect of discretization, continuation, Normalization, Randomization on the data w.r.t. Orange?
Normalization: Construct a preprocessor for normalization of features. Given a data table, preprocessor returns a new table in which the continuous attributes are normalized.
Randomization: Randomization allows creating random data sets, where variables correspond to the selected distributions.

Fig 1 Normalization of data with orange api in python

Fig 2 Randomization of data with orange api in python

Text Preprocessing:  In natural language processing, text preprocessing is the practice of cleaning and preparing text data



Fig 3 load text data
Fig 4 word cloud


Fig 5 Preprocess data with filtering



Fig 6 show data in word cloud



Fig 7 Corpus View

Get the code from here

Comments