You are on page 1of 21

SENTIMENT ANALYSIS USING RAPIDMINER

ABHILASH H C 4BD10CS003

CONTENTS
Introduction Sentiment Analysis RapidMiner Demonstration Screen Shots Advantages and Disadvantages Conclusion

INTRODUCTION

Two main types of textual information: Facts and Opinions Most current text information processing methods work with factual information (e.g., web search, text mining) Sentiment analysis or opinion mining, computational study of opinions (sentiments, emotions) expressed in text

WHAT IS SENTIMENT ANALYSIS?

Identify the orientation of opinion in a piece of text (blogs, user comments, review websites, community websites, ), in others words determine if a sentence or a document expresses positive, negative, neutral sentiment towards some object?

The movie was fabulous!


[ Sentimental ]

The movie stars Mr. X

The movie was horrible!

[ Factual ]

[ Sentimental ]

USES :

Consumer information

Product reviews
Consumer attitudes Trends Politicians want to know voters views Voters want to know politicians' stances and who else supports them Find like-minded individuals or communities

Marketing

Politics

Social

HOW IT IS DONE ?

First eliminate objective sentences, then use remaining sentences to classify document polarity (reduce noise)

Fig. Polarity Classifier

RAPIDMINER

Around since 2001 Open source - Community Editions Client/Server model with Server as SaaS(Service as a Software) Most popular for data analytics GUI based - no need to write code Java based - Runs on All Platforms
All usual Windows versions are supported as well as Macintosh, Linux or UNIX systems. Download is available from http://www.rapid-i.com.

WELCOME PERSPECTIVE

DESIGN PERSPECTIVE

DEMONSTRATION STEP 1

STEP 2

STEP 3

STEP 4

STEP 5

STEP 6

STEP 7

STEP 8

ADVANTAGES
Free version has adequate resources to avoid big name options if a small business It is a quality tool, given its ranking among the other commercial products GUI is very user friendly. GUI is used to create data mining operators in XML files XML Standardization is great for utilizing various data sources Ease of use and available tutorials Works on any operating system

DISADVANTAGE
Some options are not available in free product, but you can upgrade Possibly less customer service available for free version There can be some restriction on customized use Beginner may face some difficulty in understanding

CONCLUSION

RapidMiner is an open source learning environment for data mining and machine learning. This environment can be used to extract meaning from a dataset. There are hundreds of machine learning operators to choose from, helpful pre and post processing operators, descriptive graphic visualizations, and many other features. Users with limited knowledge in computer science and programming may find RapidMiner's learning curve to be substantial.

THANK YOU

You might also like