
Proceedings of the SMART-2016, IEEE Conference ID: 39669

5th International Conference on System Modeling & Advancement in Research Trends, 25th-27th November, 2016
College of Computing Sciences & Information Technology, Teerthanker Mahaveer University, Moradabad, India

Business Intelligence using Data Mining Techniques and Business Analytics
Brojo Kishore Mishra1, Deepannita Hazra2, Kahkashan Tarannum3 and Manas Kumar4
1,2,3,4Department of IT, C.V. Raman College of Engineering, Bhubaneswar, India
E-mail: 1bkmishra@cvrgi.edu.in, 2hazra.deepannita@gmail.com, 3tara.shifa@gmail.com, 4manasbit2712@yahoo.com

Abstract: The objective of this paper is to present a literature review on the impact of Data Mining (DM) on Business Intelligence (BI). The paper highlights various features of DM, which involves three steps: exploration, pattern identification and deployment. BI is a hot topic among all industries aiming for relevance. BI emphasizes detailed integration and organization of data. DM and BI work together to process and analyse data, lightening the workload for the user and the organization and hence aiding the understanding of discovered material. The paper also explains Business Analytics (BA) as a part of BI which is in turn dependent on BI. There are various business sectors in which BA has proved to be a powerful tool for obtaining effective results.

Keywords: Business Intelligence, Data Mining, Business Analytics

I. INTRODUCTION

The task of studying business data and finding patterns in it is not new. The author of [10] points out that today's business community is suffering from information overload, and that business source analysis shows that:

61% of managers believe that there is information overload at their own workplace.
80% believe the situation will get worse.
Over 50% of managers ignore data in the current decision-making process because of the information overload.
84% of managers store this information for future use; it is not used for current analysis.
60% believe that the cost of gathering information outweighs its value.

For a long time this task was done with the help of statistical techniques. Now, to make the task easier, improved techniques such as "Data Mining" are used. Data mining is the process of "knowledge discovery" in databases, which can be used in decision making. It is a fast-expanding and dynamic field that uses artificial intelligence, machine learning, database systems and statistics to apply advanced techniques of data analysis [1]. The research in [5] states that the process designed and used for the purpose of exploring data is called data mining. This process is very similar to the real-life process of mining nuggets of gold from the Earth. More specifically, it is like taking non-trivial nuggets out of the huge volumes of available data. This paper gives a view of how data mining assists business intelligence in finding patterns and gaining knowledge from existing data.

The research in [2] explains that intense competition compels companies to find innovative ways to capture and enhance their market share while also reducing their costs. Implementing data analysis techniques can help companies find such solutions, for example by finding unexpected patterns in the large volumes of data present in a database or data warehouse. These patterns can provide information that could help in predicting future outcomes [1].

A. Why do we Need to Manage Data?

For faster decision making: Almost 77% of executives complain of not having the real-time information they need to take decisions. Data needs to be managed and kept organized so that it is easy and quick to refer to when taking decisions.
Limited insight due to large volumes of information: About 6 out of 10 respondents agree that almost all organizations have more data than they can handle and use effectively. Since organizations cannot handle so much data, their working procedures and insight are restricted and ultimately they function inefficiently.
New emerging varieties: Emails, audio, video, documents and images are responsible for generating 80% of the new data. This newly generated data raises another problem, that of storage. Data should be stored in such a way that it can be identified and segregated at any point of time.

84 Copyright SMART-2016 ISBN: 978-1-5090-3543-4



Overflowing volume: The amount of data is growing 44-fold. Over the next decade it is expected to grow from 800K petabytes to 35 zettabytes [3]. This is why it is important to organize the data from the very beginning, so that later confusion and effort can be avoided.

Among the extremely large volumes present, the data can be divided into four broad classes [3]:

Attitudinal data: Opinions, preferences, needs and desires.
Interaction data: Call centre notes, in-person dialogues, email/chat transcripts and web click streams.
Behavioural data: Transactions, usage history, payment history and orders.
Descriptive data: Self-declared information, (geo)demographics, characteristics and attributes.

B. Techniques of Data Mining

The extraction of hidden patterns from data with the help of different data mining methods can be classified into two types: description methods and prediction methods. Data description methods focus on understanding and interpreting the data with the help of examples and the way in which the underlying data relates to its parts. According to [1], the aim of prediction-oriented models is to construct a behavioural model which, given new samples, can predict the values related to those samples.

The data mining techniques used for the analysis of data are as follows:

Regression
Association Rule Discovery (Dependency Model)
Classification [23]
Clustering [5]
Anomaly detection
Summarization [4]

Regression: Regression can simply be described as "predictive power". Assuming a linear or non-linear model of dependency, regression analysis can be used to predict the value of a given (continuous) feature based on the other features in the data. The data item is mapped onto a real-valued prediction variable. Here are some examples: the revenue of a new product is predicted from that of complementary products; the likelihood of cancer can be predicted from a person's age and the amount of food and cigarettes that person consumes. "Logistic regression" is a term which appears in almost every aspect of this field, and regression techniques are also found to be useful in this science. These techniques are especially used in neural networks, which can be used to create complex functions that help in imitating the functionalities of the brain.

Association Rule Discovery (Dependency Model): Association rule discovery is a descriptive method used in data mining. In this model, significant dependencies between variables are defined. Although it is a very simple method, it is capable of providing a lot of insight and information about day-to-day business. This information can be used to generate the required revenue and even to improve the efficiency of the business. There are wide-ranging applications of this method which can help various industries and businesses to increase their value. Some examples are up-selling and cross-selling of products, physical organization of items, network analysis, and marketing and management. This method was used in industry for many years for market basket analysis, but engineers have since made new recommendations which have superseded the traditional methods.

Classification: Before digging into the hectic modelling phase of data analysis, the primary step we have to take is classification. It assigns each data item to one of a set of predefined classes. Assume you have a set of records, each with its own set of attributes, and one of those attributes is the class (for example, a letter grade). The main aim is to find a model for the class attribute that can accurately predict the class of previously unseen records (from similar external data sources), given the values of all other attributes. We usually divide the data set into two subsets: a training set and a test set. The model is built with the help of the training set, and the test set is used for validation. It is the test set which determines the accuracy and performance of the model.

Clustering: Clustering is an important technique through which objects can be grouped (for example, into different groups of customers). Objects belonging to the same cluster are similar, whereas those in different clusters are different. In this descriptive task a finite set of clusters is determined which identify or describe the data. Clustering can be defined as follows: given a group of data points, each with its own attributes and some measure of similarity, the points should be clustered in such a way that data points in the same cluster are much like each other, while data points in separate clusters are likely to be dissimilar to one another. To find how close or far one cluster is from another, we can use the Euclidean distance, which can be applied only if the attributes are continuous, or another similarity measure that is relevant to the specific problem. A useful application of clustering is market segmentation, in which the market is divided into distinct sets of customers and a distinct marketing strategy is applied to each subset. This can be done by analysing the lifestyle-related and geographical information of each customer and forming clusters from it, which helps in assessing the quality of the clustering.
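
As an illustration of grouping data points by Euclidean distance, the following is a minimal k-means sketch in Python. k-means is one common clustering algorithm, chosen here only for illustration; the paper does not prescribe a particular algorithm. The customer attributes, the value of k and all function names are assumptions made for this example.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def k_means(points, k, iterations=20, seed=0):
    """Group points into k clusters; returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [min(range(k), key=lambda c: euclidean(p, centroids[c]))
                  for p in points]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(v) / len(members)
                                     for v in zip(*members))
    return centroids, labels


# Hypothetical customers: (annual spend in thousands, store visits per month).
customers = [(2.0, 1.0), (2.5, 1.5), (3.0, 1.0),
             (20.0, 8.0), (21.0, 9.0), (19.5, 8.5)]
centroids, labels = k_means(customers, k=2)
```

On this toy data the low-spend and high-spend customers end up in separate clusters. Because Euclidean distance is sensitive to scale, continuous attributes would normally be normalised before clustering.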


The clustering quality of the customers is found by observing the difference between the buying patterns of the customers in one cluster and those of the customers in another cluster [5].

Anomaly detection (change and deviation detection): This technique helps to determine the most significant changes that have taken place in the data in the database. These changes are calculated and identified relative to previously determined data.

Summarization: With the help of this technique, a subset of the data present in the database is evaluated and consequently a compact description is found.

II. BUSINESS INTELLIGENCE

As described in [7], Business Intelligence (BI) is the concept of applying a set of technologies to convert data into meaningful information. Basically, the term business intelligence has two different meanings when related to intelligence. The first is human intelligence, or the capacity of the human brain, applied to business affairs. Business intelligence has become a novelty: human intellect and new technologies such as artificial intelligence are applied to management and decision making in different business-related problems. The second is the information which helps raise currency in business: the intelligent knowledge gained by experts and efficient technology in managing organizational and individual business.

C. Business Intelligence Using Data Mining

The emergence of business intelligence has thrown light upon new dimensions of the data collected by a business. The author of [8] states that risk management and enterprise decision-making are inseparable from mining tools. Business Intelligence (BI) can only be acquired by mining data in different ways. The use of data warehousing and Information Systems (IS) has made it possible for enterprise datasets to grow rapidly.

The author of [9] observes that the demand for more sophisticated and intelligent BI solutions is constantly growing because storage capacity grows at twice the speed of processor power. Over time, this unbalanced growth will make data processing tasks more time-consuming when using traditional BI solutions.

DM offers a variety of advanced data processing techniques that can help BI processes to run efficiently. The comprehensive process of applying BI to a business problem is referred to as the Knowledge Discovery in Databases (KDD) process and is vital for successful DM implementations with BI in mind.

D. Business Analytics

Business analytics is a major part of business intelligence. Business analytics is directly aided by data mining and business intelligence. Business intelligence mainly consists of analysing data, collecting knowledge and applying them through various methods.

The paper [22] explains that identifying various patterns in a data set, irrespective of the size of the data set (i.e. either a small database or a large data warehouse), is the main purpose of data mining. Searching for a pattern or relationship among different data groups is the main purpose served by DM. This is unlike a normal OLAP query, where an already identified pattern or relationship is used to process answers from the database. Identifying possible patterns with DM can help organizations the most. The main purpose of an organization is to provide better products and outstanding services to its customers. If patterns can be identified, this aids the prediction, association and grouping of various events, products or customers in a more effective manner.

Business analytics is a term used for the entire process which involves the application of skills, technology and different data mining algorithms. Business analytics produces valuable information to help managers make better decisions regarding their business and have proper control over their business operations. There are two main faces of the business analytics function: the back-end, where the main application of data mining takes place, and the front-end, which is a collation of diverse information and executive reporting metrics. If the business analytics function is executed effectively, it may become a core competence for an organization, containing valuable business intelligence which can support the organization in taking strategic and efficient actions in business.

E. Use of Data Mining in Business Analytics

The paper [22] says that the main locomotive driving the application of business analytics in businesses is data mining, or knowledge discovery in databases. Data mining gives us a view of past and present situations and an understanding of possible future outcomes, which can give effective results; hence we can say that DM acts as a detective. Clusters are made by examining past and current customers' behaviour, such as transactions, sales selections and servicing choices.

Simple extrapolation is used to describe the working of DM. Queries on data in various data software help us extract useful information. Data mining in an organization is mainly used for the growth of the business through the discovery of useful patterns. In simple words, queries help us retrieve information of which we already have prior knowledge, whereas mining of data helps us discover unknown facts that are there in the database.
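
The contrast between a query, which retrieves information we already know to ask for, and mining, which discovers facts not named in advance, can be sketched with the association rule technique from Section I-B. The example below is a minimal Python illustration under stated assumptions: the baskets, the item names, the support and confidence thresholds, and both function names are hypothetical, and only rules between pairs of items are considered.

```python
from collections import Counter
from itertools import combinations

# Hypothetical market baskets (one set of items per transaction).
transactions = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"bread", "jam"},
    {"milk", "butter"},
    {"bread", "butter", "jam"},
]


def query_count(item):
    """Query: retrieve a known fact, the number of baskets containing item."""
    return sum(1 for t in transactions if item in t)


def mine_pair_rules(min_support=0.4, min_confidence=0.7):
    """Mining: discover rules lhs -> rhs without naming the items in advance.

    support    = share of baskets containing both items
    confidence = share of baskets with lhs that also contain rhs
    """
    n = len(transactions)
    item_counts = Counter(i for t in transactions for i in t)
    pair_counts = Counter(frozenset(p) for t in transactions
                          for p in combinations(sorted(t), 2))
    rules = []
    for pair, count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue
        a, b = sorted(pair)
        for lhs, rhs in ((a, b), (b, a)):
            confidence = count / item_counts[lhs]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return sorted(rules)


rules = mine_pair_rules()
```

Here `query_count("bread")` answers a question the analyst already had in mind, whereas `mine_pair_rules()` surfaces item pairs, such as jam bought together with bread, that the analyst never asked about explicitly.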


The discovery of unknown facts is termed knowledge discovery [1]; it is a process through which various novel, valid and recognizable hidden patterns can be identified in huge databases. The terms knowledge discovery and data mining are sometimes used interchangeably.

III. CASE STUDY

Telecom services: Fraudulent activities in services and call intrusion.
Results: Reduced fraud activities in services, saving resources, time and money.
Financial companies: Attracting clients to their offers; cross-selling standard products to clients.
Results: Discovered key drivers for purchasing re-mortgage products; achieved greater response and worth of mortgage application revenue.
Software sales companies: Facing difficulty with customers' hardware and software purchasing decisions for online sales.
Results: Recommendation engine went live; pages viewed per month up more than 67 per cent; profits increased over previous years.

Some of these are explained more broadly under the applications of DM in BI.

IV. APPLICATIONS OF DATA MINING IN BUSINESS

Data mining is a business process used to study huge volumes of data and derive useful patterns of information from them. Many companies have improved their business by using data mining [1]. Companies with a strong focus on consumers, in fields like communication, finance, marketing and retail, use data mining to go deep, or "drill down", into their transactional data. This helps them determine customer preferences, the pricing and positioning of products, customer satisfaction and corporate profits [11]. Data mining has been successfully applied in the following areas.

A. Marketing or Retail

In the marketing field, the applications of data mining include market basket analysis, product performance analysis, market segmentation analysis and retail sales analysis [11]. Buying behaviour, support patterns and trends can be identified using data mining; hence better customer satisfaction and retention can be achieved and the goods consumption ratio can be enhanced, thereby reducing the cost of business [12]. The data mining techniques which could be useful in the retail industry are as follows:

Establish customer shopping behaviour: The buying patterns of customers can be identified and the kind of product they are likely to buy next can be found out.
Customer retention: Adjust the portfolio, pricing and promotions of products according to customer shopping patterns.
Customer segmentation: Assign each customer to the proper group by identifying customer groups.
Analyse sales campaigns: The effectiveness of a sales campaign can be determined by studying factors such as the advertisements used and the discounts offered.

B. Banking or Finance

How can we use it? Data mining is used in the financial sector for credit analysis, marketing, predicting payment default, ranking investments, cash management, forecasting operations and many more [17]. Data mining techniques can be used in the following applications:

Credit scoring: Factors such as customer payment history, which can influence loan payment, can be distinguished.
Customer retention: Adjust the portfolio, pricing and promotions of products according to customer shopping patterns.
Customer segmentation: Include new customers in the right groups by establishing certain customer groups.
Predict customer profitability: Factors such as the products used by customers help to identify patterns and predict the profitability of customers.

Rules Visualizer of MineSet [14] and Nicheworks [15] are tools which can be used to identify frequently purchased products. Performance analysis can be done with the help of an explanation-based mining system called Spotlight [16].

C. Insurance

Data mining is used in many business practices, such as performing complex classifications and correlations, gathering new customers while retaining the existing ones, and designing and selecting policies [19]. Data mining techniques have the following applications:

Fraud detection: The factors which indicate a high probability of a claim or a fraud taking place, and their different patterns, can be analysed.
Risk factor identification: Factors such as behaviour patterns or customer claims history may influence the insured's level of risk.
Customer segmentation and retention: Identify such packages and discounts which could


increase the loyalty of customers, and assign each new customer to an appropriate group.

Sometimes more than one machine learning technique is used in a data mining application. Kim and Noh [14] report an integrated system that combines neural networks (NN) and case-based reasoning (CBR) to forecast the rate of interest for treasury bills and corporate bonds. In some fields of finance, data visualization is used. Knowledge Seeker, GUHA and KEX are used to identify accounts with interesting behaviour patterns [18]. FALCON uses an NN-based approach to identify suspicious credit card transactions [15].

D. Biomedical and DNA Data Analysis

Nowadays data mining is widely used in areas related to medical science such as genetics, DNA, medicine and biomedicine. It is used in the field of genetics to learn about the mapping relationships between human DNA sequences and susceptibility to certain diseases. Data mining serves as an aid in the treatment and prevention of diseases and in providing proper diagnosis [20]. Data mining techniques have the following applications:

Data cleansing and data mining: DNA data is found to be highly distributed and heterogeneous as well as uncontrolled in nature. The process of data mining can serve as a tool to properly systemize the data and then store it in a data warehouse or a database so that it can be used in research.

E. Telecommunication Industry

The telecommunication industry and technology grow at the same pace. Telecommunication services have grown from local and long-distance voice communication to advanced methods such as pager, fax, e-mail and cellular phones. They are now integrated with various communication technologies such as the internet, networks and computers [21]. Data mining techniques have the following applications:

Cluster analysis: Fraudulent activities pose a major threat to the telecommunication industry. The performance of the network is affected by these activities. Clustering can help in detecting these fraudulent patterns and increasing the efficiency of the various communication services.

F. CRM

Customer Relationship Management covers the process of acquiring and retaining customers, increasing their loyalty and executing customer-focused strategies. In order to maintain a proper relationship with the customer it is necessary to collect and analyse information [6].

V. CONCLUSION

This paper discusses the effect to date of data mining techniques on business intelligence. Two powerful tools determine growth in the business sector. The primary one is data mining, which is used to deal with large amounts of data to find useful results, whereas the secondary one is business intelligence, which helps in making business-related decisions. The paper shows that business analytics has a wide application domain in almost every industry where data is generated. That is why data mining is considered one of the most important frontiers in databases and information systems, and business intelligence an interface of the organization.

REFERENCES

[1] Ruxandra Petre, "Data Mining Solutions for the Business Environment", Database Systems Journal, vol. IV, no. 4/2013.
[2] Ashish K. Jha, Varun Jain, Vridhi Chowdhry and Indranil Bose, "Connecting the unconnected 'kirana' stores through social supply chain innovation can help small Indian businesses draw more benefits from the increasing purchasing power of consumers", Asian Management Insights.
[3] Executive Summary, Data Growth, Business Opportunities, and the IT Imperatives, http://www.emc.com/leadership/digital-universe/2014iview/executive-summary.htm
[4] Usama Fayyad, Gregory Piatetsky-Shapiro and Padhraic Smyth, "From Data Mining to Knowledge Discovery in Databases", AI Magazine, Vol. 17, Issue 3, 1996, ISSN 0738-4602, pp. 37-54.
[5] Nir Kaldero, Director of Sciences, head of data science experts.
[6] Rajkumar P, Blogger and software engineer, Big Data Made Simple, a Crayon Data resource.
[7] Arti J. Ugale, P. S. Mohod, "Business Intelligence Using Data Mining Techniques on Very Large Datasets", International Journal of Science and Research (IJSR), Volume 4, Issue 6, June 2015, pp. 2932-2937.
[8] Prachi Agarwal, "Benefits and Issues Surrounding Data Mining and its Application in the Retail Industry", International Journal of Scientific and Research Publications, Volume 4, Issue 7, July 2014.
[9] Niels Arnth-Jensen, "Applied Data Mining for Business Intelligence".
[10] Harvinder Singh, "Implementation Benefit to Business Intelligence using Data Mining Techniques", International Journal of Computing & Business Research, ISSN (Online): 2229-6166.
[11] R. Shortland, R. Scarfe, "Digging for gold", IEEE Review 41(5), 1995, pp. 213-217.
[12] Jiawei Han, Micheline Kamber and Jian Pei, Data Mining: Concepts and Techniques, Third Edition, Morgan Kaufmann Publishing, USA, 2011.
[13] Gordon S. Linoff and Michael J. A. Berry, Data Mining Techniques: for Marketing, Sales and Customer Relationship Management, Third Edition, Wiley Publishing, USA, 2011.
[14] B. G. Becker, "Using MineSet for knowledge discovery", IEEE Computer Graphics and Applications (1997), pp. 75-78.
[15] R. J. Brachman, T. Khabaza, W. Kloesgen, G. Piatetsky-Shapiro, E. Simoudis, "Mining Business Databases", Communications of the ACM 39(11), pp. 42-48.
[16] S. S. Anand, A. R. Pattrick, J. G. Hughes, D. A. Bell, "A data mining methodology for cross sales", Knowledge-Based Systems, 10, 1998, pp. 449-461.


[17] Vikas Jayasree and Rethnamoney Vijayalakshmi Siva Balan, "A Review on Data Mining in Banking Sector", American Journal of Applied Sciences, Vol. 10, Issue 10, 2013, ISSN 1554-3641, pp. 1160-1165.
[18] J. Rauch, P. Berka, "Knowledge Discovery in Financial Data - a case study", Neural Network World 4(5), 1997, pp. 427-437.
[19] A. B. Devale and Dr. R. V. Kulkarni, "Applications of data mining techniques in life insurance", International Journal of Data Mining & Knowledge Management Process, Vol. 2, Issue 4, July 2012, ISSN 2230-9608, pp. 31-40.
[20] Biomedical part ref and telecommunication: Simmi Bagga.
[21] Simmi Bagga, G. N. Singh, "Applications of Data Mining", IJSETT.
[22] Pui Mun Lee, "Use Of Data Mining In Business Analytics To Support Business Competitiveness", Review of Business Information Systems, Second Quarter 2013, Volume 17, Number 2.
[23] Anita Kumari Nanda and Brojo Kishore Mishra, "Application of Fuzzy Data Mining in E-Government", 1st Int. Conf. on Computing, Communication and Sensor Networks-CCSN'2012, Purushottam Institute of Engineering & Technology, Rourkela, Nov 22-23, 2012.
