Business Intelligence Project: Using Predictive Analytics to Improve A Business
Abstract
Data mining is a powerful method of extracting knowledge/information from large amount of data. It has received great attention in recent years because of growing amount of data and the persistent need of turning the huge data into information. It is widely adapted in many fields like bioinformatics, business analytics, marketing, security and many more. For large scale companies Predictive Analysis, Frequent Pattern Mining algorithms and various data mining techniques have proved very effective techniques for improving their sales. One such algorithm Apriori Algorithm can be used for the coffee shop for improving the sale. Apriori algorithm can be used efficiently to train a program with historical data of frequently bought together items and it can predict or suggest other related items to Customers. Such a system when integrated with any business can make it easier for Customers to find the items they may be looking for.
Background and Aim:
Predictive Analytics has proven extremely effective in recent times for huge companies with large data, this is true for companies such as “Google” and “Amazon” whereby in the case of amazon for the year 2020, 35% of their sales came from the suggested items in the “people who have bought this item also got..”, this would be roughly 56$ Billion from a total of 163$ Billion in Amazon’s online stores Revenue for 2020.
This increase proved to be a lucrative technology and we believe that today, even small businesses should start looking at such methods. The three things that we will be focusing on in this project will be; Inventory, customer analysis and finally sales.
We are aiming to be able to raise awareness on how current technological advances can help small businesses thrive in such uncertain times, given the severe hit taken by small businesses (50% of restaurants in New York are out of business due to COVID-19 pandemic).
Methodology:
We will be teaming with a local small coffee shop owner to get real data of inventory, sales and customer data. We will create a python program that will help create a model that can be applied to improve sales, customer journey and inventory tracking. We will then use our finding and apply it in the coffee shop and determine the rate of success of our model on the business.
Keywords:
Big data, Predictive analysis, Apriori algorithm, Frequent pattern, Data mining
Data mining is a powerful method of extracting knowledge/information from large amount of data. It has great attention in recent years because of growing amount of data and the persistent need of turning the huge data into information. It is widely adapted in many fields like bioinformatics, business analytics, marketing, security and many more.
An important task of data mining is discovering frequent patterns that play an important role in clustering, associations mining, correlations, etc. Frequent patterns are patterns that appear in a data set frequently, such as itemsets, subsequences, or substructures. An itemset A (or subsequence, or substructure) is said to be frequent if it satisfies the predetermined minimum support count, where
Predictive Analytics is the process of transforming data into future insights. The backbone of predictive analytics are models, this is done by looking at both historical and current data to provide actionable insights.
Many current studies have been conducted in the use of predictive analytics and they have been translated in applications (usually in beta stages) that are slowly emerging in the market but have still not been accepted by the general public. A few examples of these great softwares are “RapidMiner” and “IBM SPSS Modeler”, whereby these apps unify data science lifecycles from data preprocessing to machine learning and predictive analytics.
Time series and sequential data mining is one of the most important problems from 10 challenges identified in paper [4]. Clustering, classification, and trend prediction of these data is an important open research topic. Another problem identified in [4] is mining complex data in the form of graphs.
Mining frequent tree pattern is an important open research area. Previous research studies highly suggest the pattern growth method for efficient pattern mining. Authors of [6] have developed a pattern growth method for mining frequent tree patterns. Two algorithms, Chopper and XSpanner, have been devised, from which XSpanner algorithm is faster than Chopper. These two algorithms perform better than TreeMinerV of M.J.Zaki, Effciently mining frequent trees in a forest of KDD02 [11]. In that, Chopper consists of two separate phases (i) mining sequential patterns and (ii) the extraction of frequent tree patterns. It generates and tests all possible tree patterns of the database. XSpanner algorithm combines these two phases of Chopper algorithm.
Table 1: Comaprison of various Association Rule Mining Algorithms[12]
Algorithm | Adavntages | Disadvantage | Applications |
AIS | Focuses on improving the quality of database and process the dicision support queries. | Candidate set generated on the fly. | Not frequently used, but when used is used for small problems. |
Easy to use. | Size of candidate set large. | ||
Better than STEM.
|
Requires multiple scans on whole database. | ||
Needs more memory. | |||
STEM | Seperates generation from counting. | Very large execution time. | Not frequently used. |
Size of candidate set is large. | |||
Apriori | Fast, more efficient than AIS | Takes a lot of memory.
|
Best for closed itemsets. |
Less candidate sets. Generates candidate sets on the from only those items that were found large. | |||
Apriori TID | Doesn’t use whole database to count candidate sets. | – | Used for smaller problems. |
Better than STEM and it is fast. | |||
Apriori Hybrid | Better than Aprior and Apriori TID | Used where Apriori and Ariori TID can be used. | |
FP-Growth | Only 2-passes of dataset | Using tree structure creates complexity | Used in cases of large problems as it doesn’t require generation of candidate sets. |
Compresses dataset | Not suitable for incremental mining. | ||
No candidate set generation required. | |||
Rapid Association rule mining | Avoids candidate generation process | Requires more memory | |
Faster than FP-Tree algorithm |
In the best case scenario we would like to empower small businesses around the world by giving them more comfort and security when having to make business decisions, this will be done by helping them understand their business in ways that were previously not possible nor feasible.
We will be using methods such as Apriori Algorithm[3] to help us understand customers’ buying patterns to be able to use this to our advantage in a way that we can bundle and group these items with a discount and theoretically this would reflect in higher sales for the business.
We would like to contribute to the global knowledge of small business operations the few techniques that we will be developing in hopes to empower these businesses to survive and prosper in a rapidly changing marketplace.
Apriori Algorithm:
Apriori algorithm is given by R. Agrawal and R. Srikant in 1994 for finding frequent itemsets in a dataset for boolean association rule. Name of the algorithm is Apriori since it uses prior knowledge of frequent itemset properties. We will use an iterative method identify where k-frequent itemsets are used to find k+1 itemsets.
To improve the efficiency of iterative generation of frequent itemsets, an important property is used called Apriori property which helps by reducing the search space.
Apriori Property:
All non-empty subset of frequent itemset must be frequent. Apriori algorithm assumes that “All subsets of a frequent itemset must be frequent (Apriori property). If an itemset is infrequent, all its supersets will be infrequent.”
Step-1: Determine the support of itemsets from a database (CSV file) of transactions, and select the minimum support and confidence.
Step-2: Include all supports in the transaction with support above the configured minimum support value.
Step-3: Find all the rules of these subsets that have higher confidence value than the threshold or minimum confidence.
Step-4: Sort the rules as the decreasing order of lift.
Consider the following dataset and we will find frequent itemsets and generate association rules for them.
Items bought together |
I1, I2, I5 |
I2, I4 |
I2, I3 |
I1, I2, I4 |
I1, I3 |
I2, I3 |
I1, I3 |
I1, I2, I3, I5 |
I1, I2, I3 |
minimum support count is 2
Step-1:
(I) Create a table of itemsets and support counts of each item present in dataset – Call it candidate set C1
Itemset | Support Count |
I1 | 6 |
I2 | 7 |
I3 | 6 |
I4 | 2 |
I5 | 2 |
(II) Compare item support count for items in candidate set C1 and remove entries which has support count less than the minimum support count (i.e. less than 2). The resultant set is Itemset L1.
Itemset | Support Count |
I1 | 6 |
I2 | 7 |
I3 | 6 |
I4 | 2 |
I5 | 2 |
Step-2:
(I)
Itemset | Support Count |
Are you busy and do not have time to handle your assignment? Are you scared that your paper will not make the grade? Do you have responsibilities that may hinder you from turning in your assignment on time? Are you tired and can barely handle your assignment? Are your grades inconsistent?
Whichever your reason is, it is valid! You can get professional academic help from our service at affordable rates. We have a team of professional academic writers who can handle all your assignments.
Students barely have time to read. We got you! Have your literature essay or book review written without having the hassle of reading the book. You can get your literature paper custom-written for you by our literature specialists.
Do you struggle with finance? No need to torture yourself if finance is not your cup of tea. You can order your finance paper from our academic writing service and get 100% original work from competent finance experts.
Computer science is a tough subject. Fortunately, our computer science experts are up to the match. No need to stress and have sleepless nights. Our academic writers will tackle all your computer science assignments and deliver them on time. Let us handle all your python, java, ruby, JavaScript, php , C+ assignments!
While psychology may be an interesting subject, you may lack sufficient time to handle your assignments. Don’t despair; by using our academic writing service, you can be assured of perfect grades. Moreover, your grades will be consistent.
Engineering is quite a demanding subject. Students face a lot of pressure and barely have enough time to do what they love to do. Our academic writing service got you covered! Our engineering specialists follow the paper instructions and ensure timely delivery of the paper.
In the nursing course, you may have difficulties with literature reviews, annotated bibliographies, critical essays, and other assignments. Our nursing assignment writers will offer you professional nursing paper help at low prices.
Truth be told, sociology papers can be quite exhausting. Our academic writing service relieves you of fatigue, pressure, and stress. You can relax and have peace of mind as our academic writers handle your sociology assignment.
We take pride in having some of the best business writers in the industry. Our business writers have a lot of experience in the field. They are reliable, and you can be assured of a high-grade paper. They are able to handle business papers of any subject, length, deadline, and difficulty!
We boast of having some of the most experienced statistics experts in the industry. Our statistics experts have diverse skills, expertise, and knowledge to handle any kind of assignment. They have access to all kinds of software to get your assignment done.
Writing a law essay may prove to be an insurmountable obstacle, especially when you need to know the peculiarities of the legislative framework. Take advantage of our top-notch law specialists and get superb grades and 100% satisfaction.
We have highlighted some of the most popular subjects we handle above. Those are just a tip of the iceberg. We deal in all academic disciplines since our writers are as diverse. They have been drawn from across all disciplines, and orders are assigned to those writers believed to be the best in the field. In a nutshell, there is no task we cannot handle; all you need to do is place your order with us. As long as your instructions are clear, just trust we shall deliver irrespective of the discipline.
Our essay writers are graduates with bachelor's, masters, Ph.D., and doctorate degrees in various subjects. The minimum requirement to be an essay writer with our essay writing service is to have a college degree. All our academic writers have a minimum of two years of academic writing. We have a stringent recruitment process to ensure that we get only the most competent essay writers in the industry. We also ensure that the writers are handsomely compensated for their value. The majority of our writers are native English speakers. As such, the fluency of language and grammar is impeccable.
There is a very low likelihood that you won’t like the paper.
Not at all. All papers are written from scratch. There is no way your tutor or instructor will realize that you did not write the paper yourself. In fact, we recommend using our assignment help services for consistent results.
We check all papers for plagiarism before we submit them. We use powerful plagiarism checking software such as SafeAssign, LopesWrite, and Turnitin. We also upload the plagiarism report so that you can review it. We understand that plagiarism is academic suicide. We would not take the risk of submitting plagiarized work and jeopardize your academic journey. Furthermore, we do not sell or use prewritten papers, and each paper is written from scratch.
You determine when you get the paper by setting the deadline when placing the order. All papers are delivered within the deadline. We are well aware that we operate in a time-sensitive industry. As such, we have laid out strategies to ensure that the client receives the paper on time and they never miss the deadline. We understand that papers that are submitted late have some points deducted. We do not want you to miss any points due to late submission. We work on beating deadlines by huge margins in order to ensure that you have ample time to review the paper before you submit it.
We have a privacy and confidentiality policy that guides our work. We NEVER share any customer information with third parties. Noone will ever know that you used our assignment help services. It’s only between you and us. We are bound by our policies to protect the customer’s identity and information. All your information, such as your names, phone number, email, order information, and so on, are protected. We have robust security systems that ensure that your data is protected. Hacking our systems is close to impossible, and it has never happened.
You fill all the paper instructions in the order form. Make sure you include all the helpful materials so that our academic writers can deliver the perfect paper. It will also help to eliminate unnecessary revisions.
Proceed to pay for the paper so that it can be assigned to one of our expert academic writers. The paper subject is matched with the writer’s area of specialization.
You communicate with the writer and know about the progress of the paper. The client can ask the writer for drafts of the paper. The client can upload extra material and include additional instructions from the lecturer. Receive a paper.
The paper is sent to your email and uploaded to your personal account. You also get a plagiarism report attached to your paper.
Delivering a high-quality product at a reasonable price is not enough anymore.
That’s why we have developed 5 beneficial guarantees that will make your experience with our service enjoyable, easy, and safe.
You have to be 100% sure of the quality of your product to give a money-back guarantee. This describes us perfectly. Make sure that this guarantee is totally transparent.
Read moreEach paper is composed from scratch, according to your instructions. It is then checked by our plagiarism-detection software. There is no gap where plagiarism could squeeze in.
Read moreThanks to our free revisions, there is no way for you to be unsatisfied. We will work on your paper until you are completely happy with the result.
Read moreYour email is safe, as we store it according to international data protection rules. Your bank details are secure, as we use only reliable payment systems.
Read moreBy sending us your money, you buy the service we provide. Check out our terms and conditions if you prefer business talks to be laid out in official language.
Read more