Below are the top sources and datasets available for download to jumpstart your association rule mining projects. 1. Online Retail Dataset (UCI Machine Learning Repository)
Finding the right data is the first step in mastering the Apriori algorithm for market basket analysis. Since this unsupervised machine learning model identifies frequent itemsets and associations within transactional data, you need datasets that reflect real-world shopping patterns.
It’s clean but offers enough complexity for time-series and sequential mining beyond basic Apriori. Download: Available at the UCI Machine Learning Repository. 2. Market Basket Optimization (Kaggle)
This is the gold standard for Apriori practitioners. It contains actual transactions from a UK-based non-store online retailer, making it perfect for discovering "if-then" relationships between products. Over 500,000.
For those focusing on big data and high-performance algorithms, the FIMI (Frequent Itemset Mining Implementations) repository offers an anonymized dataset from a Belgian store. What is the Apriori algorithm? - IBM
Below are the top sources and datasets available for download to jumpstart your association rule mining projects. 1. Online Retail Dataset (UCI Machine Learning Repository)
Finding the right data is the first step in mastering the Apriori algorithm for market basket analysis. Since this unsupervised machine learning model identifies frequent itemsets and associations within transactional data, you need datasets that reflect real-world shopping patterns.
It’s clean but offers enough complexity for time-series and sequential mining beyond basic Apriori. Download: Available at the UCI Machine Learning Repository. 2. Market Basket Optimization (Kaggle)
This is the gold standard for Apriori practitioners. It contains actual transactions from a UK-based non-store online retailer, making it perfect for discovering "if-then" relationships between products. Over 500,000.
For those focusing on big data and high-performance algorithms, the FIMI (Frequent Itemset Mining Implementations) repository offers an anonymized dataset from a Belgian store. What is the Apriori algorithm? - IBM