Data Analytics and Big Data by soraya sedkaoui
Author:soraya sedkaoui
Language: eng
Format: epub
Published: 2018-06-19T16:00:00+00:00
5.4.3. Data mining
Before one attempts to extract and acquire useful knowledge from data, it is important to understand the overall approach or the process that leads to finding new knowledge.
The process defines a sequence of steps (with eventual feedback) that should be followed to discover knowledge in data (see the knowledge discovery process). To advance through each step
successfully, we must apply effective data collection, description, analysis and interpretation [PIE 15]. Each step is usually realized with the help of available software tools. Data mining is a particular step in this process – application of specific algorithms for extracting models from data.
The additional steps in the process, such as data preparation, data selection, data cleaning, incorporation of appropriate prior knowledge, and proper interpretation of the results of mining, ensure that useful knowledge is derived from the data.
Data mining and knowledge discovery combines theory and heuristics toward extracting knowledge. To this end, data cleaning, learning and visualization might be also employed.
According to the Gartner Group, this process can be repetitive or interactive depending on the target objectives. We can say that the main task of data mining is using methods to automatically extract useful information from these data and make them available to decision-makers.
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(8309)
Azure Data and AI Architect Handbook by Olivier Mertens & Breght Van Baelen(6802)
Building Statistical Models in Python by Huy Hoang Nguyen & Paul N Adams & Stuart J Miller(6777)
Serverless Machine Learning with Amazon Redshift ML by Debu Panda & Phil Bates & Bhanu Pittampally & Sumeet Joshi(6666)
Data Wrangling on AWS by Navnit Shukla | Sankar M | Sam Palani(6450)
Driving Data Quality with Data Contracts by Andrew Jones(6394)
Machine Learning Model Serving Patterns and Best Practices by Md Johirul Islam(6151)
Learning SQL by Alan Beaulieu(6004)
Weapons of Math Destruction by Cathy O'Neil(5795)
Big Data Analysis with Python by Ivan Marin(5394)
Data Engineering with dbt by Roberto Zagni(4400)
Solidity Programming Essentials by Ritesh Modi(4048)
Time Series Analysis with Python Cookbook by Tarek A. Atwan(3907)
Pandas Cookbook by Theodore Petrou(3610)
Blockchain Basics by Daniel Drescher(3306)
Hands-On Machine Learning for Algorithmic Trading by Stefan Jansen(2914)
Feature Store for Machine Learning by Jayanth Kumar M J(2820)
Learn T-SQL Querying by Pam Lahoud & Pedro Lopes(2803)
Mastering Python for Finance by Unknown(2748)
