I want to go through the Wikipedia series on Machine Learning and Data mining. Data mining is the process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.
In data analysis, anomaly detection (also referred to as outlier detection and sometimes as novelty detection) is generally understood to be the identification of rare items, events or observations which deviate significantly from the majority of the data and do not conform to a well defined notion of normal behavior. Such examples may arouse suspicions of being generated by a different mechanism, or appear inconsistent with the remainder of that set of data.
Three board categories of anomaly detection techniques exist:
Supervised anomaly detection technique require a data set that has been labeled as normal and abnormal and involves training a classifier. However, this approach is rarely used in anomaly detection due to the general unavailability of labelled data and inherent nature of the classes
Semi-supervised anomaly detection techniques assume that some portion of the data is labelled. The techniques construct a model representing normal behavior from a given normal training data set, and then test the likelihood of a test instance being generated by the model
Unsupervised anomaly detection techniques assume the data is unlabeled and are by far the most common used due to their wider and relevant application.
There is no agreed upon definition for what qualifies as an anomaly, but here are some example definitions:
An outlier is an observation which deviates so much from the other observations as to arouse suspicions that it was generated by a different mechanism
Anomalies are instances of collections of data that occur very rarely in the data set and whose features differ significantly form most of the data
An outlier is an observation which appears to be inconsistent with the remainder of that set of data
Anomaly detection is applicable in a very large number and variety of domains, and is an important subarea of unsupervised machine learning.
Are you sure you want to delete this article section? You can not undo this change.
Add a Comment
Annotate Article
Share Article
Successfully copied article URL to clipboard!
Something went wrong copying the article URL to the clipboard.
Successfully copied editor state to clipboard!
A previous, saved version of this article is saved. To compare the current version of the article with the previous version - or to replace the current version with the saved version - click the icon above the text editor.
Successfully copied URL to clipboard!
Copied code to clipboard.
Copied TeX code to clipboard.
Uploaded file must be an image of type .jpeg, .jpg, .png, .webp, .gif, .bmp, or .svg.
Image Size must be less than 5MB.
Uploaded file must be an image of type jpeg, jpg, png, webp, avif, tiff, or svg.
Something went wrong uploading the image to the database. Try reloading the page.
The maximum number of images you can upload is 30.
There was an error taking an image using the device's camera. Try uploading an image instead.
Uploaded file must be an audio file.
Audio file must be an audio file of type .m4a, .flac, .mp3, .mp4, .wav, .wma, .aac, .webm, or .mpeg.
Audio file must be less than 300 MB in size.
Something went wrong uploading the audio file. Try reloading the page.
The maximum number of audio files you can upload is 10.
There was an error capturing an audio recording using the device's microphone. Try uploading an audio recording or video instead.
Uploaded file must be an video file.
Video file must be an audio file of type .mp4, .mov, .avi, .wmv, .avchd, .webm, or .flv.
Video file must be less than 300 MB in size.
Something went wrong uploading the video file. Try reloading the page.
The maximum number of video files you can upload is 10.
There was an error taking a video using the device's camera / microphone. Try uploading a video instead.
Comments
You have to be logged in to add a comment
User Comments
There are currently no comments for this article.