I want to go through the Wikipedia series on Machine Learning and Data mining. Data mining is the process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.
Bootstrap aggregating, also called bagging, is a machine learning ensemble meta-algorithm designed to improve the stability and accuracy of ML classification and regression algorithms. It also reduced variance and overfitting. Although it is usually applied to decision tree methods, it can be used with any type of method.
There are three types of datasets in bootstrap aggregating. These are the original, bootstrap, and out-of-bag datasets. The bootstrap dataset is made by randomly picking objects from the original dataset. It must be same size of the original dataset, but the bootstrap dataset can have duplicate objects. The out-of-bag dataset represents the instances that were not in the bootstrap dataset. The next step of the algorithm involves the generation of decision trees from the bootstrapped dataset.
Advantages:
Many weak learners aggregated typically outperform a single learner over the entire set, and have less overfit
Reduces variance in high-variance low-bias weak learner, which can improve efficiency
Can be performed in parallel, as each separate bootstrap can be processed on its own before aggregation
Disadvantages:
For a weak learner with high variance, bagging will also carry high bias into its aggregate
Loss of interpretability of a model
Can be computationally expensive depending on the dataset
Are you sure you want to delete this article section? You can not undo this change.
Add a Comment
Annotate Article
Share Article
Successfully copied article URL to clipboard!
Something went wrong copying the article URL to the clipboard.
Successfully copied editor state to clipboard!
A previous, saved version of this article is saved. To compare the current version of the article with the previous version - or to replace the current version with the saved version - click the icon above the text editor.
Successfully copied URL to clipboard!
Copied code to clipboard.
Copied TeX code to clipboard.
Uploaded file must be an image of type .jpeg, .jpg, .png, .webp, .gif, .bmp, or .svg.
Image Size must be less than 5MB.
Uploaded file must be an image of type jpeg, jpg, png, webp, avif, tiff, or svg.
Something went wrong uploading the image to the database. Try reloading the page.
The maximum number of images you can upload is 30.
There was an error taking an image using the device's camera. Try uploading an image instead.
Uploaded file must be an audio file.
Audio file must be an audio file of type .m4a, .flac, .mp3, .mp4, .wav, .wma, .aac, .webm, or .mpeg.
Audio file must be less than 300 MB in size.
Something went wrong uploading the audio file. Try reloading the page.
The maximum number of audio files you can upload is 10.
There was an error capturing an audio recording using the device's microphone. Try uploading an audio recording or video instead.
Uploaded file must be an video file.
Video file must be an audio file of type .mp4, .mov, .avi, .wmv, .avchd, .webm, or .flv.
Video file must be less than 300 MB in size.
Something went wrong uploading the video file. Try reloading the page.
The maximum number of video files you can upload is 10.
There was an error taking a video using the device's camera / microphone. Try uploading a video instead.
Comments
You have to be logged in to add a comment
User Comments
There are currently no comments for this article.