{"id":300,"date":"2022-01-18T00:00:00","date_gmt":"2022-01-18T00:00:00","guid":{"rendered":"https:\/\/tac.debuzzify.com\/?p=300"},"modified":"2023-06-21T12:55:08","modified_gmt":"2023-06-21T12:55:08","slug":"data-augmentation-in-python","status":"publish","type":"post","link":"https:\/\/www.the-analytics.club\/data-augmentation-in-python\/","title":{"rendered":"This Tiny Python Package Creates Huge Augmented Datasets"},"content":{"rendered":"\n\n\n<p>After months of hard work, you and your team have gathered a vast amount of data for your machine-learning project.<\/p>\n\n\n\n<p>The project budget is almost spent, and what\u2019s left is only enough to train the model.<\/p>\n\n\n\n<p>But as soon as you train the model, you see that it isn\u2019t generalizing well. The data you collected is not enough. The training accuracy is high, but on the validation set, it drops drastically.<\/p>\n\n\n\n<p>This is what we call <a href=\"https:\/\/www.ibm.com\/cloud\/learn\/overfitting\" target=\"_blank\" rel=\"noopener\">overfitting<\/a>.<\/p>\n\n\n\n<p>There are different ways to deal with overfitting. But your team concludes none of them are working.<\/p>\n\n\n\n<p>What\u2019s left is one of two options:<\/p>\n\n\n\n<ul>\n<li>Collect more data<\/li>\n\n\n\n<li>Create copies of existing data points with slight adjustments (data augmentation)<\/li>\n<\/ul>\n\n\n\n<p>Data augmentation has been shown to <a href=\"https:\/\/www.sciencedirect.com\/science\/article\/abs\/pii\/S0957417420305200\" target=\"_blank\" rel=\"noopener\">improve machine learning model accuracy<\/a> without collecting further data. It\u2019s a widespread technique among practitioners.<\/p>\n\n\n\n<p>Collecting data is a costly task in many cases. 
You may have to pay for equipment and permissions, not to mention labeling the data after collection.<\/p>\n\n\n\n<p><i>Related:<\/i> <a href=\"https:\/\/www.the-analytics.club\/is-deep-learning-right-for-you\"><b><i>How to know if deep learning is right for you?<\/i><\/b><\/a><\/p>\n\n\n\n<p>Take, for instance, medical image classification. There are <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/books\/NBK236546\/\" target=\"_blank\" rel=\"noopener\">legal restrictions<\/a> on how you may collect healthcare data. And after collection, labeling it requires the expertise of skilled professionals such as doctors.<\/p>\n\n\n\n<p>This post introduces an image data augmentation tool called <a href=\"https:\/\/albumentations.ai\/\" target=\"_blank\" rel=\"noopener\">Albumentations<\/a> through examples. It\u2019s an open-source Python library released under the MIT license.<\/p>\n\n\n\n<p>You can install it from PyPI with the following command. If you are looking for advanced instructions, please consult the official <a href=\"https:\/\/albumentations.ai\/docs\/\" target=\"_blank\" rel=\"noopener\">documentation<\/a>.<\/p>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro padding-bottom-disabled\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" 
style=\"font-size:16px;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:20px\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#2e3440ff\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" data-code=\"pip install -U albumentations\" style=\"color:#d8dee9ff;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki nord\" style=\"background-color: #2e3440ff\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color: #D8DEE9FF\">pip install <\/span><span style=\"color: #81A1C1\">-<\/span><span style=\"color: #D8DEE9FF\">U albumentations<\/span><\/span><\/code><\/pre><span 
style=\"display:flex;align-items:flex-end;padding:10px;width:100%;justify-content:flex-end;background-color:#2e3440ff;color:#c8d0e0;font-size:12px;line-height:1;position:relative\">Bash<\/span><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">What does data augmentation do?<\/h2>\n\n\n\n<p>Data augmentation creates copies of existing data points with some transformation. For instance, we can crop and rotate an image so that it looks new to the model we are training. This way, we can multiply the size of our dataset and train the model on it to improve its accuracy.<\/p>\n\n\n\n<p>Here\u2019s an illustration of what it means.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"630\" height=\"354\" src=\"https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-40.png\" alt=\"This is how data augmentation works. We can create multiple copies of the original image with slight variations, so that the machine learning model sees them as different images and learns to recognize all of them. Rather than doing it manually, we can use a Python library such as Albumentations to integrate it into our pipeline.\" class=\"wp-image-774\" title=\"\" srcset=\"https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-40.png 630w, https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-40-300x169.png 300w\" sizes=\"(max-width: 630px) 100vw, 630px\" \/><\/figure><\/div>\n\n\n<p>We transformed the base image using different techniques. 
In many photos, we\u2019ve used more than one technique in combination. By continuing this process, we can generate a large number of data points.<\/p>\n\n\n\n<p><em>Related: <a href=\"https:\/\/towardsdatascience.com\/transfer-learning-in-deep-learning-641089950f5d\" target=\"_blank\" rel=\"noopener\">Transfer Learning: The highest leverage deep learning skill you can learn.<\/a><\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Creating an image augmentation pipeline using Albumentations<\/h2>\n\n\n\n<p>Creating an augmentation pipeline using Albumentations is very straightforward.<\/p>\n\n\n\n<p>First, we compose an augmentation pipeline by configuring a list of transformations. Then we can use any image processing library, such as Pillow or OpenCV, to read images from the filesystem. Every time we pass an image through the list of transformations we configured, it gives us an altered image.<\/p>\n\n\n\n<p>Here\u2019s an example you can replicate to get started.<\/p>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro padding-bottom-disabled\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" 
style=\"font-size:16px;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:20px\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#2e3440ff\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" data-code=\"import albumentations as A\nfrom PIL import Image\nimport numpy as np\n\n# Create a pipline with 4 different transformations. \ntransform = A.Compose(\n    [\n        A.RandomCrop(width=256, height=256),\n        A.HorizontalFlip(p=0.5),\n        A.RandomBrightnessContrast(brightness_limit=.5, contrast_limit=.3),\n        A.Rotate(),\n    ]\n)\n\n# Read the image and convert it to a numpy array\npillow_image = Image.open(&quot;image.original.jpg&quot;)\nimage = np.array(pillow_image)\n\n# Apply transformation\ntransformed = transform(image=image)\n\n# Access and show transformation\ntransformed_image = transformed[&quot;image&quot;]\nimg = Image.fromarray(transformed_image)\n\nimg.show()\" style=\"color:#d8dee9ff;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path 
class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki nord\" style=\"background-color: #2e3440ff\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color: #81A1C1\">import<\/span><span style=\"color: #D8DEE9FF\"> albumentations <\/span><span style=\"color: #81A1C1\">as<\/span><span style=\"color: #D8DEE9FF\"> A<\/span><\/span>\n<span class=\"line\"><span style=\"color: #81A1C1\">from<\/span><span style=\"color: #D8DEE9FF\"> PIL <\/span><span style=\"color: #81A1C1\">import<\/span><span style=\"color: #D8DEE9FF\"> Image<\/span><\/span>\n<span class=\"line\"><span style=\"color: #81A1C1\">import<\/span><span style=\"color: #D8DEE9FF\"> numpy <\/span><span style=\"color: #81A1C1\">as<\/span><span style=\"color: #D8DEE9FF\"> np<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color: #616E88\"># Create a pipline with 4 different transformations. 
<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">transform <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">Compose<\/span><span style=\"color: #ECEFF4\">(<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">    <\/span><span style=\"color: #ECEFF4\">[<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">        A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">RandomCrop<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9\">width<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #B48EAD\">256<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #D8DEE9\">height<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #B48EAD\">256<\/span><span style=\"color: #ECEFF4\">),<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">        A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">HorizontalFlip<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9\">p<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #B48EAD\">0.5<\/span><span style=\"color: #ECEFF4\">),<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">        A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">RandomBrightnessContrast<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9\">brightness_limit<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #B48EAD\">.5<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #D8DEE9\">contrast_limit<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #B48EAD\">.3<\/span><span style=\"color: 
#ECEFF4\">),<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">        A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">Rotate<\/span><span style=\"color: #ECEFF4\">(),<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">    <\/span><span style=\"color: #ECEFF4\">]<\/span><\/span>\n<span class=\"line\"><span style=\"color: #ECEFF4\">)<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color: #616E88\"># Read the image and convert it to a numpy array<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">pillow_image <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> Image<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">open<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #ECEFF4\">&quot;<\/span><span style=\"color: #A3BE8C\">image.original.jpg<\/span><span style=\"color: #ECEFF4\">&quot;<\/span><span style=\"color: #ECEFF4\">)<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">image <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> np<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">array<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9FF\">pillow_image<\/span><span style=\"color: #ECEFF4\">)<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color: #616E88\"># Apply transformation<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">transformed <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #88C0D0\">transform<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9\">image<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\">image<\/span><span style=\"color: #ECEFF4\">)<\/span><\/span>\n<span 
class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color: #616E88\"># Access and show transformation<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">transformed_image <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> transformed<\/span><span style=\"color: #ECEFF4\">[<\/span><span style=\"color: #ECEFF4\">&quot;<\/span><span style=\"color: #A3BE8C\">image<\/span><span style=\"color: #ECEFF4\">&quot;<\/span><span style=\"color: #ECEFF4\">]<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">img <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> Image<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">fromarray<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9FF\">transformed_image<\/span><span style=\"color: #ECEFF4\">)<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">img<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">show<\/span><span style=\"color: #ECEFF4\">()<\/span><\/span><\/code><\/pre><span style=\"display:flex;align-items:flex-end;padding:10px;width:100%;justify-content:flex-end;background-color:#2e3440ff;color:#c8d0e0;font-size:12px;line-height:1;position:relative\">Python<\/span><\/div>\n\n\n\n<p>In the above example, we\u2019ve used four types of transformations.<\/p>\n\n\n\n<ol>\n<li> <span style=\"font-size: revert;\">We cropped the image starting from random locations. 
We configured RandomCrop to produce a 256&#215;256 picture.<\/span> <\/li>\n\n\n\n<li> <span style=\"font-size: revert;\">We used a horizontal flip. Note that this operation is not applied all the time. We\u2019ve configured a probability value of 0.5 for it. It means every image going through this pipeline has a 50% chance of being flipped horizontally.<\/span> <\/li>\n\n\n\n<li> <span style=\"font-size: revert;\">RandomBrightnessContrast changes the brightness and contrast of the image within limits of 0.5 and 0.3, respectively. We haven\u2019t explicitly set a probability value as we did with the horizontal flip. But the API has a default of 0.5. Hence, every image has a 50% chance of brightness and contrast modification. When it applies, brightness won\u2019t be altered by more than 50%, and contrast by no more than 30%.<\/span> <\/li>\n\n\n\n<li> <span style=\"font-size: revert;\">Finally, we randomly rotate the image. We haven\u2019t overridden any default values here.<\/span> <\/li>\n<\/ol>\n\n\n\n<p>Then we read an image from the filesystem using Pillow, a widespread Python library for image processing. We\u2019ve also converted it into a NumPy array, which is the format Albumentations expects.<\/p>\n\n\n\n<p>Lastly, we sent our image through the configured pipeline and displayed the result.<\/p>\n\n\n\n<p>Albumentations has dozens of such transformations. You can learn more about them in detail from their <a href=\"https:\/\/albumentations.ai\/docs\/getting_started\/transforms_and_targets\/\" target=\"_blank\" rel=\"noopener\">API documentation<\/a>.<\/p>\n\n\n\n<p><em>Related: <a href=\"\/the-prefect-way-to-automate-orchestrate-data-pipelines\"><strong>The Prefect Way to Automate &amp; Orchestrate Data Pipelines<\/strong><\/a><\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">How to create augmented data with annotations?<\/h2>\n\n\n\n<p>Most computer vision applications have to deal with annotated images. 
These are photos in which objects are marked and labeled for training the ML model.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"630\" height=\"354\" src=\"https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-41.png\" alt=\"A good data augmentation tool should automatically re-calculate the positions of the annotated objects in the transformed data. For instance, this image is flipped, and the positions of the dog and cat are different now. The Python data augmentation library Albumentations re-annotated them correctly on the resulting image.\" class=\"wp-image-775\" title=\"\" srcset=\"https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-41.png 630w, https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-41-300x169.png 300w\" sizes=\"(max-width: 630px) 100vw, 630px\" \/><\/figure><\/div>\n\n\n<p>When augmenting such datasets, we also need to know the new positions of those annotated objects.<\/p>\n\n\n\n<p>Our augmentation technique should re-calculate the coordinates accordingly. This task is effortless in Albumentations. We need to provide the initial coordinates and category ID of each annotated object to the pipeline. 
The result will have its new position.<\/p>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro padding-bottom-disabled\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:16px;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:20px\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#2e3440ff\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" data-code=\"bboxes = [[0, 128, 300, 420], [366.7, 340, 270, 230]]\ncategory_ids = [1, 2]\n\ntransform = A.Compose(\n    [A.HorizontalFlip(p=0.5), A.Rotate()],\n    bbox_params=A.BboxParams(format=&quot;coco&quot;, label_fields=[&quot;category_ids&quot;]), # Configuring pipeline for annotation\n)\n\n# Passing annotation coordinates and categories with the image\ntransformed = transform(image=image, bboxes=bboxes, category_ids=category_ids)\n\" style=\"color:#d8dee9ff;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 
2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki nord\" style=\"background-color: #2e3440ff\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color: #D8DEE9FF\">bboxes <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #ECEFF4\">[[<\/span><span style=\"color: #B48EAD\">0<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #B48EAD\">128<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #B48EAD\">300<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #B48EAD\">420<\/span><span style=\"color: #ECEFF4\">],<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #ECEFF4\">[<\/span><span style=\"color: #B48EAD\">366.7<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #B48EAD\">340<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #B48EAD\">270<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #B48EAD\">230<\/span><span style=\"color: #ECEFF4\">]]<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">category_ids <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #ECEFF4\">[<\/span><span style=\"color: #B48EAD\">1<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #B48EAD\">2<\/span><span style=\"color: #ECEFF4\">]<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">transform <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: 
#D8DEE9FF\"> A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">Compose<\/span><span style=\"color: #ECEFF4\">(<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">    <\/span><span style=\"color: #ECEFF4\">[<\/span><span style=\"color: #D8DEE9FF\">A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">HorizontalFlip<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9\">p<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #B48EAD\">0.5<\/span><span style=\"color: #ECEFF4\">),<\/span><span style=\"color: #D8DEE9FF\"> A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">Rotate<\/span><span style=\"color: #ECEFF4\">()],<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">    <\/span><span style=\"color: #D8DEE9\">bbox_params<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\">A<\/span><span style=\"color: #ECEFF4\">.<\/span><span style=\"color: #88C0D0\">BboxParams<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9\">format<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #ECEFF4\">&quot;<\/span><span style=\"color: #A3BE8C\">coco<\/span><span style=\"color: #ECEFF4\">&quot;<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #D8DEE9\">label_fields<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #ECEFF4\">[<\/span><span style=\"color: #ECEFF4\">&quot;<\/span><span style=\"color: #A3BE8C\">category_ids<\/span><span style=\"color: #ECEFF4\">&quot;<\/span><span style=\"color: #ECEFF4\">]),<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #616E88\"># Configuring pipeline for annotation<\/span><\/span>\n<span class=\"line\"><span style=\"color: #ECEFF4\">)<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color: 
#616E88\"># Passing annotation coordinates and categories with the image<\/span><\/span>\n<span class=\"line\"><span style=\"color: #D8DEE9FF\">transformed <\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #88C0D0\">transform<\/span><span style=\"color: #ECEFF4\">(<\/span><span style=\"color: #D8DEE9\">image<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\">image<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #D8DEE9\">bboxes<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\">bboxes<\/span><span style=\"color: #ECEFF4\">,<\/span><span style=\"color: #D8DEE9FF\"> <\/span><span style=\"color: #D8DEE9\">category_ids<\/span><span style=\"color: #81A1C1\">=<\/span><span style=\"color: #D8DEE9FF\">category_ids<\/span><span style=\"color: #ECEFF4\">)<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><span style=\"display:flex;align-items:flex-end;padding:10px;width:100%;justify-content:flex-end;background-color:#2e3440ff;color:#c8d0e0;font-size:12px;line-height:1;position:relative\">Python<\/span><\/div>\n\n\n\n<p>In the above example, we\u2019ve altered the Compose call that creates the augmentation pipeline. We added another argument, bbox_params, with its configuration.<\/p>\n\n\n\n<p>With this, we\u2019re telling Albumentations to interpret the bounding boxes in the COCO format and to read the label of each box from the category_ids field. COCO is one of the <a href=\"https:\/\/albumentations.ai\/docs\/getting_started\/bounding_boxes_augmentation\/\" target=\"_blank\" rel=\"noopener\">four bounding-box formats<\/a> supported by this library.<\/p>\n\n\n\n<p>Finally, we pass two extra arguments at run time with every image. The first one defines each box by its starting coordinates, width, and height. 
The second gives the label of each box we defined.<\/p>\n\n\n\n<p>Here\u2019s how the result looks.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"630\" height=\"291\" src=\"https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-42.png\" alt=\"Resulting image has re-annotated objects.\" class=\"wp-image-776\" title=\"\" srcset=\"https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-42.png 630w, https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/image-42-300x139.png 300w\" sizes=\"(max-width: 630px) 100vw, 630px\" \/><\/figure><\/div>\n\n\n<p>After the transformation, the positions of the objects have changed, yet the pipeline has correctly moved the bounding boxes to their new locations.<\/p>\n\n\n\n<p><em>Related: <a href=\"\/how-to-speed-up-python-data-pipelines-up-to-91x\"><strong>How to speed up your data science pipelines up to 91X?<\/strong><\/a><\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Final Thoughts<\/h2>\n\n\n\n<p>Data augmentation saves a great deal of time, effort, and budget by reusing existing images. We could do this with any image processing library, but some tasks take extra effort to do correctly.<\/p>\n\n\n\n<p>For instance, annotated images require re-annotating the objects on the augmented image. A tool like Albumentations comes in handy in those cases.<\/p>\n\n\n\n<p>This post is a short introduction to what we can do with this Python library. 
I hope that the next time you train a machine learning model, you\u2019ll use it to improve accuracy without having to collect more data.<\/p>\n\n\n\n<p><em>Related: <a href=\"https:\/\/towardsdatascience.com\/3-ways-to-deploy-machine-learning-models-in-production-cdba15b00e\" target=\"_blank\" rel=\"noopener\"><strong>3 Ways to deploy machine learning models in Production<\/strong><\/a><\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<blockquote class=\"wp-block-quote\">\n<p>Thanks for the read, friend. It seems you and I have lots of common interests. Say Hi to me on <a href=\"https:\/\/www.linkedin.com\/in\/thuwarakesh\/\" target=\"_blank\" rel=\"nofollow noopener\">LinkedIn<\/a>, <a href=\"https:\/\/twitter.com\/Thuwarakesh\" target=\"_blank\" rel=\"nofollow noopener\">Twitter<\/a>, and <a href=\"https:\/\/thuwarakesh.medium.com\/subscribe\" target=\"_blank\" rel=\"nofollow noopener\">Medium<\/a>. I\u2019ll break the ice for you.<\/p>\n\n\n\n<p>Not a Medium member yet? 
Please use this link to <a href=\"https:\/\/thuwarakesh.medium.com\/membership\" target=\"_blank\" rel=\"nofollow noopener\">become a member<\/a> because I earn a commission for referring at no extra cost for you.<\/p>\n<\/blockquote>\n","protected":false},"excerpt":{"rendered":"<p>How to create an image data augmentation pipeline to generate tons of synthetic data points?<\/p>\n","protected":false},"author":2,"featured_media":88,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kad_blocks_custom_css":"","_kad_blocks_head_custom_js":"","_kad_blocks_body_custom_js":"","_kad_blocks_footer_custom_js":"","_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"footnotes":""},"categories":[4,12],"tags":[],"taxonomy_info":{"category":[{"value":4,"label":"Data Science"},{"value":12,"label":"MLOps"}]},"featured_image_src_large":["https:\/\/www.the-analytics.club\/wp-content\/uploads\/2023\/06\/data-augment.jpg",700,484,false],"author_info":{"display_name":"Thuwarakesh","author_link":"https:\/\/www.the-analytics.club\/author\/thuwarakesh\/"},"comment_info":0,"category_info":[{"term_id":4,"name":"Data Science","slug":"data-science","term_group":0,"term_taxonomy_id":4,"taxonomy":"category","description":"","parent":0,"count":22,"filter":"raw","cat_ID":4,"category_count":22,"category_description":"","cat_name":"Data 
Science","category_nicename":"data-science","category_parent":0},{"term_id":12,"name":"MLOps","slug":"mlops","term_group":0,"term_taxonomy_id":12,"taxonomy":"category","description":"","parent":0,"count":13,"filter":"raw","cat_ID":12,"category_count":13,"category_description":"","cat_name":"MLOps","category_nicename":"mlops","category_parent":0}],"tag_info":false,"_links":{"self":[{"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/posts\/300"}],"collection":[{"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/comments?post=300"}],"version-history":[{"count":1,"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/posts\/300\/revisions"}],"predecessor-version":[{"id":777,"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/posts\/300\/revisions\/777"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/media\/88"}],"wp:attachment":[{"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/media?parent=300"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/categories?post=300"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.the-analytics.club\/wp-json\/wp\/v2\/tags?post=300"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}