تحلیل و طبقه‌بندی کاربری اراضی با استفاده از تصاویر هوایی و یادگیری عمیق مبتنی بر معماری U-Net

فرهنگی, مهدی; میلان, اصغر; صادقیان, سعید

doi:10.22061/jrsgr.2024.11250.1083

نوع مقاله : مقاله پژوهشی

نویسندگان

گروه مهندسی نقشه برداری، دانشکده مهندسی عمران آب و محیط زیست، دانشگاه شهید بهشتی، تهران، ایران

https://doi.org/10.22061/jrsgr.2024.11250.1083

چکیده

پیشینه و اهداف: طبقه‌بندی دقیق کاربری اراضی برای مدیریت مؤثر منابع طبیعی، برنامه‌ریزی شهری، کشاورزی دقیق و پایش محیط زیست ضروری است. طبقه‌بندی‌ به پیش‌بینی و پیشگیری از مشکلات زیست‌محیطی کمک می‌کند. روش‌هایی مانند تصاویر ماهواره‌ای و هوایی با وضوح بالا، GIS و تکنیک‌های یادگیری عمیق از جمله شبکه‌های عصبی کانولوشنال (CNN) و معماری U-Net دقت بالایی در تحلیل و طبقه‌بندی تصاویر هوایی ارائه می‌دهند. شبکه U-Net با ساختار منحصر به فرد خود در تعریف مرزهای کاربری اراضی برتری دارد. این مطالعه بر روی منطقه‌ای در لهستان تمرکز دارد و از مدل U-Net برای افزایش دقت و کارایی طبقه‌بندی از طریق تکنیک‌های منظم‌سازی و بهینه‌ساز Adam استفاده می‌کند.
روش‌ها‌: در این تحقیق از تصاویر هوایی با قدرت تفکیک ۲۵ سانتی‌متر و در باند مرئی برای تحلیل و طبقه‌بندی کاربری اراضی استفاده شده است. مدل U-Net به دلیل معماری خاص خود، شامل بلوک‌های کانولوشن و فعال‌سازی ReLU، برای استخراج ویژگی‌های مکانی دقیق و حفظ جزئیات تصویر انتخاب شد. به منظور افزایش دقت و جلوگیری از بیش برازش، از تکنیک‌های منظم‌سازی مانند حذف تصادفی (Dropout) و Regularization L2 بهره گرفته شد. همچنین، برای بهبود همگرایی مدل، روش‌های افزون‌سازی داده و تکنیک توقف زودهنگام (Early Stopping) به کار رفته است. تصاویر هوایی به قطعات کوچک‌تر با ابعاد ۲۵۶×۲۵۶ پیکسل تقسیم و به سه مجموعه آموزشی، اعتبارسنجی و آزمون تقسیم شدند.
یافته‌ها: الگوریتم U-Net بر روی داده‌های شهر پوزنان در کشور لهستان اعمال شد. این داده‌ها توسط کارشناسان برچسب‌گذاری شدند و شامل چهار نوع کاربری اراضی: ساختمان‌ها، جنگل‌ها، جاده‌ها و آب هستند. از 769 تصویر برچسب‌گذاری شده، 576 تصویر برای آموزش مدل (که پس از تکنیک افزون‌سازی داده به 2304 تصویر گسترش یافت)، 183 تصویر برای اعتبارسنجی و 10 تصویر نیز برای آزمایش استفاده شد. این مدل، با استفاده از زبان برنامه‌نویسی Python و کتابخانه Keras بر بستر TensorFlow توسعه یافته و در Google Colab آموزش داده شد و پس از 96 تکرار به دقت بالایی رسید و با نقشه‌های برچسب‌گذاری شده توسط کارشناسان اعتبارسنجی شد. در حالی که مدل U-Net در دسته‌بندی کلی عملکرد خوبی داشت، با چالش‌هایی در کلاس‌های نادر مانند آب مواجه شد. افزایش داده‌ها و نمونه‌های بیشتر برای این کلاس‌ها می‌تواند دقت را بهبود بخشد. دقت‌های آموزش و اعتبارسنجی به ترتیب به 0.95 و 0.85 رسید و خطای اعتبارسنجی در حدود 0.5 تثبیت شد. مدل U-Net بهبودهای قابل توجهی در دقت کلی، ضریب کاپا و امتیاز ژاکارد نسبت به مطالعات قبلی نشان داد که حاکی از اهمیت داده‌های با کیفیت و تنظیم دقیق پارامترها است.
نتیجه‌گیری: در این مطالعه، مدل یادگیری عمیق U-Net، برای دسته‌بندی دقیق کاربری اراضی با استفاده از تصاویر هوایی مورد ارزیابی قرار گرفت. نتایج نشان می‌دهد که این مدل به طور مؤثر انواع کاربری اراضی را با دقت بالا شناسایی و تفکیک کرده است. ساختار U-Net دقت کلی 92.47%، امتیاز ژاکارد 54.45% و ضریب کاپا 79.59% را به دست آورد. این نتایج توانایی قوی مدل را در تعریف مرزهای کلاس‌ها نشان می‌دهد. بهبودهای آینده می‌تواند شامل استفاده از تصاویر چند طیفی و فراطیفی برای اطلاعات دقیق‌تر، ترکیب U-Net با سایر شبکه‌ها مانند ANN، بهینه‌سازی فراپارامترها با استفاده از روش‌های جستجوی پیشرفته و به‌کارگیری یادگیری انتقالی، به ویژه در شرایط با داده‌های محدود باشد. اجرای این استراتژی‌ها می‌تواند دقت و کارایی در دسته‌بندی کاربری اراضی را افزایش دهد و کاربردهای وسیع‌تری در زمینه‌های علمی و عملی ارائه دهد.

کلیدواژه‌ها

موضوعات

فتوگرامتری

عنوان مقاله [English]

Analysis and classification of land use using aerial imagery and deep learning based on U-Net architecture

نویسندگان [English]

M. Farhangi
A. Milan
S. Sadeghian

Department of Surveying Engineering, Faculty of Civil, Water and Environmental Engineering, Shahid Beheshti University, Tehran, Iran

چکیده [English]

Background and Objectives: Accurate land use classification is essential for effective natural resource management, urban planning, precision agriculture, and environmental monitoring. Such classification helps predict and prevent environmental issues. Methods like high-resolution satellite and aerial imagery, GIS, and deep learning techniques, including Convolutional Neural Networks (CNN) and the U-Net architecture, offer high precision in analyzing and classifying aerial images. The U-Net network, known for its unique structure, excels in defining land use boundaries. This study focuses on a region in Poland, using the U-Net model to enhance classification accuracy and efficiency through regularization techniques and the Adam optimizer.
Methods: This research used high-resolution aerial images and a deep learning model based on the U-Net architecture to achieve precise land use classification. The approach aimed at improving classification accuracy across four land use categories. High-resolution aerial images were collected, corrected geometrically and radiometrically to create orthophotos. These images were labeled and cropped to 256x256 pixels, with data augmentation techniques such as rotation and flipping applied. The dataset was divided into training (75%), validation (25%), and testing (5% of the validation set). The U-Net model includes convolutional blocks with 3x3 kernels, normalization layers, and dropout layers, organized into encoding, decoding, and output layers. Hyperparameters included the Adam optimizer, a learning rate of 0.0001, and a batch size of 16. Model performance was evaluated using metrics like overall accuracy, kappa coefficient, and Jaccard score.
Findings: The algorithm was tested on data from Poznań, Poland, utilizing high-resolution aerial images from 2021 with a 25 cm spatial resolution. The data, labeled by experts, covered four land use types: buildings, forests, roads, and water. Out of 769 labeled images, 576 were used for training (expanded to 2304 samples after augmentation), 183 for validation, and 10 for testing. The model, developed using Python and Keras on TensorFlow and trained in Google Colab, achieved high accuracy after 96 iterations, validated against expert-labeled maps. While the U-Net model performed well in general classification, it encountered challenges with rare classes like water. Data augmentation and more samples for such classes could improve accuracy. The training and validation accuracy reached 0.95 and 0.85, respectively, with validation errors stabilizing around 0.5. The U-Net model demonstrated significant improvements in accuracy, kappa coefficient, and Jaccard index compared to previous studies, underscoring the importance of high-quality data and precise parameter tuning.
Conclusion: The study assessed the U-Net deep learning model for accurate land use classification using aerial images. Results indicate that the model effectively identified and differentiated between land use types with high precision. The U-Net structure achieved an overall accuracy of 92.47%, a Jaccard index of 54.45%, and a kappa coefficient of 79.59%. These results demonstrate the model’s strong capability in defining class boundaries. Future improvements could involve utilizing multispectral and hyperspectral images for more detailed information, combining U-Net with other networks like ANN, optimizing hyperparameters with advanced search methods, and employing transfer learning, especially with limited training data. Implementing these strategies could enhance accuracy and efficiency in land use classification, with broader applications in scientific and practical fields.

کلیدواژه‌ها [English]

Land use classification
Aerial imagery
Deep learning
Convolutional neural network
U-Net

COPYRIGHTS
© 2024 The Author(s). This is an open-access article distributed under the terms and conditions of the Creative Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) (https://creativecommons.org/licenses/by-nc/4.0/)