آشکارسازی تغییرات سه‌بعدی در مناطق شهری با استفاده از مدل UNet++

حسنلو, مهدی; ابراهیمی, علیرضا

doi:10.22061/jrsgr.2026.12415.1107

نوع مقاله : مقاله پژوهشی

نویسندگان

¹ دانشکده مهندسی نقشه برداری و اطلاعات مکانی، دانشکدگان فنی، دانشگاه تهران، تهران، ایران

² دانشکده مهندسی نقشه‌برداری و اطلاعات مکانی، دانشکدگان فنی، دانشگاه تهران، تهران، ایران

https://doi.org/10.22061/jrsgr.2026.12415.1107

چکیده

پیشینه و اهداف: امروزه با توسعه شهر نشینی، ضرورت به روزرسانی خودکار نقشه‌های تغییرات بیش از پیش اهمیت یافته است. به روزرسانی نقشه‌های مکانی، نظارت بر ساخت و ساز و پایش مناطق توسعه یافته نیازمند اطلاعات دقیق از بافت شهری و نقشه‌های جدید تغییرات می‎‌باشد. از آنجا که روش‌های سنتی پایش اغلب به تحلیل دو بعدی تغییرات پرداخته و حساسیت کافی در ثبت تغییرات بعد سوم یا ارتفاع را ندارند، لذا این محدودیت موجب عدم تشخیص ساخت و ساز‌های طبقه‌ای شده که در نتیجه آن، نقص اطلاعات در پایش را به همراه خواهد داشت. با پیشرفت در سنجش ازدور و روش‌های نوین یادگیری عمیق، آشکارسازی تغییرات سه بعدی شهری امکان پذیر شده است که تاکنون نتایج برتری را نسبت به روش‌های کلاسیک ارائه داده‌اند. این پژوهش با هدف بهبود عملکرد در آشکارسازی تغییرات سه بعدی شهری، رویکردی بر مبنای یادگیری عمیق با استفاده از داده‌هایی با منابع مختلف ارائه می‌دهد تا انواع تغییرات ساختمانی به طور خودکار شناسایی و تفکیک نماید. هدف اصلی تفکیک چهار نوع تغییر ساختمانی از جمله ساخت جدید، تخریب کامل، افزایش ارتفاع و کاهش ارتفاع در کنار نواحی بدون تغییر می‌باشد تا نقشه جامعی از تغییرات سه بعدی شهری ارائه گردد.
روش‌ها‌: مجموعه داده مورد استفاده در این پژوهش شامل تصاویر هوایی رنگی (RGB) با وضوح مکانی بالا به همراه داده‌های مدل رقومی سطح (DSM) مربوط به دو بازه‌ی زمانی مختلف از محدوده شهری والادولید اسپانیا می‌باشد. داده ورودی شامل تصاویر رنگی قبل و بعد از تغییر به همراه مدل ارتفاعی متناظر آن به صورت یک ورودی هشت بانده با یکدیگر ترکیب شده تا شبکه به طور همزمان اطلاعات طیفی و ارتفاعی را مشاهده کند و جهت ورود به فرآیند آموزش مدل، داده‌ها به نسبت 90 به 10درصد به دو دسته آموزش و آزمایش تقسیمبندی می‌شوند. جهت افزایش تنوع داده‌های آموزش و جلوگیری از برازش بیش از حد از روش‌هایی چون وارونسازی افقی و عمودی، چرخش تصادفی و تاری گاوسی بهره گرفته شده است. معماری مدل متشکل از یک شبکه‌ی عمیق ResNet-34به عنوان بخش ویژگی یاب و از شبکه UNet++ جهت بازسازی پیکسل به پیکسل تغییرات می‌باشد. همچنین برای به روزرسانی پارامترهای مدل از الگوریتم Adam بهره گرفته شده است. در مرحله نخست مدل شبکه عمیق به صورت دودویی (تغییر/عدم تغییر) آموزش داده شده و عملکرد آن در مقایسه با روش کلاسیک جنگل تصادفی، تفاضل یا نسبت گیری و روش ترکیبی تحلیل مولفه‌های اصلی (PCA) و خوشه بندی K-Means مقایسه می‌شود. پس از آن شبکه با برچسب‌های پنج کلاسه شامل چهار نوع تغییر گفته شده و برچسب عدم تغییر مجددا آموزش داده و جهت بهبود فرآیند یادگیری از یک تابع هزینه ساز بر مبنای بهینه سازی مستقیم معیار (IoU) استفاده شده است. جهت مقایسه عملکرد مدل‌ها از معیارهایی نظیر Accuracy، Recall، F1-score و Precision استفاده شده است.
یافته‌ها: در مرحله نخست این شبکه پس از 50 اپک آموزش اغلب تغییرات واقعی را شناسایی کرده و در عین حال نرخ هشدار غلط را پایین نگه داشته است که معیارهای ارزیابی Recall، Accuracy و F1-score با مقادیر 98.5% و 98.5% و 0.92 این نتایج را تأکید می‌کنند و این ارقام به طور محسوسی بهتر از روش‌های کلاسیک بوده‌. همچنین در مدل یادگیری عمیق بر خلاف دیگر روش‌ها تقریبا تمامی موارد ساخت و ساز یا تخریب در مقیاس‌های کوچک تشخیص داده شده است. در مرحله پنج کلاسه، تغییرات به خوبی توسط مدل شناسایی و طبقه بندی شده‌اند و معیارهای Recall، Accuracy و F1-score به ترتیب برابر با 96.32% و 96% و 0.95 می‌باشد. تمامی سازه‌های جدید احداث شده و ساختمان‌های کاملا تخریب شده در نقشه خروجی مدل با برچسب درست تشخیص داده شده اند و بخش وسیعی از مناطق بدون تغییر هیچ برچسب اشتباهی دریافت نکرده‌اند.
نتیجه‌گیری: نتایج نشان‌دهنده آن است که ترکیب داده‌های ارتفاعی با تصاویر دو بعدی و به کارگیری مدل‌های یادگیری عمیق به طرز چشمگیری کاستی‌های روش سنتی تشخیص تغییرات را برطرف کرده و افزایش دقت را به همراه خواهد داشت. این مدل توسعه یافته قادر است که نه تنها مکان تغییر بلکه نوع تغییر را شناسایی کند. لازم به ذکر است که از این شیوه نوین می‌توان جهت پایش ساخت و سازهای غیر مجاز، به روز رسانی پایگاه‌های داده مکانی و ارزیابی تغییرات شهری استفاده شود. مهمترین محدودیت این پژوهش وابستگی به داده‌های ارتفاعی دقیق می‌باشد که تهیه منظم آن در تمامی شهرها امکان پذیر نمی‌باشد. به علاوه آموزش مدل‌های عمیق نیازمند داده‌های برچسب دار گسترده و توان پردازشی بالا می‌باشد که ممکن است در کاربردهای عملی محدودیت ایجاد کند.

کلیدواژه‌ها

موضوعات

سنجش از دور

عنوان مقاله [English]

Multi-Class 3D Change Detection in Urban Areas Using UNet++

نویسندگان [English]

M. Hasanlou ¹
A. Ebrahimi ²

¹ School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran. Tehran. Iran

² School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran

چکیده [English]

Background and Objectives: With the rapid expansion of urbanization, the need for automatic updating of change maps has become increasingly important. Accurate and up-to-date spatial information is essential for monitoring construction activities and tracking the development of urban areas. Traditional approaches to change detection are mostly limited to two-dimensional analysis and often lack sensitivity to vertical changes. This shortcoming fails to identify multi-story constructions, thereby limiting the completeness of monitoring outcomes. Recent advances in remote sensing and deep learning have enabled three-dimensional urban change detection, providing superior results compared to classical methods. This study aims to improve the performance of 3D urban change detection by introducing a deep learning approach that integrates multi-source data. The primary objective is to automatically identify and distinguish four types of building-related changes—new construction, complete demolition, height increase, and height decrease—alongside unchanged areas, to generate a comprehensive 3D change map.
Methods: The dataset employed in this research consists of high-resolution RGB aerial imagery and corresponding Digital Surface Model (DSM) data acquired from two different periods over Valladolid, Spain. The input data were prepared by stacking RGB images and DSMs from both epochs into an eight-band input, allowing the network to jointly analyze spectral and elevation information. The dataset was divided into training (90%) and testing (10%) subsets. To increase variability in the training data and reduce overfitting, augmentation techniques such as horizontal and vertical flipping, random rotation, and Gaussian blurring were applied. The proposed model architecture combines a ResNet-34 backbone for feature extraction with a UNet++ decoder for pixel-level change reconstruction. Model parameters were updated using the Adam optimizer. In the first stage, the deep network was trained in a binary setting (change/no-change) and evaluated against classical approaches, including Random Forest, image differencing/ratioing, and a PCA–K-Means hybrid method. In the second stage, the network was retrained for five-class classification, including the four change categories and the unchanged class, using a loss function optimized directly for the Intersection-over-Union (IoU) metric. Model performance was assessed using Accuracy, Recall, Precision, and F1-score.
Findings: In the binary classification stage, after 50 epochs of training, the network successfully identified most real changes while maintaining a low false alarm rate. Evaluation metrics confirmed this performance, with Recall and Accuracy both reaching 98.5% and an F1-score of 0.92, considerably outperforming the classical methods. Unlike traditional approaches, the deep learning model was able to detect almost all small-scale constructions and demolitions. In the five-class stage, the model effectively identified and classified change types, achieving a Recall of 96.32%, an Accuracy of 96%, and an F1-score of 0.95. All newly constructed and fully demolished buildings were correctly labeled in the output maps, and a large proportion of unchanged areas received no misclassification.
Conclusion: The findings demonstrate that combining elevation data with 2D imagery and leveraging deep learning architectures significantly mitigates the limitations of traditional change detection approaches and enhances accuracy. The developed model is capable of detecting not only the location but also the type of change. This approach has strong potential applications in monitoring unauthorized constructions, updating spatial databases, and assessing urban development. However, its effectiveness relies on the availability of accurate DSM data, which may not be consistently accessible for all urban areas. Additionally, the training of deep networks requires extensive labeled datasets and considerable computational resources, which could limit their applicability in operational contexts.

کلیدواژه‌ها [English]