Academic Journal

Analysis of the PointNet neural network architecture

التفاصيل البيبلوغرافية
العنوان: Analysis of the PointNet neural network architecture
المؤلفون: L. A. Shchenyavskaya, D. A. Gura, R. A. Dyachenko
المصدر: Вестник Дагестанского государственного технического университета: Технические науки, Vol 50, Iss 4, Pp 158-165 (2024)
بيانات النشر: Dagestan State Technical University, 2024.
سنة النشر: 2024
المجموعة: LCC:Technology
مصطلحات موضوعية: point cloud, three-dimensional space, paintnet architecture, input data, spatial objects, classification and segmentation tasks, global function, data visualization, Technology
الوصف: Objective. Most researchers convert point cloud data into ordinary three-dimensional voxel grids or image collections, which makes the data unnecessarily voluminous and causes problems when processing them. The purpose of the study is to analyze the architecture of the PointNet neural network. Method. A unified approach has been applied to solving various 3D recognition problems, ranging from object classification, detail segmentation to semantic scene analysis. Result. A comparative analysis of the classification of 2d and 3d objects was carried out, the layers and functions through which classification occurs were studied in detail. A type of neural network is considered that directly uses point clouds, which takes into account the invariance of permutations of points in the input data. The network is determined to provide a unified architecture for applications ranging from object classification, part segmentation, and scene semantics. For semantic segmentation, the input data can be either a single object from the part area segmentation or a small part of the 3D scene. A neural network that is widely used for raster image editing, graphic design, and digital art is a deep point cloud architecture called PointNet. Conclusion. A new deep point cloud architecture, PointNet, is introduced. For object classification task, the input point cloud is directly selected from the shape or pre-segmented from the scene point cloud. To obtain a virtual model of the real world, neural network solutions are used, based on the assumption that there is an RGB point cloud obtained by an RGB-D camera from one or several angles.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: Russian
تدمد: 2073-6185
2542-095X
Relation: https://vestnik.dgtu.ru/jour/article/view/1404; https://doaj.org/toc/2073-6185; https://doaj.org/toc/2542-095X
DOI: 10.21822/2073-6185-2023-50-4-158-165
URL الوصول: https://doaj.org/article/45bc6cbdc04d44b89bfb6d293aea4fba
رقم الانضمام: edsdoj.45bc6cbdc04d44b89bfb6d293aea4fba
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:20736185
2542095X
DOI:10.21822/2073-6185-2023-50-4-158-165