Learning Physically Realizable Skills for Online Packing of General 3D Shapes,ACM Transactions on Graphics

当前位置： X-MOL 学术 › ACM Trans. Graph. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Learning Physically Realizable Skills for Online Packing of General 3D Shapes
ACM Transactions on Graphics ( IF 6.2 ) Pub Date : 2023-06-06 , DOI: https://dl.acm.org/doi/10.1145/3603544
Hang Zhao, Zherong Pan, Yang Yu, Kai Xu

We study the problem of learning online packing skills for irregular 3D shapes, which is arguably the most challenging setting of bin packing problems. The goal is to consecutively move a sequence of 3D objects with arbitrary shapes into a designated container with only partial observations of the object sequence. Meanwhile, we take physical realizability into account, involving physics dynamics and constraints of a placement. The packing policy should understand the 3D geometry of the object to be packed and make effective decisions to accommodate it in the container in a physically realizable way. We propose a Reinforcement Learning (RL) pipeline to learn the policy. The complex irregular geometry and imperfect object placement together lead to huge solution space. Direct training in such space is prohibitively data intensive. We instead propose a theoretically-provable method for candidate action generation to reduce the action space of RL and the learning burden. A parameterized policy is then learned to select the best placement from the candidates. Equipped with an efficient method of asynchronous RL acceleration and a data preparation process of simulation-ready training sequences, a mature packing policy can be trained in a physics-based environment within 48 hours. Through extensive evaluation on a variety of real-life shape datasets and comparisons with state-of-the-art baselines, we demonstrate that our method outperforms the best-performing baseline on all datasets by at least \(12.8\% \) in terms of packing utility. We also release our datasets and source code to support further research in this direction.

中文翻译：

学习一般 3D 形状在线打包的物理可实现技能

我们研究了学习不规则 3D 形状在线打包技巧的问题，这可以说是装箱问题中最具挑战性的设置。目标是将一系列具有任意形状的 3D 对象连续移动到指定的容器中，仅对对象序列进行部分观察。同时，我们考虑了物理可实现性，涉及物理动力学和布局约束。打包策略应该了解要打包的对象的 3D 几何形状，并做出有效的决定以物理上可实现的方式将其容纳在容器中。我们提出了一个强化学习 (RL) 管道来学习该策略。复杂的不规则几何形状和不完美的物体放置一起导致了巨大的解决空间。在这样的空间中直接训练是非常密集的数据。相反，我们提出了一种理论上可证明的候选动作生成方法，以减少 RL 的动作空间和学习负担。然后学习参数化策略以从候选者中选择最佳位置。配备高效的异步 RL 加速方法和模拟就绪训练序列的数据准备过程，可以在 48 小时内在基于物理的环境中训练成熟的打包策略。通过对各种现实生活中的形状数据集进行广泛评估并与最先进的基线进行比较，我们证明我们的方法在所有数据集上至少优于 \(12.8\% \)包装实用程序。我们还发布了我们的数据集和源代码，以支持在这个方向上的进一步研究。然后学习参数化策略以从候选者中选择最佳位置。配备高效的异步 RL 加速方法和模拟就绪训练序列的数据准备过程，可以在 48 小时内在基于物理的环境中训练成熟的打包策略。通过对各种现实生活形状数据集的广泛评估以及与最先进的基线的比较，我们证明我们的方法在所有数据集上至少优于 \(12.8\% \)包装实用程序。我们还发布了我们的数据集和源代码，以支持在这个方向上的进一步研究。然后学习参数化策略以从候选者中选择最佳位置。配备高效的异步 RL 加速方法和模拟就绪训练序列的数据准备过程，可以在 48 小时内在基于物理的环境中训练成熟的打包策略。通过对各种现实生活形状数据集的广泛评估以及与最先进的基线的比较，我们证明我们的方法在所有数据集上至少优于 \(12.8\% \)包装实用程序。我们还发布了我们的数据集和源代码，以支持在这个方向上的进一步研究。配备高效的异步 RL 加速方法和模拟就绪训练序列的数据准备过程，可以在 48 小时内在基于物理的环境中训练成熟的打包策略。通过对各种现实生活中的形状数据集进行广泛评估并与最先进的基线进行比较，我们证明我们的方法在所有数据集上至少优于 \(12.8\% \)包装实用程序。我们还发布了我们的数据集和源代码，以支持在这个方向上的进一步研究。配备高效的异步 RL 加速方法和模拟就绪训练序列的数据准备过程，可以在 48 小时内在基于物理的环境中训练成熟的打包策略。通过对各种现实生活中的形状数据集进行广泛评估并与最先进的基线进行比较，我们证明我们的方法在所有数据集上至少优于 \(12.8\% \)包装实用程序。我们还发布了我们的数据集和源代码，以支持在这个方向上的进一步研究。我们证明了我们的方法在所有数据集上的性能优于基线至少 \(12.8\% \) 在打包效用方面。我们还发布了我们的数据集和源代码，以支持在这个方向上的进一步研究。我们证明了我们的方法在所有数据集上的性能优于基线至少 \(12.8\% \) 在打包效用方面。我们还发布了我们的数据集和源代码，以支持在这个方向上的进一步研究。

更新日期：2023-06-06

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>