2026-06-21 期

本期机器人顶刊精选

本期检索 2026-03-23 至 2026-06-21 期间上线的 663 篇机器人领域文献，覆盖 Science Robotics、IEEE T-RO、IJRR、RA-L、Autonomous Robots 与 Journal of Field Robotics。整体来看，「机器人基础模型 / 大模型」热度持续走高——从四旋翼基础策略 RAPTOR、到对大行为模型(LBM)灵巧操作的严谨实证、再到面向集群与多机器人的基础模型展望；具身硬件同样亮眼：3.6 米弹跳机器人、电流体纤维人工肌肉、登月可变形漫游车 SORA-Q 与微创脊柱手术机器人。以下为编辑精选 8 篇，以及 11 个研究方向各自的重点推荐。

共 663 篇RA-L · 465JFR · 64T-RO · 58Sci. Robotics · 32IJRR · 26AuRo · 18

Editor's Picks编辑精选

Sci. Robotics 2026-06-10

Precise aggressive aerial maneuvers with sensorimotor policies

Tianyue Wu, Guangtong Xu, Zihan Wang, Junxiao Lin, Tianyang Chen, Yuze Wu, Zhichao Han, Zhiyang Liu, et al.

仅凭机载轻量传感器，端到端感知-运动策略让四旋翼以 SE(3) 倾斜姿态高速穿越狭窄缝隙，把激进机动从依赖外部动捕推进到完全自主。

看点把「极限敏捷飞行」从实验室动捕条件搬到真实机载传感，是无人机自主性的标志性进展。

无人机 / 空中机器人导航 / SLAM / 自动驾驶机器人学习感知与传感

摘要 Abstract

Precise aggressive maneuvers with lightweight onboard sensors remain a key bottleneck in fully exploiting the maneuverability of drones. Such maneuvers are critical for expanding the systems’ accessible area by navigating through narrow openings in the environment. One of the most relevant problems is aggressive traversal through narrow gaps with quadrotors under constraints in the special Euclidean group of three dimensions [ SE ( 3 ) ], which requires the quadrotors to leverage a momentary tilted attitude and the asymmetry of the airframes to navigate through gaps. Here, we achieved such maneuvers by developing sensorimotor policies directly mapping onboard vision and proprioception into low-level control commands. The policies were trained using reinforcement learning (RL) with end-to-end policy distillation in simulation. We mitigated the model-free RL’s exploration challenge on the restricted solution space with an initialization strategy leveraging trajectories generated by a model-based planner. Careful sim-to-real design allowed the policy to control a quadrotor through narrow gaps with low clearances and high repeatability. For instance, the proposed method enabled a quadrotor to navigate a rectangular gap at a 5-centimeter clearance, tilted at an orientation up to 90°, without knowledge of the gap’s position or orientation. Without training on dynamic gaps, the policy could reactively servo the quadrotor to traverse through a moving gap. The proposed method was validated on challenging tracks of narrow, closely placed gaps. The flexibility of the policy learning method was demonstrated by developing policies on geometrically diverse gaps without relying on manually defined traversal poses and visual features.

Sci. Robotics 2026-05-13

RAPTOR: A foundation policy for quadrotor control

Jonas Eschmann, Dario Albani, Giuseppe Loianno

RAPTOR 训练出单一四旋翼「基础控制策略」，无需重新系统辨识即可零样本适配从未见过的机体与扰动，直击 RL 策略过拟合单一环境的痛点。

看点用「基础模型」思路解决控制策略的泛化与 sim-to-real，是本季最值得关注的范式之一。

无人机 / 空中机器人机器人学习

摘要 Abstract

Humans are remarkably data efficient when adapting to previously unseen conditions, like driving a new car. In contrast, modern robotic control systems, like neural network policies trained using reinforcement learning (RL), are highly specialized for single environments. Because of this overfitting, they are known to break down even under small differences like the simulation-to-reality gap and require system identification and retraining for even minimal changes to the system. Here, we present RAPTOR, a method for training a highly adaptive foundation policy for quadrotor control. Our method enables training a single, end-to-end neural network policy to control a wide variety of quadrotors. We tested 10 different real quadrotors, from 32 grams to 2.4 kilograms, that also differed in motor type (brushed versus brushless), frame type (soft versus rigid), propeller type (two, three, or four blades), and flight controller (PX4, Betaflight, Crazyflie, M5StampFly). We found that a tiny, three-layer policy with only 2084 parameters was sufficient for zero-shot adaptation to a wide variety of platforms. The adaptation through in-context learning was made possible by using a recurrence in the hidden layer. The policy was trained through our proposed meta-imitation learning algorithm, where we sampled 1000 quadrotors and trained a teacher policy for each of them using RL. The 1000 teachers were distilled into a single, adaptive student policy. We found that within milliseconds, the resulting foundation policy adapted zero-shot to unseen quadrotors. We tested the capabilities of the foundation policy under numerous conditions (trajectory tracking, indoor/outdoor, wind disturbance, poking, and different propellers).

Sci. Robotics 2026-04-15

A careful examination of large behavior models for multitask dexterous manipulation

Jose Barreiros, Andrew Beaulieu, Aditya Bhat, Rick Cory, Eric Cousineau, Hongkai Dai, Ching-Hsin Fang, Kunimatsu Hashimoto, et al.

对多任务灵巧操作的大行为模型(LBM)做了一次罕见严谨的真机评测，量化其真实能力与边界，为「通用机器人基础模型」的炒作降温、为评测立标。

看点在一片乐观叙事中提供了扎实可复现的实证基准，研究者必读。

操作与机械臂机器人学习感知与传感

摘要 Abstract

Robot manipulation has seen tremendous progress in recent years, with imitation learning policies enabling successful performance of dexterous and hard-to-model tasks. Concurrently, scaling data and model size has led to the development of capable language and vision foundation models, motivating large-scale efforts to create general-purpose robot foundation models. Although these models have garnered considerable enthusiasm and investment, meaningful evaluation of real-world performance remains a challenge, limiting the pace of development and inhibiting a nuanced understanding of current capabilities. Here, we rigorously evaluated multitask robot manipulation policies, referred to as large behavior models, by extending the diffusion policy paradigm across a corpus of simulated and real-world robot data. We proposed and validated an evaluation pipeline to rigorously analyze the capabilities of these models with statistical confidence. We compared against single-task baselines through blind, randomized trials in a controlled setting, using both simulation and real-world experiments. We found that multitask pretraining made the policies more successful and robust and enabled teaching complex new tasks more quickly, using a fraction of the data when compared with single-task baselines. Moreover, performance predictably increased as pretraining scale and diversity grows.

T-RO 2026-06-08

Parallel-Elastic Actuation with Reactive Latch Elevates Robotic Hopping Performance: Jump Height and Continuity

Songnan Bai, Runze Ding, Song Li, Ruihan Jia, Ruobing Wang, Zhiyuan Zhang, Fangzheng Wang, Pakpong Chirarattananon

并联弹性驱动 + 反应式闩锁机构优化能量收放，使腿式机器人最高弹跳达 3.6 米，连续性与高度同时刷新人类与动物纪录。

看点机构创新带来数量级性能跃升，给足式/弹跳机器人树立了新标杆。

无人机 / 空中机器人足式 / 四足机器人控制与动力学

摘要 Abstract

While many animals exhibit impressive hopping capabilities, machines have struggled to match their performance. Current hopping robots face limitations in power density, energy efficiency, and control stability. Here, we present a parallel-elastic actuation mechanism with a reactive latch that optimizes energy transfer, enabling a legged robot to achieve hopping heights and continuity previously unattainable. This mechanism efficiently stores and releases energy, extending the actuation period over the aerial phase while minimizing stance time. Our robot achieves a maximum hopping height of 3.6 meters, surpassing both human and animal records while demonstrating sustained, high-frequency hopping cycles with minimal power requirement. By integrating inertia-based onboard sensorimotor autonomy, we demonstrate stable, controlled hopping in environments without external aid. These results represent a step toward bridging the performance gap between biological and robotic locomotion, with potential to influence the design of future legged systems.

Sci. Robotics 2026-06-10

From ball to rover: Transformable palm-sized rover SORA-Q for autonomous lunar exploration

D. Hirano, M. Inazawa, M. Sutoh, M. Nagata, Y. Yoneda, K. Watanabe, H. Sawada, G. Sakoda, et al.

掌心大小、可由球形变形为两轮漫游的 SORA-Q，在严苛载荷与算力约束下实现自主月面探测，是真正飞向月球的微型机器人。

看点从工程约束到真实任务落地的完整闭环，行星探测机器人的代表作。

足式 / 四足机器人导航 / SLAM / 自动驾驶人机交互 / 遥操作

摘要 Abstract

Robotic technologies are expected to drive substantial advancements in planetary exploration and resource prospecting by performing a variety of tasks in extraterrestrial environments. In particular, miniature robots are ideally suited for integration into spacecraft with strict payload limitations, providing a cost-effective solution. However, the pursuit of autonomous exploration using these miniature robots presents challenges owing to constraints in computational power and battery capacity and reduced locomotion performance owing to their small size. Here, we introduce a two-wheeled centimeter-scale rover, designated Lunar Excursion Vehicle 2 (LEV-2), also known as SORA-Q (named after the Japanese words for space and sphere), which transforms into a wheeled configuration from a compact spherical form, enabling efficient traversal of soft lunar terrains. On 19 January 2024 (universal time coordinated), LEV-2 was deployed from a Japanese lunar lander, Smart Lander for Investigating Moon (SLIM), immediately before its landing on the lunar surface. After a lunar landing, the palm-sized rover accomplished autonomous lunar exploration by navigating around the SLIM lander, capturing images of both the SLIM lander and its environment and transmitting selected images through wireless communication on the lunar surface without reliance on ground-based teleoperation. This study details the system design of LEV-2 and presents the results of its in situ lunar activities, highlighting the efficacy of the proposed technologies necessary for mission implementation. Furthermore, we discuss the technical challenges encountered during the mission, including operational constraints and partial data loss, as well as the lessons learned for future exploration missions using small-scale space robots.

Sci. Robotics 2026-05-20

A minimally invasive robotic spinal surgical system for anterior lumbar nerve decompression

Qingxiang Zhao, Xiandi Wang, Xin Zhong, Runfeng Zhu, Peizhi Zhou, Dan Pu, Baitao Lin, Tao Li, et al.

面向前路腰椎神经减压的微创手术机器人系统，以更高远端灵活度与可视性弥补传统前路术式视野受限、减压不彻底的不足。

看点把临床痛点与机器人灵巧设计紧密结合，医疗机器人转化的优秀样本。

操作与机械臂导航 / SLAM / 自动驾驶机器人学习医疗 / 软体 / 微纳

摘要 Abstract

Lumbar degenerative diseases, primarily caused by pathological tissues compressing spinal nerves, typically necessitate surgical intervention—specifically lumbar nerve decompression—to alleviate pain. Although the anterior decompression approach demonstrates notable advantages, such as reduced bleeding and shorter postoperative hospitalization stays, compared with the conventional posterior approach, patients may still experience incomplete decompression because of various instrumental shortcomings, including restricted visibility and insufficiency of distal dexterity. In this study, we present a robotic surgical system for minimally invasive anterior lumbar nerve decompression, which comprises three slender robotic arms (2 millimeters in outer diameter) with high dexterity (18 degrees of freedom), facilitating effective navigation through the narrow intervertebral disc space to reach the posterior area. Each robot arm is based on concentric push-pull robot structure, forming three robotized instruments: an endoscope for visualization, a laser optical fiber for hemostasis and resection, and a gripper for tissue manipulation. These components are integrated through the hollow lumen of a slender trocar, and multi-instrument coordination enables effective decompression procedure with wide view. System performance was first validated using a three-dimensional–printed vertebral phantom model to confirm accessibility to bilateral articular processes. Subsequently, in vivo animal experiment and human cadaver tests were conducted to further demonstrate the full capabilities in performing minimally invasive lumbar nerve decompression. This study demonstrates the potential of the robotic system to facilitate surgical procedures in narrow, confined, and tortuous anatomical spaces, addressing the key limitations of conventional instruments in anterior lumbar nerve decompression.

Sci. Robotics 2026-05-20

Extreme dynamic symmetry enables omnidirectional and multifunctional robots

Jiaxun Liu, Boxi Xia, Boyuan Chen

提出「动态对称性 / 动态各向同性」设计原则：质心可达加速度越均匀，机器人在 1000+ 仿真形态中的轨迹跟踪、鲁棒性与能效越好。

看点把对称性从几何外形提升为「动力学能力」的统一设计语言，思想性强。

足式 / 四足机器人感知与传感控制与动力学

摘要 Abstract

Symmetry is a central organizing principle in natural systems, yet its use as a unifying design strategy in robotics has largely remained limited to geometric form. We show that symmetry can instead be leveraged at the level of dynamic actuation capability. We introduce dynamic symmetry, the uniformity of a robot’s attainable center-of-mass accelerations, and formalize it through a measure coined as dynamic isotropy. Across more than 1000 simulated morphologies, we found that higher dynamic symmetry consistently improved trajectory tracking, task success, robustness, resiliency, and energy efficiency, with the benefits becoming most pronounced as dynamic isotropy approached its theoretical limit. To study this regime systematically, we developed Argus, a family of spherical robots designed to explore the effects of increasing dynamic symmetry. Members of the Argus family vary in their actuation geometry and dynamic symmetry level while sharing a common architectural principle: radially oriented linear actuators that directly shape the robot’s center-of-mass dynamics. Among them, we built a physical 20-leg Argus variant that achieved near-extreme dynamic isotropy and demonstrated orientation-invariant locomotion, agile traversal of cluttered and deformable terrain, rapid self-stabilization, and resilience to partial actuator failures. Its distributed sensing further enabled omnidirectional perception and object interaction during continuous motion. These results show that designing robots for symmetry not only in morphology but also in their attainable dynamics provides a powerful and general pathway toward agility, robustness, and multifunctionality in uncertain terrestrial and extraterrestrial environments.

Sci. Robotics 2026-03-25 · 被引 1

Electrofluidic fiber muscles

O. K. Afsar, G. Pupillo, G. Vitucci, W. Babatain, H. Ishii, V. Cacucciolo

电流体纤维人工肌肉具备与骨骼肌相当的功率密度(50 W/kg)、20% 收缩率与 0.3 s 响应，且纤维形态可模块化、密集集成。

看点软体驱动长期受限于功率密度与集成度，这项工作直指核心瓶颈。

控制与动力学

摘要 Abstract

Actuators are to robots what muscles are to humans. They enable motion and determine strength and dexterity. The fiber form factor makes skeletal muscles modular, scalable, and densely integrated (50% of human body weight). In contrast, servo motors that drive today’s robots lack the flexibility and modularity of muscle fibers, limiting integration and dexterity. Here, we report electrofluidic fiber muscles, soft artificial muscles for robotic applications with power density comparable to skeletal muscles (50 watts per kilogram), contraction strains of 20%, and response time of 0.3 second. These 2-millimeter-thick muscles comprise antagonistic fluidic actuators driven by electrohydrodynamic fiber pumps in a closed circuit. They require no external liquid reservoir and are electrically driven, untethered, and silent. We demonstrated that performance is increased by pre-pressurizing the muscles at an optimal bias pressure. Applying bias pressure allowed the antagonist actuator to act as a reservoir for the agonist, enabled 200% higher operating voltages by preventing cavitation, and leveraged the nonlinear pressure-stroke response of the actuators, increasing strain threefold at a given pump pressure. We characterized and modeled their dynamics, identifying optimal bias pressures. Electrofluidic muscles scale by simply bundling fibers. By selecting the ratio between pumps and actuators, we programmed their performance for different robotic tasks: a fast lever (180 millimeters per second) that launches objects in <0.3 second; a strong bundle that lifts 4 kilograms (200 times its weight) with a 30-millimeter stroke; a woven muscle that bends a robot arm by 40° and is compliant enough for a human handshake.

By Direction分方向重点

🛸无人机 / 空中机器人 Aerial Robots & UAVs64 篇

Sci. Robotics 2026-06-10

Precise aggressive aerial maneuvers with sensorimotor policies

Tianyue Wu, Guangtong Xu, Zihan Wang, Junxiao Lin, Tianyang Chen, Yuze Wu, Zhichao Han, Zhiyang Liu, et al.

仅凭机载轻量传感器，端到端感知-运动策略让四旋翼以 SE(3) 倾斜姿态高速穿越狭窄缝隙，把激进机动从依赖外部动捕推进到完全自主。

看点把「极限敏捷飞行」从实验室动捕条件搬到真实机载传感，是无人机自主性的标志性进展。

无人机 / 空中机器人导航 / SLAM / 自动驾驶机器人学习感知与传感

摘要 Abstract

Sci. Robotics 2026-05-13

RAPTOR: A foundation policy for quadrotor control

Jonas Eschmann, Dario Albani, Giuseppe Loianno

RAPTOR 训练出单一四旋翼「基础控制策略」，无需重新系统辨识即可零样本适配从未见过的机体与扰动，直击 RL 策略过拟合单一环境的痛点。

看点用「基础模型」思路解决控制策略的泛化与 sim-to-real，是本季最值得关注的范式之一。

无人机 / 空中机器人机器人学习

摘要 Abstract

Sci. Robotics 2026-03-25 · 被引 1

Milliwatt ultrasound for navigation in visually degraded environments on palm-sized aerial robots

Manoj Velmurugan, Phillip Brush, Colin Balfour, Richard J. Przybyla, Nitin J. Sanket

Hesheng Wang, Michael Wang, Frank Park, Lijun Han, Huichan Zhao, XingXing Wang, Andra Keay, Shigeki Sugano, et al.

2025 IROS(浙江)现场辩论实录：人形机器人是否会很快取代大多数人类工作？正反双方观点集中呈现。

Sci. Robotics 2026-04-15

A careful examination of large behavior models for multitask dexterous manipulation

Jose Barreiros, Andrew Beaulieu, Aditya Bhat, Rick Cory, Eric Cousineau, Hongkai Dai, Ching-Hsin Fang, Kunimatsu Hashimoto, et al.

对多任务灵巧操作的大行为模型(LBM)做了一次罕见严谨的真机评测，量化其真实能力与边界，为「通用机器人基础模型」的炒作降温、为评测立标。

看点在一片乐观叙事中提供了扎实可复现的实证基准，研究者必读。

操作与机械臂机器人学习感知与传感

摘要 Abstract

Sci. Robotics 2026-04-29

A retrieval-augmented framework enabling VLM spatial awareness for object-centric robot manipulation

Kai Chen, Chengkun Li, Chang Tu, Jiahui Pan, Yiyao Ma, Wei Chen, Zhongxiang Zhou, Xuecheng Xu, et al.

RAM 以「检索增强 + 物体中心」方式，把视觉-语言模型的语义推理接入操作所需的精确几何，弥合语义到几何的鸿沟。

看点VLM 落地操作的关键一步：让大模型「会摆放」而不仅「会描述」。

操作与机械臂机器人学习感知与传感

摘要 Abstract

Connecting the semantic reasoning of vision-language models (VLMs) to the precise geometric demands of robotic manipulation remains a fundamental challenge. Although VLMs can interpret high-level commands, they lack the intrinsic spatial intelligence required for tasks demanding precise object placement, orientation, and physical reasoning. Here, we introduce Retrieval-Augmented Manipulation (RAM), an object-centric framework that endows general-purpose vision foundation models with the spatial reasoning necessary for robust manipulation. RAM bridges the semantic-to-geometric gap by grounding abstract concepts into an explicit, object-centric three-dimensional (3D) representation. This grounded information is then provided as augmented context to the VLM, empowering it to decompose complex instructions into a sequence of spatially precise and physically plausible subgoals. We demonstrate that RAM, in a zero-shot setting on a real-world robot, can execute these subgoals to fulfill complex spatial language instructions, complete spatially aware manipulation under the guidance of a single 2D image, and adaptively replan tasks by reasoning about physical constraints like object size and collisions. Quantitative evaluations on the Common Object in 3D (CO3D) dataset also validated that RAM’s core vision module generalizes to previously unseen object categories and is robust to variations in shape and occlusions. By providing a structured bridge between semantic intent and geometric execution, RAM represents a critical step toward developing more physically intelligent and general-purpose robotic systems.

Sci. Robotics 2026-04-29

Dexterous grasping with an active palm

Amos Matsiko

带主动掌心的触觉响应式夹爪，实现自适应抓取与更高自由度的灵巧操作。

看点手掌从被动支撑变为主动自由度，重新定义夹爪的灵巧上限。

操作与机械臂感知与传感

摘要 Abstract

A tactile-responsive gripper with an active palm enables adaptive grasping and dexterous manipulation of objects.

🧭导航 / SLAM / 自动驾驶 Navigation, SLAM & Driving240 篇

Sci. Robotics 2026-06-10

From ball to rover: Transformable palm-sized rover SORA-Q for autonomous lunar exploration

D. Hirano, M. Inazawa, M. Sutoh, M. Nagata, Y. Yoneda, K. Watanabe, H. Sawada, G. Sakoda, et al.

掌心大小、可由球形变形为两轮漫游的 SORA-Q，在严苛载荷与算力约束下实现自主月面探测，是真正飞向月球的微型机器人。

看点从工程约束到真实任务落地的完整闭环，行星探测机器人的代表作。

足式 / 四足机器人导航 / SLAM / 自动驾驶人机交互 / 遥操作

摘要 Abstract

IJRR 2026-06-10 · 被引 3

EgoExo++: Integrating on-demand exocentric visuals with 2.5D ground surface estimation for interactive teleoperation of underwater ROVs

Adnan Abdullah, Ruo Chen, Ioannis Rekleitis, Md Jahidul Islam

EgoExo++ 在视觉 SLAM 管线中按需从第一人称画面合成第三人称视角，并结合 2.5D 地面估计，提升水下 ROV 遥操作的态势感知与操控精度。

看点用视角合成破解遥操作视野受限，水下交互的实用创新。

无人机 / 空中机器人导航 / SLAM / 自动驾驶人机交互 / 遥操作

摘要 Abstract

Underwater ROVs (Remotely Operated Vehicles) are indispensable for subsea exploration and task execution, yet typical teleoperation engines based on egocentric (first-person) video feeds restrict human operators’ field-of-view and limit precise maneuvering in complex, unstructured underwater environments. To address this, we first propose EgoExo, a geometry-driven solution integrated into a visual SLAM pipeline that synthesizes on-demand exocentric (third-person) views from egocentric camera feeds. We further propose EgoExo++, which extends beyond 2D exocentric view synthesis (EgoExo) to augment a piecewise-planar 2.5D ground surface estimation on-the-fly. Its anchor-free aerial viewpoint supports ground-relative reasoning, such as clearance and terrain-based navigation marker following. The computations involved are closed-form and rely solely on egocentric views and monocular SLAM estimates, which makes it portable across existing teleoperation engines and robust to varying waterbody characteristics. We validate the geometric accuracy of our approach through extensive experiments of 2-DOF indoor navigation and 6-DOF underwater cave exploration in challenging low-light conditions. To assess operational benefits, we conduct two user studies with simulation and real-world data, each involving 15 participants, comparing baseline egocentric teleoperation and EgoExo++. Results indicate improved system usability (SUS), reduced perceived workload (NASA-TLX), and significant gains in objective teleoperation performance, including 16% faster missions, 5-fold reduction in path deviation ratio, and fewer collision events (2 vs 5 across trials). Furthermore, we highlight the role of EgoExo++ augmented visuals in supporting shared autonomy, operator training, and embodied teleoperation. This new interactive approach to ROV teleoperation presents promising opportunities for future research in subsea telerobotics. The source packages for EgoExo++ are available at: https://github.com/uf-robopi/EgoExo .

T-RO 2026-06-02

LiDAR Teach, Radar Repeat: Robust Cross-Modal Navigation in Degenerate and Varying Environments

Renxiang Xiao, Yichen Chen, Yuanfan Zhang, Qianyi Shao, Yushuai Chen, Yuxuan Han, Yunjiang Lou, Liang Hu

「激光雷达示教、毫米波雷达重复」的跨模态示教-重复导航，在退化与多变(恶劣天气)环境中保持长期鲁棒自主。

T-RO 2026-06-08

Zero-Shot Sim-to-Real 6-DoF Pose Estimation for Underwater Vehicles Based on Uncertainty-Guided Dense Correspondence

Qingbo Wei, Yi Yang, Xingqun Zhou, Zhiqiang Hu, Chuanzhi Fan, Quan Zheng, Zhichao Wang

ZUPose 仅用合成数据训练即可零样本部署于真实水下场景的单目 6-DoF 位姿估计，以不确定性引导的稠密对应应对 sim-to-real 与水下光学退化。

看点算法-数据协同设计破解水下标注稀缺，多 AUV 协同的基础件。

导航 / SLAM / 自动驾驶机器人学习感知与传感多机器人 / 集群

摘要 Abstract

Accurate 6-DoF relative pose estimation is essential for multi-AUV cooperative tasks. However, pose-annotated underwater data are difficult to obtain, limiting learning-based methods. We present ZUPose, a monocular pose estimator trained entirely on synthetic data and deployed directly in real underwater scenes. A key challenge is prediction noise introduced by the sim-to-real gap and underwater optical degradation. To address it, we adopt an algorithm-data co-design strategy. At the algorithm level, we develop an uncertainty-guided densecorrespondence framework in which the network jointly predicts dense correspondences and per-pixel uncertainty under a Laplace-based probabilistic formulation. The predicted uncertainty acts as a learned scale parameter to model correspondence noise and guide pose optimization. At the data level, we construct a physics-guided simulation pipeline to model underwater optical degradation and generate diverse synthetic images. In real turbid water, ZUPose achieves translation and rotation errors of 6.7 cm and 7.7$^\circ$, with both reduced by about half compared with the best-performing baseline. The method remains stable under overexposure and long-range observation, and dual-AUV navigation experiments further validate its practical viability.

Sci. Robotics 2026-05-20

Fusing LiDAR and vision to generate high-quality reconstructions

Amos Matsiko

基于神经辐射场、融合激光雷达与视觉的重建框架，兼顾几何精度与高质量外观。

看点把 LiDAR 的几何与相机的纹理在 NeRF 框架中统一，重建质量上台阶。

感知与传感

摘要 Abstract

A neural radiance field–based reconstruction framework merging LiDAR and vision data achieves geometric accuracy.

Sci. Robotics 2026-03-25 · 被引 1

The forgotten spectrum: Reviving ultrasound for robust autonomy

Xin Zhou, Fei Gao

观点文章：被视为「过时」的超声配合边缘 AI 去噪，可在视觉失效时显著提升自主系统的鲁棒性。

Sci. Robotics 2026-05-13

Cross-link collective: Entangled robotic matter with cohesive motion

Danna Ma, Baxi Chong, Daniel I. Goldman, Kirstin H. Petersen

受活性凝胶交联启发的「交联集体」：物理缠绕的机器人物质在无固定连接、无显式协调下保持内聚并协同运动。

看点用物理缠绕换取鲁棒可扩展的群体行为，集群机器人的新形态。

多机器人 / 集群

摘要 Abstract

Robotic applications increasingly demand systems that are resilient, adaptable, and scalable. One promising route is through collectives of simple modules, where complex group-level behavior emerges from local interactions. By omitting fixed topologies and tight coordination, this approach sacrifices predictability and conventional tools for behaviors inherently optimized through stochastic mechanical interactions. A key challenge is maintaining cohesion and functionality without fixed connections and explicit coordination. We introduce the cross-link collective, a physically entangled robotic system inspired by cross-linking in active gels. Through shape morphing and transient entanglement, individually immobile modules produce sustained collective motion. The mechanically intelligent robot matter favors chains and phase relationships that reduce joint torques and reconfigures in response to perturbations. We show that distributed control can be added to this substrate to further enhance cohesion. Leveraging weak, reversible connections, the cross-link collective is adaptable, scalable, and fault tolerant, offering insights to applications from soft matter and robotics.

AuRo 2026-06-10 · 被引 4

Large language models for multi-robot systems: a survey

Peihan Li, Zijian An, Shams Abrar, Lifeng Zhou

首篇系统梳理大语言模型在多机器人系统中应用的综述，按高层任务分配、中层运动规划、低层动作生成与人类干预分层归纳。

看点为「LLM × 多机器人」这一快速膨胀的方向提供清晰地图。

导航 / SLAM / 自动驾驶机器人学习多机器人 / 集群人机交互 / 遥操作

摘要 Abstract

T-RO 2026-06-04

Fault-Tolerant Multi-Modal Localization of Multi-Robots on Matrix Lie Groups

Mahboubeh Zarei, Robin Chhabra

在矩阵李群上提出容错多模态多机器人定位框架，给出李群上相关/非相关估计的复合、求差、求逆、平均与融合等随机运算。

T-RO 2026-04-22 · 被引 1

Optimal Energy Shaping and Force Amplification Framework for Task-Agnostic, Biomimetic Ankle Exoskeletons

Katharine Walters, Gray C. Thomas, Robert D. Gregg

面向可背驱下肢/踝外骨骼的任务无关控制：以最优能量塑形与力放大框架可解释地逼近生物力矩，并兼顾安全性。

看点在外骨骼控制的可解释性与安全性之间给出兼顾方案。

机器人学习医疗 / 软体 / 微纳控制与动力学

摘要 Abstract

Task-agnostic controllers for backdrivable lower-limb exoskeletons aim to reliably mimic biological torque while seamlessly adapting to changing movement patterns. However, current approaches relying on hidden state estimators or neural networks lack explainability and safety guarantees, while force amplification methods risk instability with an inherent tradeoff between sensitivity and robustness to control inputs. Energy shaping control uses a kinematic model-based framework to provide predictable, stable assistance, though its traditional passive form limits biomimetic performance. Previous work relaxed the strict passivity requirements to improve biomimicry but reduced the stability guarantees. This paper presents an optimization-based extension of the energy-shaping control framework that combines the stability benefits of energy shaping with the intuitive biomimicry of force amplification. Our framework enables controlled trade-offs between sensitivity to changing human impedance and high performance through adjustable cost contributions of force amplification and model-based terms. We provide theoretical guarantees of closed-loop stability to an invariant set under human joint impedance control, supported by empirical validation of stability characteristics of an ankle exoskeleton under varying controller passivity constraints. A study of ten able-bodied participants using bilateral ankle exoskeletons demonstrates that the biomimetic controller reduced biological ankle torque by 18.7% across various activities of daily life.

IJRR 2026-06-18

Energy-optimal linear quadratic tracking control for unmanned underwater vehicles in offshore aquaculture fish net-pen visual inspection

Thein Than Tun, Loulin Huang, Mark Anthony Preece

面向离岸养殖网箱视觉巡检的 UUV，提出能量最优线性二次跟踪控制，在有限电量下延展作业范围与时长。

看点把能量预算直接写进控制目标，野外作业机器人的务实之作。

控制与动力学

摘要 Abstract

Unmanned underwater vehicles (UUVs) have been deployed for fish net-pen visual inspection (FNVI) in offshore aquaculture. Limited energy capacity of onboard power supplies constrains the UUV’s working range and operating time. To minimize the energy consumption by the UUV during the FNVI of the Blue Endeavour Project (an offshore salmon farm of the New Zealand King Salmon Company), an energy-optimal linear quadratic tracking (EO-LQT) control scheme is proposed in this paper. For EO-LQTs implementation, a new Linear-Parameter-Varying (LPV) system that approximates the nonlinear UUV dynamics model with an accuracy of approximately 99% regardless of the operating points in real-time, with the modified versions of Bhāskara I’s sine approximation and Shirali’s cosine approximation, is developed. The use of the Lagrangian under the Principle of Least Action with the UUV’s kinetic energy and the non-quadratic thruster power function in the EO-LQT performance index (PI) is demonstrated. The steps to solve the Hamilton-Jacobi-Bellman (HJB) equation with the non-quadratic Hamiltonian H are detailed to derive the new analytical EO-LQT optimal control form. Five EO-LQT controllers with different PIs are tested against the conventional LQT (CO-LQT) controller in both high-fidelity simulations under simulated disturbance speed up to 0.9 m/s and pool experiments, reducing energy consumption up to 37.1%. As key comparison metrics for the pose tracking and energy consumption, the mean-absolute-error (MAE) and T200 thruster power function are used to validate the effectiveness of the proposed EO-LQT controllers, compared to the CO-LQT controller.

IJRR 2026-06-06

Control of the uncertain fully flexible link-joint robot manipulators: A free-drift adaptive fractional-order robust approach

Seyed Jalal Aldin Hoseini, Mazda Moattari, Saeed Zaare

针对不确定的全柔性连杆-关节机械臂，提出免漂移自适应分数阶鲁棒控制，实现快速低振动跟踪。

看点面向高度欠驱动柔性臂的鲁棒控制，理论与振动抑制并重。

操作与机械臂控制与动力学

摘要 Abstract

Jonas Eschmann, Dario Albani, Giuseppe Loianno

无人机 / 空中机器人机器人学习

摘要

RA-L 2026-06-15

导航 / SLAM / 自动驾驶感知与传感

摘要

Adverse weather conditions, particularly heavy snowfall, pose significant challenges to both human drivers and autonomous vehicles. Traditional image-based desnowing methods often introduce hallucination artifacts as they rely solely on spatial information, while video-based approaches require high frame rates and suffer from alignment artifacts at lower frame rates. Camera parameters, such as exposure time, also influence the appearance of snowflakes, making the problem difficult to solve and heavily dependent on network generalization. In this paper, we propose to address the challenge of desnowing by using event cameras, which offer compressed visual information with submillisecond latency, making them ideal for desnowing images, even in the presence of ego-motion. Our method leverages the fact that snowflake occlusions appear with a very distinctive streak signature in the spatiotemporal representation of event data. We design an attention-based module that focuses on events along these streaks to determine when a background point was occluded and use this information to recover its original intensity. We benchmark our method on DSEC-Snow, a new dataset created using a green-screen technique that overlays pre-recorded snowfall data onto the existing DSEC driving dataset, resulting in precise ground truth and synchronized image and event streams. Our approach outperforms state-of-the-art desnowing methods by 3 dB in PSNR for image reconstruction. Moreover, we show that off-the-shelf computer vision algorithms can be applied to our reconstructions for tasks such as depth estimation and optical flow, achieving a 20% performance improvement over other desnowing methods. Our work represents a crucial step towards enhancing the reliability and safety of vision systems in challenging winter conditions, paving the way for more robust, all-weather-capable applications.

Sci. Robotics 2026-03-25 · 被引 1

Robin R. Murphy

摘要

Four science fiction works describe realistic construction and mining robots enabling human habitation of the Moon.

T-RO 2026-05-22

Future-Trend-Aware Filter-Based PD-MRAC Method for Quadrotors With Unknown Strong Disturbances

Yanhua Yang, Chenxin Yu, Xiongtao Shi, Changchun Hua, James Lam, Youmin Gong, et al.

无人机 / 空中机器人控制与动力学

摘要

Robust flight in complex and windy environments is critical for both single and multiple quadrotors. Existing methods either learn disturbance model at high computational cost or use error-based adaptive control with a speed-stability trade-off that makes tuning difficult. To address these issues, this paper proposes a future-trend-aware filter-based PD-MRAC (Proportional-Derivative Model Reference Adaptive Control) for single quadrotor and a distributed PD-MRAC for multiple quadrotor formation. By embedding a trend-aware derivative term in the adaptive update laws, the controller obtains anticipatory information about the error evolution, enabling rapid adaptation while mitigating oscillations. For more disturbance-sensitive multi-quadrotors, we design a robust distributed protocol under a directed graph, improving resilience to disturbances. The approach maintains low computational cost and supports fast adaptive updates. Extensive simulations and real-world experiments validate improvement. For single quadrotor, RMSE reduced by around 57% versus the baselines and by around 12% versus the DJI Mavic 2. For multi-quadrotors, formation results show enhanced robustness in simulation and effective real-world indoor/outdoor experiments under strong winds. Our project page is at https://xiongtao-shi.github.io/PD-MRAC/ .

T-RO 2026-05-22

Simplifying Robotic Ultrasound Calibration via Conic Sections Geometry

Zixing Jiang, Yingbai Hu, Yichong Sun, Zheng Li

导航 / SLAM / 自动驾驶医疗 / 软体 / 微纳

摘要

Robotic ultrasound (US) systems represent an emerging frontier in medical imaging. A fundamental component of these systems is the rigid body transformation between the robot flange and the attached US probe, which enables mapping of visual data from image space to the robot's reference frame. Traditionally, calibrating this transformation has been a tedious process, complicated by equipment demands and operational constraints arising from the probe's narrow field of view. This work presents a novel calibration strategy based on conic sections geometry, which offers several key simplifications over existing approaches: 1). It requires no external equipment beyond a single cone phantom; 2). It operates with a small input size and imposes no strict alignment or motion constraints on the US scan plane during calibration; and 3). It employs a straightforward pattern analysis pipeline to process images acquired from phantom scans. Experimental validation results show that the proposed method achieves accuracy comparable to existing state-of-the-art approaches while delivering superior precision, thereby demonstrating enhanced calibration reproducibility enabled by its streamlined workflow. These advantages make this method particularly suitable for application in clinical scenarios that require frequent and efficient calibration.

IJRR 2026-05-19

Bioinspired multisegment knee exoskeletons with variable stiffness and kinematic compatibility

Ming Xu, Zhihao Zhou, Wenjie Lou, Xiaolin Dai, Run Wang, Sunil K. Agrawal, et al.

足式 / 四足机器人医疗 / 软体 / 微纳人机交互 / 遥操作

摘要

The knee joint plays a critical role in locomotion but is susceptible to overuse injuries, motivating the development of assistive exoskeletons. Current designs face a fundamental trade-off between achieving kinematic compatibility with the knee’s complex polycentric motion and providing effective variable-stiffness functionality for biomechanical support. This study presents a novel cable-driven multisegment exoskeleton to reconcile these competing requirements through an integrated biomimetic design. The proposed system employs redundant rotational joints and a linear guide rail to passively accommodate natural joint kinematics while enabling wide-range stiffness regulation (0–207 Nm/rad) via active cable length adjustment. This single-actuator approach achieves dynamic stiffness regulation, deterministic torque transmission with an effective moment arm exceeding 70 mm, and seamless state modulation within a low-profile structure (0.63 kg). Benchtop characterization confirmed precise stiffness control across the operational range (rmse ≤ 0.035 Nm/rad). Human subject experiments revealed significant muscular effort reduction during demanding tasks without compromising natural joint kinematics, including 23.9% decrease in peak vastus lateralis activation during incline walking and 29.2% reduction during squatting compared to unassisted conditions. These results validate the exoskeleton’s ability to reconcile anatomical compatibility with physiologically relevant stiffness regulation, representing a significant advance in knee assistive technology with broad applications in clinical rehabilitation and physical performance augmentation. This study bridges a critical gap in knee exoskeleton development, offering a unified solution for comfortable and effective assistance across dynamic tasks.

Sci. Robotics 2026-05-13

人形机器人足式 / 四足机器人机器人学习

摘要

Multi-objective reinforcement learning (MORL) is a powerful tool to learn Pareto-optimal policy families across conflicting objectives. However, unlike traditional RL algorithms, existing MORL algorithms do not effectively leverage large-scale parallelization to concurrently simulate thousands of environments, thus facing vastly increased computation time. Ultimately, this has limited the application of MORL towards complex multi-objective robotics problems. To address these challenges, we present 1) MORLAX, a new GPU-native, fast MORL algorithm, and 2) MO-Playground, apip-installable playground of GPU-accelerated multi-objective environments. Together, MORLAX and MO-Playground approximate Pareto sets within minutes, offering 26-271x speed-ups compared to legacy CPU-based approaches and up to 19x speed-ups over prior GPU-based approaches whilst learning superior Pareto front hypervolumes. We demonstrate MO-Playground's versatility by implementing a custom BRUCE humanoid robot environment and learning Pareto-optimal locomotion policies across 6 practical objectives in simulation, such as smoothness, efficiency and arm swinging.

Sci. Robotics 2026-05-13

Shape-morphing metamaterials with continuous relearning

Melisa Yashinski

摘要

A metamaterial chain uses a physical learning framework to learn, forget, and relearn different shape changes.

T-RO 2026-05-27

AuRo 2026-06-19

STEM: Semantic target search and exploration using MAVs in cluttered environments

Nikhil Sethi, Max Lodel, Laura Ferranti, Robert Babuška, Javier Alonso-Mora

无人机 / 空中机器人导航 / SLAM / 自动驾驶感知与传感

摘要

Autonomous target search is crucial for deploying Micro Aerial Vehicles (MAVs) in emergency response and rescue missions. Existing approaches either focus on 2D semantic navigation in structured environments – which is less effective in complex 3D settings, or on robotic exploration in cluttered spaces – which often lacks the semantic reasoning needed for efficient target search. This paper overcomes these limitations by proposing a novel framework that utilizes a semantically-guided viewpoint planner to minimize target search and exploration time in unstructured 3D environments using an MAV. Specifically, we develop a combinatorial planner that generates efficient semantic exploration plans by prioritizing viewpoints that likely lead to the target. To guide the planner towards the target, an active perception pipeline is developed that propagates semantic priorities of observed objects into neighboring frontier voxels for computing semantic information gains of frontier viewpoints. In addition, we demonstrate how LLM-based similarity scores can be leveraged as semantic priority input to our pipeline. Evaluations in two distinct simulation environments show that the proposed method consistently outperforms baselines by quickly finding the target while maintaining reasonable exploration times. Real-world experiments with an MAV further demonstrate the method’s ability to handle practical constraints like limited battery life, small sensor range, and semantic uncertainty.

JFR 2026-06-19

A Review on Search and Rescue Robots in Complex Scenarios: Key Technologies of Simultaneous Localisation and Mapping

Tianyi Chen, Adam Rushworth, Fuhua Jia

导航 / SLAM / 自动驾驶感知与传感控制与动力学

摘要

This paper presents a comprehensive review of robotics research in search and rescue (SAR) operations conducted in caverns, underground environments, disaster zones, and other areas where Global Navigation Satellite System (GNSS) signals are unavailable. The majority of applications for Simultaneous Localisation and Mapping (SLAM), despite its maturity, are still restricted to structured indoor settings or outdoor environments under normal weather conditions. Standard SLAM frameworks often experience degradation and malfunction or even fail when deployed in increasingly complex and unstructured scenarios. This review identifies three major challenges that robots face in SAR environments: (i) increasingly complex terrain, (ii) changing environments and visibility, and (iii) autonomous exploration requirements, along with corresponding technological evolutions in robot mobility, sensor technologies, and SLAM algorithms. A comprehensive and quantitative evaluation of existing approaches is provided, focusing on SLAM on uneven terrain, multisensor fusion, and active SLAM. Additionally, this paper outlines ongoing challenges for guiding future development toward more robust and reliable deployment‐oriented SLAM solutions for SAR applications. These include: (i) short‐term dynamics and structural changes that undermine data association and loop closure, (ii) observability loss and degeneracy in confined and cluttered spaces, and (iii) multirobot consistency under constrained communication. Two cross‐cutting constraints, which are sensor non‐stationarity and safety‐critical autonomy, are highlighted as key factors that turn deployable SAR SLAM into a system‐level reliability problem. Finally, potential research directions and a practical research roadmap toward robust, real‐time, and evaluable SAR SLAM are outlined.

RA-L 2026-06-15

Yiting Chen, Jiali Fan, Chenglong Li, Boliao Li, Zhenbo Wei, Jun Wang

导航 / SLAM / 自动驾驶感知与传感控制与动力学

摘要

Autonomous navigation in orchards is essential for enhancing operational efficiency and ensuring safe agricultural operations. However, autonomous navigation in orchard environments presents significant challenges due to uneven surfaces and limited visual information in natural environments. To address these issues, this study proposed a shortest‐path planning method for autonomous orchard navigation based on 3D LiDAR SLAM. First, a global 3D map was constructed using the LIO‐SAM algorithm. Ground points were then separated using the Cloth Simulation Filter (CSF), and terrain roughness information was extracted from the ground point cloud to identify rugged areas that might compromise robot stability. In parallel, an improved Random Forest model was used to segment fruit‐tree points, after which DBSCAN was applied to extract individual tree centers and the Kernel Density Estimation (KDE) method was used to estimate tree‐row direction. Finally, a cost map integrating fruit‐tree distribution and terrain roughness information was constructed, and an improved A* algorithm was employed to generate efficient and terrain‐adaptive paths. The proposed method was evaluated in both a simulation and a real pear orchard. The results showed that the proposed approach reduced traversal over rugged terrain by more than 50% and lowered estimated energy consumption by nearly 48%, while maintaining comparable path lengths and high computational efficiency. Field experiments further demonstrated reliable path‐following performance, with average lateral and longitudinal deviations within 0.18 meters and heading deviation below 3.1°. These findings highlight the practical value of incorporating terrain roughness into path planning for robust and efficient orchard navigation.

RA-L 2026-06-12

Embroidery Actuator Utilizing Embroidery Patterns to Generate Diverse Fabric Deformations

Yuki Ota, Yuki Funabora

摘要

This paper presents a novel Embroidery Actuator, a fabric-integrated pneumatic actuator that enables diverse and controllable deformations through embroidery pattern design. Unlike conventional fabric actuators that rely on fiber- or thread-shaped actuators, the proposed actuator is fabricated by directly stitching an inflatable tube onto the fabric using a cord-embroidery technique. The embroidered thread and the fabric jointly form a sleeve that constrains the expansion of the inflatable tube, converting internal pressure into targeted bending or stretching deformations. By varying the embroidery pattern, such as zigzag or cross configurations, different geometric constraints can be realized, allowing for flexible control of deformation direction and magnitude. Analytical deformation models based on theNeo-Hookean modelandLagrange's equationswere developed to predict the relationship between pneumatic pressure and bending angle. And then,experiments demonstrated that the actuator achieved 47 degrees of flexion on the fabric surface side and 165 degrees on the reverse side by altering the embroidery pattern. Additionally, the created model expressed deformation with an error margin of several degrees.

RA-L 2026-06-12

Variable Stiffness Caudal Peduncle Enables Higher Propulsion Performance of a Robotic Fish

Xiaofei Wang, Xiang Li, Lixia Yan, Shiji Song

摘要

Inspired by the biological mechanism of fish caudal peduncles, which are key structures connecting musculature to the caudal fin and modulating stiffness for controlled energy transfer to enable high maneuverability across diverse swimming scenarios, this letter proposed a variable-stiffness caudal peduncle for a robotic fish that integrates thermoplastic polymer polycaprolactone, using temperature control to modulate the molten state of the material and thereby adjust stiffness. This structure enables the formation of an optimal body profile across a wide frequency range, enhancing propulsion performance. The Pseudo-Rigid-Body Model and Lagrangian method were used to model the flexible caudal peduncle and dynamic behavior of the robotic fish, respectively. Thrust results show that the caudal peduncle, in its molten state, exhibits superior performance at low frequencies, while in its solid state, it performs better at high frequencies. Simulations and experiments revealed an optimal stiffness for maximum thrust, with a peak average thrust of 0.72 N. Untethered swimming tests confirmed that temperature-based stiffness regulation of the PCL molten state enables a maximum speed of 0.47 m/s (0.88 body lengths per second) and a minimum cost of transport of 68.9 J/m/kg.

Sci. Robotics 2026-04-15

Rahul, S. K. Saha, S. M. Ishtiaque, Dipayan Das

摘要

This work proposes an algorithm based on RRT* that generates$G^{2}$-continuous path from a start position to a goal position. Additionally, the generated paths satisfy initial heading constraints, bounded absolute curvature, and reduced variation of curvature. The proposed algorithm employs cubic Bézier curves for tree extension and quintic Bézier curves for rewiring, with their control points computed under the minimum jerk energy criterion. The cubic curves admit closed form solutions enabling fast tree growth, while quintic curves enable rewiring which preserves curvature continuity across the path. Results show that the proposed planner produces paths with lower curvature variation than comparable planners at competitive planning times. Additionally, the curvature and curvature rate profiles confirm that the proposed planner remains well below the allowable bound with fewer abrupt transitions. The resulting paths turn gradually around obstacles, producing natural motion suited for passenger vehicles, wheelchairs, and transport of delicate materials.

RA-L 2026-06-08

Jianxin Bi, Kevin Yuchen Ma, Ce Hao, Mike Shou Zheng, Harold Soh

机器人学习感知与传感

T-RO 2026-06-08

EA-GPnP: Efficient and Accurate Generalized-Perspective-n-Point Solution via Optimized Null Space Analysis

Yi Zhang, Baoqiong Wang, Kunhong Li, Wenjun Chen, Xiuqi Wang, Ye Zhang, et al.

JFR 2026-06-08

Mobile Manipulator Robot for Autonomous In‐Situ Soil Measurements in Chile Pepper Cultivation

Roman Langenscheidt, Mahdi Haghshenas‐Jaryani, Heinz Bernhardt

操作与机械臂感知与传感控制与动力学

摘要

Chile pepper farming in New Mexico faces critical constraints from water scarcity, soil salinity, and labor shortages. Precision agriculture technologies enabling data‐driven resource management offer promising solutions. This paper presents an autonomous in‐situ soil sensing system integrated with a mobile manipulator robot for automated soil data collection. The main contribution is a unified, failure‐aware autonomous soil sensing system that integrates vision‐based surface characterization, adaptive force‐controlled sensor insertion, and insertion monitoring with failure detection and recovery into a single low‐cost field‐deployable robotic platform. The system comprises a two‐stage visual alignment process using RGB‐D camera data to adapt to terrain slope and identify obstacle‐free insertion sites, a force‐based contact detection mechanism to determine sensor‐soil contact, and adaptive impedance control with Kalman filter‐based soil stiffness estimation for controlled sensor insertion. The system is implemented on a mobile platform with a six DoF manipulator and TEROS 12 soil sensor. Field evaluation across 41 sensing operations in varying soil conditions during the early chile pepper season demonstrated a 75.6% success rate, with soil measurements correctly obtained upon full sensor insertion. In 90.2% of sensing operations, the system made correct decisions, including aborts when necessary. Main limitations included the inability to detect flush surface obstacles, occasional false contact detections, and incorrect insertion completion verification. Nevertheless, the results demonstrate the feasibility of autonomous in‐situ soil sensing in chile pepper cultivation, providing a foundation for fully autonomous soil monitoring. The methods and approaches developed in this work may extend to other crops requiring in‐situ soil measurements.

JFR 2026-06-08

A Design Specifications Template for Wearable Haptic Interfaces: A Case Study for Robotic Gripper Applications

Amr M. El‐Sayed

操作与机械臂导航 / SLAM / 自动驾驶医疗 / 软体 / 微纳人机交互 / 遥操作

摘要

Wearable haptic interfaces are increasingly important for enhancing human robot collaboration, particularly in decision critical tasks that require intuitive and reliable interaction. Despite advances in wearable systems, existing designs often lack a structured framework that systematically integrates sensing, actuation, control, and user‐centered considerations, limiting consistency, scalability, and performance across robotic applications. This paper presents a design specifications template for wearable haptic interfaces, providing a structured approach to guide designers in addressing key parameters, including user functional needs, ergonomic requirements, and technical design data. A focused review of related technologies covering exoskeletons and wearable haptic devices, sensing technologies for touch, and recent robotic grippers was conducted to inform the template and identify gaps in current design practices. The template was validated using two complementary approaches. Theoretical validation involved mapping two existing wearable haptic systems to the template, revealing that coverage of user characteristics and functional requirements ranged from 25% to 37.5%, highlighting the need for more systematic consideration of human factors. Practical validation was performed by designing, fabricating, and evaluating a three‐finger wearable haptic device integrated with a robotic gripper, demonstrating improved coverage of user‐centered and technical parameters and confirming the template's practical applicability. Overall, the proposed framework provides a systematic, application‐driven methodology for developing reliable and scalable wearable haptic interfaces. By enabling designers to integrate human factors, device functionality, and technical specifications at the pre‐design stage, it supports improved human‐robot collaboration and sets a foundation for future standardized and adaptable haptic systems in teleoperation, rehabilitation, and robotic manipulation tasks.

RA-L 2026-06-04

Yibo Liu, Stanko Oparnica, Simon Shewchun-Jakaitis, Guoyi Fu, Jie Wang, Jun Yang, et al.

操作与机械臂导航 / SLAM / 自动驾驶机器人学习感知与传感

摘要

Contact-rich assembly is fundamental in robotics but poses significant challenges due to uncertainties in relative poses, such as misalignments and small clearances in peg-in-hole tasks. Existing approaches typically address search and high-precision insertion separately, because these tasks involve distinct action patterns. However, supporting both tasks within a single model, without switching models or weights, is desirable for intelligent assembly systems. In this work, we propose SI-Diff, a framework that learns both search and high-precision insertion through a force-domain diffusion policy. To this end, we introduce a new mode-conditioning mechanism that enables the policy to capture distinct action behaviors under a single framework. Moreover, we develop a new search teacher policy that can generate diverse trajectories. By training on successful and efficient demonstrations provided by the teacher policy, the model learns the mapping from tactile and end-effector velocity observations to effective action behaviors. We conduct thorough experiments to show that SI-Diff extends the tolerance to x-y misalignments from 2 mm to 5 mm compared to the state-of-the-art baseline, TacDiffusion [1], while also demonstrating strong zero-shot transferability to unseen shapes.

RA-L 2026-06-01

Yuying Xi, Shuo Liu, Bao Pang, Tuo Zhou, Yong Song, Xianfeng Yuan, et al.

导航 / SLAM / 自动驾驶感知与传感

操作与机械臂感知与传感

摘要

Pose estimation-guided unseen object 6-DoF robotic manipulation is a key task in robotics. However, the scalability of current pose estimation methods to unseen objects remains a fundamental challenge, as they generally rely on CAD models or dense reference views of unseen objects, which are difficult to acquire, ultimately limit their scalability. In this paper, we introduce a novel task setup, referred to as SinRef-6D, which addresses 6-DoF absolute pose estimation for unseen objects using only a single pose-labeled reference RGB-D image captured during robotic manipulation. This setup is more scalable yet technically nontrivial due to large pose discrepancies and the limited geometric and spatial information contained in a single view. To address these issues, our key idea is to iteratively establish point-wise alignment in a common coordinate system with state space models (SSMs) as backbones. Specifically, to handle large pose discrepancies, we introduce an iterative object-space point-wise alignment strategy. Then, Point and RGB SSMs are proposed to capture long-range spatial dependencies from a single view, offering superior spatial modeling capability with linear complexity. Once pre-trained on synthetic data, SinRef-6D can estimate the 6-DoF absolute pose of an unseen object using only a single reference view. With the estimated pose, we further develop a hardware-software robotic system and integrate the proposed SinRef-6D into it in real-world settings. Extensive experiments on six benchmarks and in diverse real-world scenarios demonstrate that our SinRef-6D offers superior scalability. Additional robotic grasping experiments further validate the effectiveness of the developed robotic system.

JFR 2026-06-18

Sthithpragya Gupta, Durgesh Haribhau Salunkhe, Aude Billard

摘要

Teaching robots new skills should be as natural as showing rather than programming. Learning from demonstration (LfD) moves toward this goal by allowing users to guide a robot or sketch a desired motion, enabling learning without writing a line of code. However, most LfD methods remain tied to the robot they were trained on. Changes in morphology, different link lengths, joint orientations, or limits often break the learned behavior, making retraining unavoidable. Here, we introduce a framework that endows robots with kinematic intelligence: an internal understanding of their own joint limits, singularities, and connectivity. Instead of correcting for these constraints after learning, we embedded them directly into the control policy from the outset. The approach takes one or multiple demonstrations, extracts a globally stable dynamical system, and produces behaviors that remain valid across robots with different kinematic structures. Our method is grounded in a comprehensive analytical classification of noncuspidal three-revolute (3R) robots, which form the building blocks of many commercial robots. This classification enables a joint space policy that preserves user intent and adapts to robot-specific constraints. We validated the framework on diverse simulated and real robots, both redundant and nonredundant, with varied link geometries and joint configurations. The demonstrated skill executes safely and consistently across robots without retuning, thereby achieving cross-robot skill transfer.

RA-L 2026-03-26 · 被引 1

Lightweight Learning From Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping

Liudi Yang, Yang Bai, Yuhao Wang, Ibrahim Alsarraj, Gitta Kutyniok, Zhanchi Wang, et al.

操作与机械臂医疗 / 软体 / 微纳控制与动力学

Xiao Zhang, Xueting Hu

操作与机械臂感知与传感

摘要

When manipulators perform non-repetitive tasks in dynamic environments, the generated trajectories are often highly nonlinear and difficult to verify in advance, which increases the risk of self-interference during execution. Existing studies mainly rely on detecting abrupt changes in physical signals after interference occurs, which may cause structural impact and damage. Geometric modeling approaches based on simplified bounding structures can provide predictive detection, but their conservative representations often introduce redundant envelope space and reduce motion flexibility. To address these limitations, this paper proposes a manipulator self-interference detection and path optimization method based on multi-view projection. First, an equivalent-volume representation of the manipulator is constructed by projecting the three-dimensional structure onto feature-sensitive planes and extending contour edge points along normal directions to form a compact three-layer key-point set. Then, the separability of projected key-point sets on interference-discriminative projection planes is evaluated through a geometric discrimination function to determine potential self-interference at path points. For the path points identified as interfering, a local iterative adjustment strategy based on the separating line is further applied to modify the path while preserving the original path geometry as much as possible. Simulation and experimental results demonstrate that the proposed method effectively improves self-interference detection reliability and path optimization efficiency, showing strong potential for practical industrial applications.

T-RO 2026-04-27

导航 / SLAM / 自动驾驶

RA-L 2026-06-08

Closed-Loop Sensorless Position Control of Dielectric Elastomer Soft Robots via Actuator-Level Self-Sensing

Giovanni Soleti, Paolo Roberto Massenio, Gianluca Rizzello

医疗 / 软体 / 微纳

RA-L 2026-06-08

Any-Angle and Arbitrary-Geometry SIPP: Multi-Agent Path Finding With Polygonal Shapes

Yichen Li, Xuebo Zhang, Runhua Wang, Zhijie Hu, Yaonan Wang

多机器人 / 集群

T-RO 2026-04-21

Extreme High-Gain Friction Observer of Flexible Joint Robots With $\mathcal {L}_{1}$ Adaptive Framework

Young Bin Lee, Tae Ho Yun, Min Jun Kim

摘要

Compliance control enables flexible joint robots (FJRs) to interact with unknown environments, but joint friction may significantly degrade control performance and backdrivability. While several model-free friction observers for FJRs have been studied in recent decades, current approaches still face challenges when the robot interacts with stiff environments. To tackle this, this paper proposes a new friction observer based on an $\mathcal {L}_{1}$ adaptive framework. The main advantage of the proposed method is that it overcomes a fundamental trade-off in the state-of-the-art (SOTA) method between accurate friction compensation and natural environmental interactions. Moreover, the proposed approach enables the use of extremely high gains, which yield several additional benefits. First, unlike the conventional methods, which require feedback of so-called nominal signals obtained through simulation, measured motor signals can be fed back into the controllers, leading to a simpler implementation. Second, we provide performance analysis showing that increasing the gain improves performance and results in near-zero steady-state error. Third, the observer's performance can be adjusted using only a single parameter. Lastly, the numerical issues arising from extremely high gains are alleviated by employing a stable numerical method. The above theoretical findings are validated through simulations, and the effectiveness of the proposed approach is further evaluated with real-world experiments using both single- and 7-joint FJR systems. The results demonstrate that the proposed approach enables robots to interact with stiff environments more naturally, while achieving enhanced friction compensation performance.

RA-L 2026-06-01

Guest Editorial: Advancements in MPC and Learning Algorithms for Legged Robots

Luca Rossini, Enrico Mingo Hoffman, Francesco Ruscelli, Guillaume Bellegarda, Carlos Mastalli, Luis Sentis

足式 / 四足机器人控制与动力学

RA-L 2026-06-01

End-to-End Policy Learning for Hip Exoskeleton via Reinforcement Learning and Reflex-Based Musculoskeletal Simulation

Hossein Barati, Sangdo Kim, Nguyen Thanh Xuan, Jongwon Lee, Young Jin Park

机器人学习医疗 / 软体 / 微纳

RA-L 2026-05-15

Samer Raed Aldabbas, Ahmet Talha Çetin, Murad Abu-Khalaf, Emre Koyuncu

RA-L 2026-05-29

DS-LABRNav: Land-Air Bimodal Robot Navigation With Traversable Obstacles Base on Vision-Language Model

Yongjie Li, Wenshuai Yu, Molong Duan, Bo Zhang, Zhou Liu, Qingquan Li

导航 / SLAM / 自动驾驶感知与传感

AuRo 2026-05-08 · 被引 1

Vision-based manipulation from single human video with open-world object graphs

Yifeng Zhu, Arisrei Lim, Peter Stone, Yuke Zhu

操作与机械臂感知与传感

RA-L 2026-05-11

Tianxing Zhou, Haojia Ao, Haoyang Lu, Guangyan Chen, Zichen Zhou, Te Cui, et al.

操作与机械臂

RA-L 2026-06-01

Oscillation-Based Locomotion of a Spherical Robot Driven by Flywheels With Low Control Torque*

Yituo Song, Yanfang Liu, Jianhang Sun, Muhang Liu, Debo Kong, Xu Wang, et al.

足式 / 四足机器人

RA-L 2026-06-01

A High-Speed Omnidirectional Ceiling Mobile Robot Using a Synchronized Crawler-Hanging Mechanism

Hiroto Hasegawa, Ryota Yokomura, Rui Fukui

导航 / SLAM / 自动驾驶

Tobias Löw, Cem Bilaloglu, Sylvain Calinon

人形机器人操作与机械臂多机器人 / 集群人机交互 / 遥操作控制与动力学

摘要

Many tasks in human environments require collaborative behavior between multiple kinematic chains, either to provide additional support for carrying big and bulky objects or to enable the dexterity that is required for in-hand manipulation. Since these complex systems often have a very high number of degrees of freedom, coordinating their movements is notoriously difficult to model. In this article, we present the derivation of the theoretical foundations for cooperative task spaces of multi-arm robotic systems based on geometric primitives defined using conformal geometric algebra. Based on the similarity transformations of these cooperative geometric primitives, we derive an abstraction of complex robotic systems that enables representing these systems in a way that directly corresponds to single-arm systems. By deriving the associated analytic and geometric Jacobian matrices, we then show the straightforward integration of our approach into classical control techniques rooted in operational space control. We demonstrate this using bimanual manipulators, humanoids and multi-fingered hands in optimal control experiments for reaching desired geometric primitives and in teleoperation experiments using differential kinematics control. We then discuss how the geometric primitives naturally embed nullspace structures into the controllers that can be exploited for introducing secondary control objectives. This work represents the theoretical foundations of this cooperative manipulation control framework, and thus the experiments are presented in an abstract way, while giving pointers toward potential future applications.

Roman Freiberg, Alexander Qualmann, Ngo Anh Vien, Gerhard Neumann

人形机器人操作与机械臂

摘要

Multi-embodiment grasping aims to develop approaches that exhibit generalist behavior across diverse gripper designs. Existing methods often learn the gripper kinematic structure implicitly and face challenges due to the difficulty of sourcing the required large-scale data. In this work, we present a data-efficient, flow-based, equivariant grasp synthesis architecture that handles different gripper types with variable degrees of freedom and exploits the underlying kinematic model, deducing all necessary information solely from gripper and scene geometry. Unlike previous equivariant grasping methods, we implement all modules in JAX and provide batching capabilities over scenes, grippers, and grasps, resulting in smoother learning, improved performance, and faster inference. Our dataset encompasses grippers ranging from humanoid hands to parallel-jaw designs, including 25,000 scenes and 20 million grasps.

Sci. Robotics 2026-03-25

Driver’s licenses for autonomous systems

Sebastian M. Pfotenhauer, Alexander Wentland, Manuel Jung, Markus Lienkamp, Dava Newman

摘要

Familiar licensing routines, like driving exams, may beat technical checklists in building trust in autonomous systems.

JFR 2026-05-25

摘要

The control of an orbital space robot is challenging due to the strong nonlinear dynamic coupling between the floating base spacecraft and the equipped manipulator. To address this problem effectively, this paper develops a geometric control framework by identifying and exploiting the Lie group structures of the space robot. The paper shows how to formulate the system momentum evolution equations as a set of first-order ordinary differential equations. Then, it discusses the designs of the Lie-algebra proportional-integral controller and the manifold model predictive controller to perform the three-dimensional pose trajectory tracking task. For the manifold model predictive controller, the paper presents the structure-preserving direct-collocation method to enforce the discrete dynamic constraints in a finite-horizon optimal control problem. Furthermore, it presents the performance comparisons of the above two controllers in numerical simulations, and emphasizes the significance of computational accuracy and efficiency, momentum shaping and prediction horizon selection for the manifold model predictive controller, with detailed benchmarks against the classic Euclidean model predictive controller. Finally, the paper demonstrates the trajectory tracking and object capturing experiments in a three-dimensional space via an air-bearing space robot simulator.

RA-L 2026-05-28

IEEE Robotics and Automation Society Information

RA-L 2026-05-27

IEEE Robotics and Automation Society Publication Information

RA-L 2026-05-27

IEEE Robotics and Automation Letters Information for Authors

RA-L 2026-05-27

RA-L 2026-04-22

LoD-GS: Robust and Lightweight Gaussian Splatting SLAM for Real-Time Volumetric Scene Reconstruction

Jiachen Wang, Seung-Hyun Kong

导航 / SLAM / 自动驾驶

摘要

Real-time 3D reconstruction is becoming a key enabler for robotics, mixed reality, and autonomous vehicles. Recent advances in 3D Gaussian Splatting (3DGS) have enabled high-fidelity volumetric modeling, and their integration with SLAM shows strong potential for real-time deployment. However, the substantial size of 3DGS models limits deployment on heterogeneous devices, while their rendering quality remains highly sensitive to tracking accuracy under motion blur and abrupt texture variations. In this work, we propose LoD-GS, a lightweight and robust 3DGS-SLAM framework that produces compact yet high-fidelity Gaussian scene representations for flexible deployment. LoD-GS integrates entropy-driven scene-complete volumetric mapping to improve pose quality and Gaussian initialization, a geometry-aware rendering quality optimizer that emphasizes near-field and structure-rich regions under limited optimization budgets, and a deployment-aware level-of-detail 3DGS compression module that enables adaptive resource-quality tradeoffs. Extensive experiments on public benchmarks and real-world office sequences demonstrate its effectiveness, reducing model size by up to 53.8%, increasing rendering FPS by up to 43.72%, and improving PSNR by up to 2.471 dB.

RA-L 2026-04-22

Decoupled Heuristic Multi-Vehicle Emergency Trajectory Planning for Sudden Obstacles

Dengyu Xiao, Zhenyang Zeng, Chuan Tong, Mengdie Huang, Gang Wang, Jun Luo, et al.

导航 / SLAM / 自动驾驶

摘要

The emergence of sudden obstacles can significantly reduce the feasible space and may induce locally non-convex or fragmented space, especially in densely clustered scenarios, making vehicle trajectory planning remarkably challenging. Current methods face computational bottlenecks when generating emergency trajectories under such tight real-time constraints. To address this issue, we decouple safety-critical guidance from trajectory optimization for suddenly appearing obstacles. Specifically, a novel unified nonpositivity quantification method based on vector cross-product consistency is introduced to numerically constrain non-convex regions and a heuristic risk metric is designed to guide the optimization of avoidance target. Additionally, a dynamic priority strategy is further designed to adaptively adjust the constraint dimensionality in real time, improving the success rate of emergency planning. Comparative evaluations with existing emergency planning methods demonstrate the superiority of the proposed approach in terms of success rate, planning time, and emergency trajectory length. Finally, several real-world multi-vehicle experiments validate the effectiveness and practical applicability of the proposed method.

RA-L 2026-04-22

Multi-Priority Reactive Motion Control for Safe and Coordinated Dual-Arm Manipulation in Dynamic Environments

Jichuan Yu, Jizhou Yan, Zhao Jin, Chuxiong Hu, Ze Wang

操作与机械臂

摘要

Reactive motion generation for dual-arm robotic systems is challenging due to their high degrees of freedom, nonlinear characteristics as well as the presence of multiple constraints, including kinematic limits, collision avoidance, dual-arm coordination, and other task-specific requirements. These constraints may become incompatible especially in dynamic operational scenarios. This paper presents a multi-priority reactive motion control framework to address the above challenges. First, a novel time-varying control barrier function leveraging multi-body distance blending is proposed to formulate dynamic whole-body collision avoidance constraint. Then, a constraint prioritization mechanism is introduced to incorporate multiple task objectives into a single optimization-based controller, where the constraints are resolved in strict order of priority using hierarchical quadratic programming. The proposed control framework is extensively validated in both simulations and real-world experiments, with results consistently demonstrating its ability to generate both safe and coordinated reactive motions across various dual-arm collaboration tasks.

AuRo 2026-06-17

Distributed autonomous robotic systems 2024

Michael Otte, Michael Rubenstein, Kirstin Petersen

RA-L 2026-04-27

Field Validation of Prior-Based Image Compression for Tetherless Operation of Underwater Remotely Operated Vehicles

Luyuan Peng, Yuen Min Too, Mandar Chitre, Hari Vishnu, Bharath Kalyan, Rajat Mishra, et al.

摘要

Efficient visual communication is critical for tetherless operation of underwater remotely operated vehicles, where acoustic links severely constrain bandwidth. Prior work introduced NVSPrior, which uses novel view synthesis with 3D Gaussian Splatting to encode scene priors, together with iNVS, a gradient-based refinement strategy for improving reconstruction quality. However, its performance degrades in real-world environments due to turbidity, lighting variability, and dynamic scene elements. This paper presents a systematic field evaluation of NVSPrior+iNVS in turbid natural waters using ROV trials off St. John's Island, Singapore. To improve robustness, we introduce iNVS-w, which combines a DFNet-inspired pose regressor with a perceptual refinement loss. Benchmarking against classical and learned codecs shows that iNVS-w achieves substantially lower bitrate than scene-agnostic baselines while maintaining high perceptual fidelity on realistic field imagery. Ablation studies further quantify the role of initialization, loss functions, and feature extractors. These results provide a field-based assessment of prior-based image compression and identify practical modifications needed for robust operation in bandwidth-constrained underwater inspection.

RA-L 2026-04-27

RA-L 2026-05-11

Impact-Aware Robust Convex Model Predictive Control for Quadruped Locomotion on Uncertain Terrain

Kuikui Xue, Xin Xin, Jiangyong Hu

足式 / 四足机器人控制与动力学

RA-L 2026-05-11

Tactile Memory With Soft Robot: Robust Object Insertion via Masked Encoding and Soft Wrist

Tatsuya Kamijo, Mai Nishimura, Nodoka Shibasaki, Jeremy Siburian, Cristian C. Beltran-Hernandez, Masashi Hamaya

感知与传感医疗 / 软体 / 微纳

T-RO 2026-03-30

Seungeun Rho, Shamel Fahmi, Jeonghwan Kim, Arianna Ilvonen, Sehoon Ha, Gabriel Nelson

机器人学习

RA-L 2026-04-22

Chaz Cornwall, Casey Majhor, Logan Schexnaydre, Ian Mattson, Jeremy P. Bos

摘要

Sample-based path planners (SBPs) must balance sampling time and path optimality in complex domains. Without an adequate balance, SBPs will either take too long sampling or return a path with too much excess path length (EPL). Knowing and exploiting the relationship between sampling and EPL enables faster convergence to the optimal path. However, most models of this relationship are either overly restrictive or rely on indirect representations of EPL. We show a useful, direct relationship between the number of samples and EPL in the presence of sparse obstacles is a probability distribution function consisting of a binomial expansion of gamma distributions. Using simulations of SBPs, we show our proposed distribution is able to infer planner parameters from empirical data. We also present an algorithm that uses our distribution to improve the convergence of SBPs. Simulations show our algorithm reduces median path length by approximately 10% in higher dimensions without significantly reducing success rate. Github: https://github.com/chazcornwall/can_we_get_there_faster .

RA-L 2026-04-20

Parallel Mechanism-Type Skill-Assist Arm Using a Passive-State Actuator to Aid Movement of Limbs

Kengo Tanaka, Hiroaki Kozuka, Hiroshi Tachiya

摘要

Robotic arms equipped with passive-state actuators to assist beginners in performing upper-limb motions have been developed for tasks such as welding requiring precise positioning. The arms are equipped with position-controlled actuators, one of which can be switched to a passive-state actuator by turning off its excitation. When the passive-state actuator is active, the operator can directly input motion, and the robot assists in positioning along a target trajectory by coordinating with the operator's motion and selectively switching actuator excitations. A three-degree-of-freedom (3-DOF) serial-type arm using the above method has been reported in prior work. However, it exhibited issues such as motion discontinuities caused by inertial effects during excitation switching. To address these problems, this study proposes a lightweight arm based on a parallel mechanism, incorporating torque control at the passive-state actuator to achieve stable assistance. A prototype was fabricated, and experiments were conducted in which the arm assisted subjects—assumed to be beginners—in upper-limb positioning tasks. The experimental results confirmed that the proposed arm effectively assists in positioning and confirms practical feasibility.

Jessica Yin, Haozhi Qi, Youngsun Wi, Sayantan Kundu, Mike Lambeta, William Yang, et al.

感知与传感

RA-L 2026-05-11

Globally Guided Reactive Motion Planning for Non-Prehensile Transport in Cluttered Space

Bing Zhao, Qing Gao, Xiaolong Yu, Mingxuan Zhang, Xinyang Tian, Renluan Hou, et al.

导航 / SLAM / 自动驾驶

Hongzhe Shi, Chao Ye, Chenlu Liu, Weiyang Lin

感知与传感人机交互 / 遥操作

摘要

Collision detection is critical for safe physical human-robot interaction (pHRI). The generalized momentum observer (GMO) is a prevalent sensorless method, estimating momentum deviations to detect collisions. However, its reliance on iterative solutions and static thresholds severely limits operational efficiency and dynamic adaptability, leading to reduced sensitivity and accuracy in complex collision scenarios. To overcome these limitations, this paper proposes an impulse-based dynamic threshold generalized momentum observer (IDT-GMO)for adaptive sensorless collision detection. Key innovations include: (i) An impulse-based dynamic threshold mechanism, leveraging the impulse-momentum theorem, enabling adaptive collision detection with enhanced sensitivity and accuracy; (ii) A closed-form analytical solution, formulated through Lagrangian mechanics and Lie group theory, eliminating iterative computation and achieving 45% faster processing than conventional GMO. Experimental validation across soft contact, rigid impact, and multi-contact scenarios confirms that IDT-GMO achieves superior detection sensitivity and accuracy compared to existing methods. Thus, IDT-GMO exhibits significant potential for applications ranging from delicate contact tasks to collision-prone environments.

AuRo 2026-05-24

Agile assistive hospital robot for suboptimal Task execution in dynamic environments

Yun-Chi Chiang, I-Pei Lee, Li-Chen Fu, Yun-Hsiang Lee

人机交互 / 遥操作

RA-L 2026-04-09

An Efficient Closed-Form Solution to Full Visual-Inertial State Initialization

Samuel Cerezo, Seong Hun Lee, Javier Civera

摘要

In this letter, we present a closed-form initialization method that recovers the full visual–inertial state without nonlinear optimization. Unlike previous approaches that rely on iterative solvers, our formulation yields analytical, easy-to-implement, and numerically stable solutions for reliable start-up. Our method builds on small-rotation and constant-velocity approximations, which keep the formulation compact while preserving the essential coupling between motion and inertial measurements. We further propose an observability-driven, two-stage initialization scheme that balances accuracy with initialization latency. Extensive experiments on the EuRoC dataset validate our assumptions: our method achieves 10−20% lower initialization error than optimization-based approaches, while using 4× shorter initialization windows and reducing computational cost by 5×.

RA-L 2026-04-09

Mingyuan Dou, Ning He, Lile He, Jiaxuan Chen

足式 / 四足机器人控制与动力学

Kota Kondo, Yuwei Wu, Vijay Kumar, Jonathan P. How

摘要

Hard-constraint trajectory planners often rely on commercial solvers and demand substantial computational resources. Existing soft-constraint methods achieve faster computation, but either (1) decouple spatial and temporal optimization or (2) restrict the search space. To overcome these limitations, we introduce MIGHTY, a Hermite spline-based planner that performs spatiotemporal optimization while fully leveraging the continuous search space of a spline. In simulation, MIGHTY achieves a 9.3% reduction in computation time and a 13.1% reduction in travel time over state-of-the-art baselines, with a 100% success rate. In hardware, MIGHTY completes multiple high-speed flights up to 6.7 m/s in a cluttered static environment and long-duration flights with dynamically added obstacles.

RA-L 2026-04-06

Qiang Li, Lu Wang, Wenxing Fu

无人机 / 空中机器人导航 / SLAM / 自动驾驶

摘要

Planning smooth trajectories through a sequence of waypoints under nonconvex constraints is challenging due to the coupling between coefficient optimization and time allocation. Existing gradient-based spline trajectory optimization methods tend to be susceptible to local minima and poor initializations, or restrained by complicated gradient computations. We propose a sampling augmented bilevel optimization (SABO) approach that integrates gradient-based optimization with correlated spatio-temporal sampling for improved robustness and optimality. Through temporal normalization, the closed-form solution of coefficient optimization becomes an explicit function of segment durations, while the Hessian becomes linear in their powers, enabling analytic bilevel gradient computation without using finite differences or linearized constraints. Correlated mutations are subsequently performed around the gradient-induced solution to further explore the constrained spatio-temporal space, with sample projection and covariance matrix adaptation to guide sampling towards low-cost, feasible regions. Simulations show that SABO outperforms existing methods in terms of optimality and robustness. We validate SABO in flight experiments conducted on a quadrotor.

RA-L 2026-04-28

3D RoA-Planner: Path Planner for Quadruped Robots in Confined Spaces Using 3D Rotatable Areas

Yeongwoo Son, Hyunyong Lee, Hansol Kang, Jiman Park, SeongWon Nam, Jaeyoung Oh, et al.

足式 / 四足机器人

RA-L 2026-04-28

Advancing MAPF Toward the Real World: A Scalable Multi-Agent Realistic Testbed (SMART)

Jingtian Yan, Zhifei Li, William Kang, Kevin Zheng, Yulun Zhang, Zhe Chen, et al.

多机器人 / 集群

RA-L 2026-04-28

Deformable Point Attention for LiDAR Place Recognition With Weighted GeM Aggregation

Minseo Park, JungWoo Kim, Jaejin Jeon, DoHyeong Kwon, SangHyun Lee, Soomok Lee

感知与传感

RA-L 2026-03-30

A Semantic-Aware Integrated A$^{*}$ and Artificial Potential Field Path Planning Framework

Wei Zhou, Zhouyingmiao Chen

导航 / SLAM / 自动驾驶

摘要

This letter addresses the challenge of reliable path planning for mobile robots navigating complex, semantically constrained environments. We propose a systematically integrated path planning framework that combines the A $^*$ algorithm with the artificial potential field (APF) method. Planning efficiency is improved through an adaptive cost function and an Environment-Aware Adaptive Robot Motion Block (EA-RMB) strategy, while navigation safety is reinforced by incorporating high-level semantic constraints, such as prohibited zones, via cost penalization. Furthermore, the repulsive component of the APF is integrated into the heuristic method of the proposed algorithm, enabling the planner to proactively avoid zones with dense obstacles during the global search process. Extensive simulations and experiments demonstrate that the proposed algorithm significantly improves planning efficiency while maintaining highly competitive path quality. Furthermore, it rigorously enforces semantic constraints, ensuring safe and reliable navigation in complex environments.

AuRo 2026-04-20

Diver interest via pointing in three dimensions: 3D pointing reconstruction for diver-AUV communication

Chelsey Edge, Demetrious Kutzke, Megdalia Bromhal, Junaed Sattar

感知与传感

摘要

This paper presents Diver Interest via Pointing in Three Dimensions (DIP-3D), a method to indicate an object of interest from a diver to an autonomous underwater vehicle (AUV) by pointing that includes three-dimensional distance information to discriminate between multiple objects in the AUV’s field of view. Traditional dense stereo vision for distance estimation underwater is challenging because of the relative lack of saliency of scene features and degraded lighting conditions. Yet in many applications, including distance information is necessary for robotic perception of diver pointing when multiple objects appear within the robot’s image view. We subvert the challenges of underwater distance estimation by using sparse reconstruction of specific keypoints in both the left and right images from the robot’s stereo camera to perform pose estimation. Triangulated pose keypoints, along with any object detection method, enable DIP-3D to infer the location of an object of interest when multiple objects are in the AUV’s field of view. By allowing the scuba diver to point at an arbitrary object of interest and enabling the AUV to autonomously decide which object the diver is pointing to, this method permits more natural interaction between AUVs and humans in underwater-human robot collaborative tasks.

JFR 2026-04-20

Intelligent Autonomy: A Novel Hybrid Navigation System for Autonomous Load‐Haul‐Dump Vehicles

Yuanjian Jiang, Pingan Peng, Xiaofeng Huo, Jiaheng Wang, Liguan Wang

导航 / SLAM / 自动驾驶

摘要

The automation and intelligence of underground mining vehicles are vital for ensuring safety and improving production efficiency, representing an essential trend in the evolution of the mining industry. However, achieving autonomous navigation for load‐haul‐dump (LHD) vehicles in GPS‐denied underground environments poses significant challenges. To address these challenges, we introduce a novel hybrid navigation (HN) strategy that combines the strengths of absolute navigation (AN), which relies on precise localization using pre‐mapped environments, with reactive navigation (RN), which utilizes real‐time sensor data for immediate navigation decisions. In this strategy, the AN facilitates map‐referenced positioning during turns, while the RN dynamically adjusts the trajectory on straight segments through real‐time sensor feedback, independent of absolute localization. This integration enhances the robustness of navigation. We conducted simulation experiments to compare RN, AN, and HN systems. The results demonstrate that the HN system effectively merges the adaptability of RN with the precision of AN, ensuring reliable navigation through narrow intersections and stable performance on straight paths. Field trials further validated the HN system's ability to operate an LHD vehicle at a linear speed of approximately 1.8 m/s and a turning speed of 0.6 m/s, underscoring its practical applications in real‐world scenarios. These findings highlight the HN system's potential for robust autonomous operation in complex underground environments.

JFR 2026-04-14

Six‐Dimensional Digital Twin System for Autonomous Underwater Vehicles: Conceptualization and Twin Experiments

Lin Yu, Lei Qiao

导航 / SLAM / 自动驾驶人机交互 / 遥操作

摘要

To promote the efficient, comprehensive, reliable, and low‐cost testing and application of intelligent algorithms for autonomous underwater vehicles (AUVs), this paper proposes an innovative six‐dimensional digital twin (6D DT) conceptual model and provides detailed engineering implementation strategies of this twin system. This model integrates six core dimensions, including physical entity, virtual entity, virtual native entity (VNE), twin data, services, and communication connection. The concept of VNE is introduced to significantly enhance the practicability, security, and reliability of AUV testing by constructing diversified test scenarios. To implement the proposed model, a high‐fidelity underwater Cyberspace visualization is developed using Unreal Engine 5, which improves the granularity of virtual–real mapping and enhances human–computer interaction. An efficient data bridge plugin is implemented to ensure real‐time, stable bidirectional communication. The DT system (DTS) supports both offline simulation and online DT modes, enabling flexible testing from pure software simulation to real‐time virtual–physical interaction, thereby enhancing the credibility of algorithm validation. Two experimental cases conducted on this DTS demonstrate the technical feasibility and reliability of the proposed conceptual model. The approach provides a valuable reference for applying digital twin technology in underwater unmanned systems and accelerates the development of autonomous intelligent AUVs.

Manato Fujiya, Ryota Yokomura, Hiroto Yokoyama, Yiqian Jin, Rui Fukui

RA-L 2026-04-22

Radar-Inertial Odometry for Low-Speed Driving

Luis Diener, Jens Kalkkuhl, Markus Enzweiler

导航 / SLAM / 自动驾驶

RA-L 2026-04-09

IEEE Robotics and Automation Society Publication Information

RA-L 2026-04-23

IEEE Robotics and Automation Society Information

RA-L 2026-04-23

IEEE Robotics and Automation Letters Information for Authors

RA-L 2026-03-25

Reconfigurable Straight-Spoke Tri-Wheel Mechanism With Four Bar Linkage for Optimal Stair Climbing

Liran Zhou

摘要

In this paper, we propose a new four-bar linkage reconfigurable wheel for stair climbing. Previous works on reconfigurable wheel-based stairclimbers focus on curved-spoke geometries, but we propose a Y-shaped straight-spoke geometry and show why it is superior via a static analysis. Our mechanism adds reconfigurability to the straight-spoke structure, allowing it to both travel smoothly on flat surfaces and climb steep stairs. A four-bar linkage is used for the expansion mechanism, allowing the wheel to mimic a straight-spoke design when fully expanded. We conduct a kinematic study of our mechanism to find the theoretical path during ascent. Then, we simulated our mechanism with PyBullet, finding that the new mechanism achieves stronger results in both ascending and descending stairs when compared to standard curved-spoke designs and similar results when compared to the straight-spoke design. Following this, we analyze and compare the simulated paths of each design. Finally, we investigated boundary failure cases, focusing on the average angular velocity to further compare the ascent capabilities of our design against the curved-spoke design.

RA-L 2026-04-10

G$^{2}$VLO: Accurate and Generic 2D Gaussian Based Visual-LiDAR Odometry

Diantao Tu, Hainan Cui, Peilin Tao, Yangdong Liu, Shuhan Shen

导航 / SLAM / 自动驾驶感知与传感

JFR 2026-03-27

A Wheeled Robot Inspection System for Long‐Term Operation in Large‐Scale Industrial Environments

Chenpeng Yao, Chengju Liu, Hong Chen, Qijun Chen

导航 / SLAM / 自动驾驶感知与传感控制与动力学

摘要

Robotic navigation and object detection technologies have advanced significantly. However, deploying inspection systems in large‐scale industrial environments, particularly for long‐term operations, remains challenging due to the lack of a comprehensive software and hardware platform. To address these challenges, this paper presents a wheeled robotic inspection system designed for sustained operation in large‐scale industrial settings. A novel roadmap construction method is introduced to optimize spatial structures for real‐time processing. Additionally, a feedback mechanism is proposed to ensure stable and high‐performance operation over extended periods. The system is further supported by a hardware platform that seamlessly integrates with the software framework, enhancing overall operational performance and reliability. Experimental results validate the effectiveness of the proposed method, while real‐world testing demonstrates the system's feasibility and stability for long‐term deployment. This work provides a comprehensive solution for robotic inspection in large‐scale environments, offering a practical and scalable reference for researchers and practitioners.

RA-L 2026-04-09

GPTS-Nav: End-to-End Robot Navigation in Dynamic Environments With Graph-Privileged Teacher-Student Reinforcement Learning

Kairao Zheng, Zhi Li, Yiqing Yuan, Hui Cheng

导航 / SLAM / 自动驾驶机器人学习

RA-L 2026-04-09

A Multimodal Selective Fusion Approach for Robotic Grasp Detection

Lin Shi, Shuaikang Zhang, Xinwen Zhou, Yuanwei Ma, Ran Wei

操作与机械臂感知与传感

IJRR 2026-03-24

Corrigendum to “Enabling to learn for force sensing: A coupling-decoupling under-actuated gripper with multiple-DoFs”

操作与机械臂

JFR 2026-04-01

Development of Spatial Path Tracking Algorithm and Controller for a 6‐SPS Stewart Parallel Manipulator: A Simulation and Experimental Study

Dev Kunwar Singh Chauhan, Pandu R. Vundavilli

操作与机械臂导航 / SLAM / 自动驾驶

摘要

The Stewart parallel robot is popular for its high payload capacity due to its six prismatic links. Researchers worldwide are exploring it for various applications. In this work, the authors have developed an inverse kinematics‐based spatial path tracking algorithm for the Stewart platform that allows it to track circular paths in multiple planes. Authors also conducted experiments to test the algorithm. Initially, they established the inverse kinematics, Jacobian, and singularity of the robot. Next, they established motion planning for the robot using a third‐order polynomial in task space. Subsequently, they developed a motion controller for an individual joint actuator, employing a PID control strategy to precisely control its motion. After that, they controlled the overall motion of the Stewart manipulator using inverse kinematics by utilizing the actuator's PID‐based motion controller. The authors accomplished a novel path tracking method after breaking the whole path into multiple small trajectories and matching the endpoint velocities. Later, they used the developed path‐tracking algorithm to generate a circular shape on the aluminum disc. The developed algorithm successfully created a circular form on the aluminum disc for the incremental form application.

JFR 2026-04-01

Redefining Optimal Coverage Path Planning for FLS‐Equipped AUVs With Deep Reinforcement Learning

Lorenzo Cecchi, Alberto Topini, Alessandro Bucci, Alessandro Ridolfi

导航 / SLAM / 自动驾驶机器人学习

摘要

Autonomous Underwater Vehicles (AUVs) have emerged as indispensable tools for a variety of subsea tasks, from habitat monitoring and seabed mapping to infrastructure inspection and mine countermeasures. A fundamental challenge in this field is Coverage Path Planning (CPP), the problem of ensuring complete and efficient area coverage. Within this research activity, we propose a Deep Reinforcement Learning (DRL)‐based framework for CPP in underwater environments using a Forward‐Looking Sonar (FLS). We validate the proposed methodology through simulation experiments comparing it with the classical lawnmower path and a state‐of‐the‐art sampling‐based algorithm. Results demonstrate that our DRL‐based solution outperforms these baseline approaches in terms of coverage time per unit area and path length. Additionally, we present on‐field deployment outcomes on FeelHippo AUV, showcasing the feasibility and practicality of our framework in real‐world underwater missions.

RA-L 2026-04-13

PHMRNet: Persistent Homology Based Mamba–RWKV Network for LiDAR Place Recognition

Dejing Zhou, Xinyu Jiang, Sitao Chen, Zhonghao Cai, Jin Wu, Xieyuanli Chen, et al.

感知与传感

RA-L 2026-04-13

MoDeSuite: Robot Learning Task Suite for Benchmarking Mobile Manipulation With Deformable Objects

Yuying Zhang, Kevin Sebastian Luck, Francesco Verdoja, Ville Kyrki, Joni Pajarinen

操作与机械臂

摘要

EC/HE/101189836/EU//XSCAVE

RA-L 2026-04-10

Eel-Inspired Electrohydraulic Soft Swimmer With Programmable Undulatory Gaits

Yi Jin, Imon G. Pranta, Yunteng Cao, Guiyin Xu, Changyong Cao

足式 / 四足机器人

JFR 2026-04-08

Cover Image, Volume 43, Number 3, May 2026

Zhenliang Zheng, Yongyuan Xu, Xuchun He, Tin Lun Lam, Ning Ding

AuRo 2026-05-04

Time-discounted ergodicity on graphs for active robotic inspection of confined spaces

Benjamin Wong, Ryan H. Lee, Tyler M. Paine, Santosh Devasia, Ashis G. Banerjee

AuRo 2026-04-22

Global feature enhancement and skip-connected fusion for grasping detection

Shengjun Xu, Xiaoyi Wang, Rui Shen, Ya Shi, Bohan Zhan, Erhu Liu, et al.

操作与机械臂感知与传感

RA-L 2026-04-06

S$^{3}$aDPWo: Spatial-, Semantic-, and Shape-Aware Diffusion Policy Toward Autonomous Wound Repair

Wenda Xu, Haozhe Fang, Zexin Cao, Zhihang Tan, Gongcheng Wang, Han Wang, et al.

机器人学习

AuRo 2026-04-27

StealthBAT: Crafting stealthy blackbox adaptive adversarial triggers for monocular UAS navigation and tracking

Naman Patel, Nikolaos Evangeliou, Prashanth Krishnamurthy, Anthony Tzes, Farshad Khorrami

导航 / SLAM / 自动驾驶

JFR 2026-03-23

DURAL: Degradation‐Resistant Robust Adaptive Localization by LiDAR‐Inertial‐UWB‐Wheel Fusion for Coal Mine Robots

Kun Hu, Menggang Li, Zhiwen Jin, Chaoquan Tang, Eryi Hu, Gongbo Zhou

导航 / SLAM / 自动驾驶感知与传感

摘要

Simultaneous Localization and Mapping (SLAM) in large‐scale, complex, global positioning system (GPS)‐denied underground coal mines poses significant challenges. In these environments, abnormal conditions hinder sensor performance: GPS unavailability impedes scene reconstruction and geographic referencing; uneven or slippery terrain degrades wheel odometer accuracy; and long, feature‐poor tunnels reduce light detection and ranging (LiDAR) effectiveness. To address these challenges, we propose DURAL, a multimodal SLAM framework based on the Iterated Error‐State Kalman Filter that fuses multiple sensors from coal mine robots to overcome individual sensor limitations. First, LiDAR‐inertial odometry is tightly coupled with Ultra‐Wideband (UWB) absolute positioning constraints to establish an absolute coordinate system. Next, the wheel odometer is integrated through tight coupling, enhanced by nonholonomic constraints and vehicle lever arm compensation, to mitigate performance degradation beyond the UWB measurement range. Finally, an adaptive fusion mode switching mechanism dynamically adjusts sensor constraints based on UWB coverage and environmental conditions. Experimental results indicate that our method achieves state‐of‐the‐art accuracy and robustness in both simulated tunnel environments and real‐world underground coal mines. In real‐world experiments, the system attains an absolute pose error of 0.167 m within the UWB range, maintains a relative pose error of 6.53% outside this range, and improves mapping accuracy to 6.456 cm, significantly outperforming existing approaches in challenging mining scenarios.

RA-L 2026-04-10

AuRo 2026-04-07

Dual-Stage LiDAR-Inertial SLAM with Hierarchical Dynamic Object Removal in Dynamic Environments

Xiao Yang, Baicang Guo, Lisheng Jin, Yewei Shi, Hongyu Zhang, Hao Liu, et al.

导航 / SLAM / 自动驾驶感知与传感

RA-L 2026-03-25

AuRo 2026-04-03

Inverse k-visibility for RSSI-based Indoor geometric mapping

Junseo Kim, Matthew Lisondra, Yeganeh Bahoo, Sajad Saeedi

导航 / SLAM / 自动驾驶

JFR 2026-04-08