Deform360 is a massive multi-view visuotactile dataset of 198 everyday deformable objects, captured across 1,980 robotic interactions with 41 synchronized surround-view cameras and bimanual tactile grippers. It provides 215.7 cumulative multi-view hours with markerless 3D particle annotations for benchmarking deformable-object world models, and was accepted at ECCV 2026.

How big is the Deform360 dataset?

Deform360 contains 198 daily-life deformable objects, 1,980 interaction sequences, 41 surround-view cameras, 215.7 cumulative multi-view hours, roughly 23.3 million frames, and 74,850 raw videos captured at 720p and 30 FPS.

How does Deform360 differ from prior deformable-object datasets?

Deform360 substantially increases the scale and sensory richness available for deformable dynamics research: 198 objects, 1,980 interactions, 41 calibrated surround views, tactile sensing, and dense markerless 3D annotations.

What sensors does Deform360 use?

A synchronized rig of 41 RGB cameras provides full 360-degree coverage, complemented by bimanual tactile grippers based on the UMI platform that recover the contact signals usually occluded from the cameras.

How are the 3D annotations in Deform360 produced?

A markerless visuotactile tracking pipeline produces the annotations: per-frame 3D Gaussian Splatting recovers geometry, CoTracker3 2D tracks are lifted to 3D and fused across the 41 views, and physics-informed optimization with tactile constraints enforces physically plausible motion.

Which models were benchmarked on Deform360?

Benchmarks compare Cosmos against ParticleFormer, PGND, and PhysTwin where applicable, using Chamfer distance, track error, PSNR, SSIM, and LPIPS across per-episode, multi-episode, and multi-object settings.

Do 2D video or 3D particle models perform better on deformable dynamics?

There is no universal winner. Explicit 3D priors are effective in the lowest-data setting. On held-out episodes, Cosmos reconstructs appearance best while ParticleFormer predicts future dynamics best. On held-out objects, Cosmos leads PSNR and LPIPS but can drift from commanded actions over long horizons.

Is the Deform360 dataset open source?

Yes. The Deform360 dataset and the full pipeline — capture, reconstruction, perception, and world-model baselines — are released under the MIT License. The dataset is hosted on HuggingFace and is free to use.

        
        ACCEPTED · ECCV 2026
      

Deform360: A Massive Multi-view Visuotactile Dataset for Deformable World Models

198 daily-life deformable objects. 1,980 interactions. 41 surround-view cameras and bimanual tactile grippers — a foundation for benchmarking 2D and 3D world models on real-world deformable dynamics.

Hongyu Li^1,2* Wanjia Fu¹* Xiaoyan Cong¹ Zekun Li¹ Binghao Huang² Hanxiao Jiang² Xintong He¹ Yiqing Liang¹ Rao Fu¹ Tao Lu¹ Srinath Sridhar¹ Kevin A. Smith³ George Konidaris¹ Yunzhu Li²

¹ Brown University ² Columbia University ³ MIT

* Equal contribution

▷ LOOP

Watch it move. Could you predict the next second?

A robot that could imagine how this shirt folds would know how to act on it. Today's robots can't do this reliably — predicting deformable motion remains one of the hardest open problems in physical-world reasoning, with high-dimensional state and contact that is usually occluded. The rest of this page is about why, and what it takes to change that.

01 · MOTIVATION

Why deformable dynamics challenge current world models

Deformable bodies — rope, cloth, plush toys — have theoretically infinite degrees of freedom, and the contact that drives their motion is frequently occluded by the gripper or the object itself. That combination is what makes the next second so hard to predict.

Two paradigms have emerged to model this: predicting dynamics in 2D pixel space (video generation) or in explicit 3D geometric space (particles & meshes). Comparing them requires diverse, large-scale, multi-view data with high-fidelity 3D annotations and tactile contact cues through occlusion.

THE GAP — 2D

Video models scale, but drift

Internet-scale pre-training captures rich appearance, yet long-horizon rollouts suffer 3D and temporal inconsistency.

THE GAP — 3D

3D models add structure, but lack scale

Explicit geometry and structural priors support data-efficient prediction, but current learned 3D models lack comparable massive pre-training.

Existing benchmarks trade off diversity, scale, multi-view coverage, tactile sensing, and annotation fidelity—making controlled comparison difficult.

THE MISSING INGREDIENT

No data could settle it — so we built Deform360

Deform360 contains 215.7 cumulative multi-view hours of synchronized 41-view video and bimanual tactile recordings across 198 everyday deformable objects, paired with dense markerless 3D particle annotations.

A markerless visuotactile tracking pipeline — combining multi-view reconstruction and tactile contact signals — turns those recordings into dense particle annotations, enabling a controlled comparison of 2D video models and 3D particle models on real-world deformable dynamics, and measuring the trade-off between structural priors and scale.

object-overview-zoom.mp4  ·  Fig 1 — infinite zoom-out across the 198-object dataset

FIG · 1

Overview of Deform360. A large-scale multi-view visuotactile dataset of 198 deformable objects across 1,980 interactions, supporting 2D and 3D world models, contact detection, and real-world robot planning.

02 · THE DATASET

A significant increase in scale and sensory richness

Every interaction is captured by a synchronized rig built for full 360° observability — multi-view coverage that substantially reduces the occlusion that limits single-view setups, complemented by tactile sensing for contact regions that remain out of camera view.

Object taxonomy — graded by material response

1D · LINEAR

Ropes, cables & wires

Varying stiffness and thickness.

2D · THIN-SHELL

Fabrics, cloth & paper

Diverse textiles, airbags and thin shells.

3D · VOLUMETRIC

Plush, stuffed & foam

Objects that exhibit large shape change.

198 DAILY-LIFE DEFORMABLE OBJECTS

brics-odroid-015 · cam0 · ep 0  ·  hover to enlarge

These preview clips are downsampled to a low resolution for fast in-browser loading; the released dataset is full-resolution 41-view video.

001 · Rope

002 · Rope Silk

003 · Cable

004 · Rubber Band

005 · Thread

006 · Fur

007 · Feather

008 · Pink Cloth

009 · Yellow Cloth

010 · Orange Cloth

011 · Green Cloth

012 · Hat Cloth

013 · Glove Cloth

014 · Glove Vinyl Cloth

015 · Airbag Cloth

016 · Shirt Cloth

017 · Chessboard Cloth

018 · Trashbag Cloth

019 · Trashbag Plastic Cloth

020 · Cutting Mat Cloth

021 · Bag Cloth

022 · Handkerchief

023 · Cleaning Cloth

024 · Glass Cleaner Cloth

025 · Bag Small Cloth

026 · Sock Cloth

027 · Umbrella Bag Cloth

028 · Ziplog Cloth

029 · Foam Cloth

030 · Bandage Cloth

030 · Foam Flat Cloth

031 · Cotton Cloth

032 · Teabag Cloth

033 · Mask Cloth

034 · Plastic Bag Cloth

035 · Wipe Cloth

036 · Napkin Cloth

037 · Mop Cloth

038 · Black Bag Cloth

038 · Mat Cloth

039 · Bday Hat

040 · Paper Cloth

041 · Wrap Paper Cloth

042 · Necktie Cloth

043 · Dog

044 · Doll

045 · Cat

046 · Sponge

047 · Rectangle Sponge

048 · Butter Sponge

049 · Ball

050 · Boxing

051 · Cube

052 · Rubber Duck

053 · Squeezer

054 · Toothpaste

055 · Lettuce Cloth

056 · Makeup Sponge

057 · Kitchen Sponge

058 · Roll Napkin

059 · Shoe

060 · Bread Cloth

061 · Cup

062 · Banana

063 · Flower

064 · Box

065 · Pita Bread Cloth

066 · Glove Half Black Cloth

067 · Paracord

068 · Nylon Rope

069 · Jump Rope

071 · Climbing Rope

072 · Cotton Clohesline

073 · Shoelace

074 · String

075 · Leather

076 · Rubber Bands

077 · Hemp Rope

078 · Fishing Line

079 · Chain Metal

080 · Wool

081 · Stripe Rope

082 · Curtain Cloth

083 · Blanket Cloth

084 · Apron Cloth

085 · Scarf Cloth

086 · Cotton Scarf Cloth

087 · Plastic Bag Blue Cloth

088 · Snake

089 · Football

090 · Sloth

091 · Net Cloth

092 · Squirrel

093 · Squeezable Fruit

094 · Ring

095 · Watermelon

096 · Octopus

097 · Pillow

098 · Beach Ball Cloth

099 · Teeth

100 · Puppet

102 · Stress Ball

103 · Ice Pack Cloth

104 · Alloy

105 · Clay Cloth

106 · Baking Mat

107 · Mitt Cloth

108 · Drying Mat Cloth

109 · Pouch Cloth

110 · Shower Cap Cloth

111 · Headband Cloth

112 · Wristband Cloth

113 · Collar

114 · Finger Wrap Cloth

115 · Cotton Gauze Cloth

116 · Hydrogel Patch Cloth

117 · Bubble Wrap Cloth

118 · Envelope Cloth

119 · Seal Cloth

120 · Bread Plush

121 · Croissant Plush

122 · Sheets Cloth

123 · Pipe Cleaner

124 · Tulle Fabric Cloth

125 · Rabbit

126 · Jellyfish

127 · Pen Grip

128 · Stress Cube

129 · Pack Coaster Cloth

130 · Stress Donut

131 · Animal Toy

133 · Hiking Glove Cloth

134 · Earplug

135 · Makeup Sponge

136 · Foam Letters Cloth

137 · Kitchen Napkin Cloth

138 · Sponge Stamps

139 · Rubber Ball

140 · Rubber Glove Cloth

141 · Eraser

142 · Shoe Sole Cloth

143 · Silicone Wristband

144 · Jar Opener Cloth

145 · Rubber Toy

146 · Frog

147 · Baking Mold

148 · Crepe Paper Cloth

149 · Sticker Paper Cloth

150 · Shredded Packing Paper Cloth

151 · Parchment Paper Cloth

152 · Slime

153 · Cake

155 · Crystal Slime

156 · Mesh Produce Bag Cloth

157 · Sack Cloth

158 · Jewelry Pouch Cloth

159 · Purse

160 · Hose

161 · Tube

162 · Straw

163 · Bear

164 · Sheep

165 · Glove Yellow Cloth

166 · Glove Green Cloth

167 · Glove Gray Cloth

168 · Cat Big

169 · Pencilcase Cloth

170 · Spider

171 · Penguin

172 · Napkin Case Cloth

173 · Poster Paper Cloth

174 · Chain

175 · Plastic Bag Cloth

176 · Candy Packet Cloth

177 · Bottle Cover

178 · Bottle Accessory Cloth

179 · Towel Black Cloth

180 · Box Big

181 · Belt

182 · Plastic Sheets Cloth

183 · Shower Cap Transparent Cloth

184 · Foam Roller Thick Cloth

185 · Cheese

186 · Monster

187 · White Bear

188 · Foam Roll Small

189 · Bear Big

190 · Monkey

191 · Sloth Green

192 · Fish

193 · Frog

194 · Fish Orange

195 · Hello Kitty Brown

196 · Hello Kitty White

197 · Hand Sanitizer

198 · Kneepad Cloth

199 · Hat

200 · Watch

Numbers on a slide are easy to claim. So don't take ours — reach in and turn the data over yourself.

3D GAUSSIAN SPLATTING

Fully interactive 3D reconstructions

Click any object to load it live — orbit, zoom and pan in real 3D.

1D · 001 · Rope

2D · 008 · Pink cloth

3D · 096 · Octopus

per-frame 3DGS · full set released with the dataset

How it compares

Deform360 substantially increases the scale and sensory richness of real-world deformable benchmarks: 198 objects, 41 calibrated surround views, tactile sensing, and dense markerless 3D annotations.

DATASET

MESH

CALIB

TACTILE

360°

# OBJ

# FRAMES

03 · APPROACH

Per-frame geometry is only half the problem. Contact-induced motion is frequently hidden under a gripper or behind a fold—so how do we recover dense, temporally consistent 3D motion?

From raw video & touch to dense particle motion

A markerless pipeline decouples per-frame geometry from temporal tracking: 3D Gaussian Splatting recovers high-fidelity geometry each frame, 2D tracks are lifted into 3D for multi-view consistency, and tactile signals enforce physical plausibility through occlusion.

Fig 2 — annotation pipeline diagram: multi-view capture, per-frame 3DGS reconstruction, and particle dynamics

Multi-view video + tactile → per-frame 3DGS → markerless 2D tracking → 3D lifting → physics-informed optimization.

{{ p.tag }}

Tactile signals through occlusion

Synchronized tactile sensors measure normal-pressure contact cues that help constrain particle motion where cameras are occluded. Tangential micro-slip remains unobserved.

008 · Pink Cloth

Tactile signal · 2D thin-shell

001 · Rope

Tactile signal · 1D linear

04 · INTERACTIVE 4D

With dense 3D state recovered for every frame, the dynamics come alive — explore them yourself.

Explore the reconstructions in 4D

Each sequence reconstructs both appearance (3DGS) and particle dynamics over time. Orbit, scrub, and inspect — these are live Viser viewers, not videos.

Note: for fast, stable playback in the browser, this viewer shows the Gaussian-Splatting centroids as a point cloud (rather than rendering the full splats), includes 3 of the camera views, and subsamples the tracked points whose motion trails are drawn. Click below to load the live, interactive viewer.

viser · {{ activeFile }}

frustums

{{ seqReady }} · full set of 1,980 available in the release

05 · VIDEO RESULTS

Real-world planning

As a preliminary demonstration, a PhysTwin representation learned from Deform360 is used for model-predictive control on a different xArm robot in another lab. The planner rolls out actions that move cloth and rope toward a goal state, without fine-tuning to the new setup.

GOAL

MPC ROLLOUT

06 · BENCHMARKS

So we arrive back at the question we opened with: to predict a deformable future, should you trust pixels or particles?

2D video vs. 3D particle models

The result depends on the task and data regime. Explicit 3D priors remain effective with very little data; on held-out episodes, ParticleFormer predicts future dynamics best; on held-out objects, pretrained Cosmos delivers the strongest PSNR and LPIPS, but can drift from commanded actions.

{{ r.cat }}

PSNR ↑ · SSIM {{ r.ssim }}

Multi-episode future prediction

Table 4

METHOD

PARADIGM

CD ↓

PSNR ↑

LPIPS ↓

ParticleFormer leads every reported future-prediction metric; Cosmos’s appearance advantage is in reconstruction, not held-out futures. Bold = best in column.

Multi-object generalization (zero-shot)

Table 5

METHOD

PARADIGM

CD ↓

PSNR ↑

LPIPS ↓

On unseen objects, Cosmos leads PSNR and LPIPS; ParticleFormer retains the best geometric CD and track error, which are not defined for Cosmos.

07 · QUALITATIVE

Predicted futures, side by side

Compare selected model rollouts against ground truth under two distinct held-out settings: a novel object or a novel episode.

WHAT IS NOVEL {{ qualSettingTitle }}. {{ qualSettingDescription }}

EXAMPLE

{{ c.label }}

Qualitative future rollouts for the {{ qualName }} under the {{ qualSettingCaption }} setting.

WHAT THE BENCHMARK REVEALS

Structure helps predict; pre-training helps render and transfer.

There is no universal winner. The paper’s three evaluation regimes separate data efficiency, future prediction, and zero-shot visual generalization.

01 · VERY LIMITED DATA

Explicit priors remain effective

PhysTwin leads the per-episode 3D benchmark. Cosmos is not reported because so little data did not support stable post-training.

02 · NOVEL EPISODE

3D structure predicts futures better

Cosmos best reconstructs texture and appearance, but ParticleFormer leads every reported metric on held-out future prediction.

03 · NOVEL OBJECT

2D pre-training improves visual transfer

Cosmos leads PSNR and LPIPS; ParticleFormer retains the best geometric errors. Cosmos can still drift from commanded actions on long rollouts.

For control, explicit state still matters. Particle models expose 3D geometry for objectives such as Chamfer distance. The paper does not deploy Cosmos for MPC because cross-environment appearance shift and reward design directly in video space remain difficult.

Annotation limits. Heavy self-occlusion, highly plastic materials, and slip can still reduce tracking fidelity; the current tactile sensors measure normal pressure and do not directly observe micro-slip.

08 · OPEN SOURCE

Fully open — plug and play

The Deform360 dataset and the full pipeline are open-sourced. Every stage — capture, reconstruction, perception, and the world-model baselines — is modular, so if you have a stronger perception pipeline or a better module, you can swap it in and run against the same benchmark.

Improvements are welcome — open an issue or submit a pull request to push the pipeline forward.

{ } Code & pipeline 🤗 Dataset

09 · CITE

BibTeX

{{ bibtex }}