research-article

DMHomo: Learning Homography with Diffusion Models

Authors:
Haipeng Li

University of Electronic Science and Technology of China, Chengdu, China

University of Electronic Science and Technology of China, Chengdu, China

0000-0003-3983-9287
View Profile

,
Hai Jiang

Sichuan University, Chengdu, China

Sichuan University, Chengdu, China

0000-0002-7087-6775
View Profile

,
Ao Luo

Megvii Technology, Beijing, China

Megvii Technology, Beijing, China

0000-0003-3494-8062
View Profile

,
Ping Tan

The Hong Kong University of Science and Technology, Hongkong, China

The Hong Kong University of Science and Technology, Hongkong, China

0000-0002-4506-6973
View Profile

,
Haoqiang Fan

Megvii Technology, Beijing, China

Megvii Technology, Beijing, China

0000-0002-7398-6873
View Profile

,
Bing Zeng

University of Electronic Science and Technology of China, Chengdu, China

University of Electronic Science and Technology of China, Chengdu, China

0000-0002-4491-7967
View Profile

,
Shuaicheng Liu

University of Electronic Science and Technology of China, Chengdu, China

University of Electronic Science and Technology of China, Chengdu, China

0000-0002-8815-5335
View Profile

Authors Info & Claims

ACM Transactions on Graphics Volume 43 Issue 3Article No.: 30pp 1–16https://doi.org/10.1145/3652207

Published:09 April 2024Publication History

ACM Transactions on Graphics

Abstract

Supervised homography estimation methods face a challenge due to the lack of adequate labeled training data. To address this issue, we propose DMHomo, a diffusion model-based framework for supervised homography learning. This framework generates image pairs with accurate labels, realistic image content, and realistic interval motion, ensuring that they satisfy adequate pairs. We utilize unlabeled image pairs with pseudo labels such as homography and dominant plane masks, computed from existing methods, to train a diffusion model that generates a supervised training dataset. To further enhance performance, we introduce a new probabilistic mask loss, which identifies outlier regions through supervised training, and an iterative mechanism to optimize the generative and homography models successively. Our experimental results demonstrate that DMHomo effectively overcomes the scarcity of qualified datasets in supervised homography learning and improves generalization to real-world scenes. The code and dataset are available at GitHub ( https://github.com/lhaippp/DMHomo).

Supplemental Material

tog-23-0114-file004.mp4

mp4

55.5 MB

Download

REFERENCES

Balntas Vassileios, Lenc Karel, Vedaldi Andrea, and Mikolajczyk Krystian. 2017. HPatches: A benchmark and evaluation of handcrafted and learned local descriptors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5173–5182.Google Scholar
Bao Fan, Li Chongxuan, Zhu Jun, and Zhang Bo. 2022. Analytic-DPM: An analytic estimate of the optimal reverse variance in diffusion probabilistic models. arXiv preprint arXiv:2201.06503 (2022).Google Scholar
Barath Daniel, Matas Jiri, and Noskova Jana. 2019. MAGSAC: Marginalizing sample consensus. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10197–10205.Google ScholarCross Ref
Barath Daniel, Noskova Jana, Ivashechkin Maksym, and Matas Jiri. 2020. MAGSAC++, a fast, reliable and accurate robust estimator. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1304–1312.Google ScholarCross Ref
Butler Daniel J., Wulff Jonas, Stanley Garrett B., and Black Michael J.. 2012. A naturalistic open source movie for optical flow evaluation. In Proceedings of the European Conference on Computer Vision. 611–625.Google ScholarDigital Library
Cao Si-Yuan, Hu Jianxin, Sheng Zehua, and Shen Hui-Liang. 2022. Iterative deep homography estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1879–1888.Google ScholarCross Ref
Chang Che-Han, Chou Chun-Nan, and Chang Edward Y.. 2017. CLKN: Cascaded Lucas-Kanade networks for image alignment. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2213–2221.Google ScholarCross Ref
Cunningham Padraig and Delany Sarah Jane. 2021. K-nearest neighbour classifiers—A tutorial. ACM Computing Surveys 54, 6 (2021), 1–25.Google ScholarDigital Library
DeTone Daniel, Malisiewicz Tomasz, and Rabinovich Andrew. 2016. Deep image homography estimation. arXiv preprint arXiv:1606.03798 (2016).Google Scholar
DeTone Daniel, Malisiewicz Tomasz, and Rabinovich Andrew. 2018. SuperPoint: Self-supervised interest point detection and description. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 224–236.Google ScholarCross Ref
Dhariwal Prafulla and Nichol Alexander. 2021. Diffusion models beat GANs on image synthesis. Advances in Neural Information Processing Systems 34 (2021), 8780–8794.Google Scholar
Ding Tianjiao, Yang Yunchen, Zhu Zhihui, Robinson Daniel P., Vidal René, Kneip Laurent, and Tsakiris Manolis C.. 2020. Robust homography estimation via dual principal component pursuit. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6080–6089.Google ScholarCross Ref
Dosovitskiy Alexey, Fischer Philipp, Ilg Eddy, Hausser Philip, Hazirbas Caner, Golkov Vladimir, Smagt Patrick Van Der, Cremers Daniel, and Brox Thomas. 2015. FlowNet: Learning optical flow with convolutional networks. In Proceedings of the IEEE International Conference on Computer Vision. 2758–2766.Google ScholarDigital Library
Fischler Martin A. and Bolles Robert C.. 1981. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM 24, 6 (1981), 381–395.Google ScholarDigital Library
Gast Jochen and Roth Stefan. 2018. Lightweight probabilistic deep networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3369–3378.Google ScholarCross Ref
Geiger Andreas, Lenz Philip, and Urtasun Raquel. 2012. Are we ready for autonomous driving? The KITTI Vision Benchmark Suite. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3354–3361.Google ScholarCross Ref
Greff Klaus, Belletti Francois, Beyer Lucas, Doersch Carl, Du Yilun, Duckworth Daniel, Fleet David J., Gnanapragasam Dan, Golemo Florian, Herrmann Charles, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti (Derek) Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi, Matan Sela, Vincent Sitzmann, Austin Stone, Deqing Sun, Suhani Vora, Ziyu Wang, Tianhao Wu, Kwang Moo Yi, Fangcheng Zhong, and Andrea Tagliasacchi. 2022. Kubric: A scalable dataset generator. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3749–3761.Google ScholarCross Ref
Han Yunhui, Luo Kunming, Luo Ao, Liu Jiangyu, Fan Haoqiang, Luo Guiming, and Liu Shuaicheng. 2022. RealFlow: EM-based realistic optical flow dataset generation from videos. In Proceedings of the European Conference on Computer Vision. 288–305.Google ScholarDigital Library
Hartley Richard and Zisserman Andrew. 2003. Multiple View Geometry in Computer Vision. Cambridge University Press.Google ScholarDigital Library
Ho Jonathan, Jain Ajay, and Abbeel Pieter. 2020. Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems 33 (2020), 6840–6851.Google Scholar
Ho Jonathan and Salimans Tim. 2022. Classifier-free diffusion guidance. arXiv preprint arXiv:2207.12598 (2022).Google Scholar
Hong Mingbo, Lu Yuhang, Ye Nianjin, Lin Chunyu, Zhao Qijun, and Liu Shuaicheng. 2022. Unsupervised homography estimation with coplanarity-aware GAN. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 17663–17672.Google ScholarCross Ref
Hoogeboom Emiel, Heek Jonathan, and Salimans Tim. 2023. Simple diffusion: End-to-end diffusion for high resolution images. arXiv preprint arXiv:2301.11093 (2023).Google Scholar
Ilg Eddy, Cicek Ozgun, Galesso Silvio, Klein Aaron, Makansi Osama, Hutter Frank, and Brox Thomas. 2018. Uncertainty estimates and multi-hypotheses networks for optical flow. In Proceedings of the European Conference on Computer Vision. 652–667.Google ScholarDigital Library
Jiang Hai, Li Haipeng, Lu Yuhang, Han Songchen, and Liu Shuaicheng. 2022. Semi-supervised deep large-baseline homography estimation with progressive equivalence constraint. arXiv preprint arXiv:2212.02763 (2022).Google Scholar
Karras Tero, Aittala Miika, Aila Timo, and Laine Samuli. 2022. Elucidating the design space of diffusion-based generative models. arXiv preprint arXiv:2206.00364 (2022).Google Scholar
Kharismawati Dewi Endah, Akbarpour Hadi Ali, Aktar Rumana, Bunyak Filiz, Palaniappan Kannappan, and Kazic Toni. 2020. CorNet: Unsupervised deep homography estimation for agricultural aerial imagery. In Proceedings of the European Conference on Computer Vision. 400–417.Google ScholarDigital Library
Kingma Diederik P. and Ba Jimmy. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google Scholar
Kingma Durk P., Salimans Tim, and Welling Max. 2015. Variational dropout and the local reparameterization trick. Advances in Neural Information Processing Systems 28 (2015), 1–9.Google Scholar
Le Hoang, Liu Feng, Zhang Shu, and Agarwala Aseem. 2020. Deep homography estimation for dynamic scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7652–7661.Google ScholarCross Ref
Li Haipeng, Luo Kunming, and Liu Shuaicheng. 2021. GyroFlow: Gyroscope-guided unsupervised optical flow learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 12869–12878.Google Scholar
Li Haipeng, Luo Kunming, Zeng Bing, and Liu Shuaicheng. 2023. GyroFlow+: Gyroscope-guided unsupervised deep homography and optical flow learning. arXiv preprint arXiv:2301.10018 (2023).Google Scholar
Li Zhengqi and Snavely Noah. 2018. MegaDepth: Learning single-view depth prediction from Internet photos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2041–2050.Google ScholarCross Ref
Lin Tsung-Yi, Maire Michael, Belongie Serge, Hays James, Perona Pietro, Ramanan Deva, Dollár Piotr, and Zitnick C. Lawrence. 2014. Microsoft COCO: Common objects in context. In Proceedings of the European Conference on Computer Vision. 740–755.Google ScholarCross Ref
Liu Shuaicheng, Li Haipeng, Wang Zhengning, Wang Jue, Zhu Shuyuan, and Zeng Bing. 2021a. DeepOIS: Gyroscope-guided deep optical image stabilizer compensation. IEEE Transactions on Circuits and Systems for Video Technology. Published Online, August 9, 2021. DOI: 10.1109/TCSVT.2021.3103281Google Scholar
Liu Shuaicheng, Lu Yuhang, Jiang Hai, Ye Nianjin, Wang Chuan, and Zeng Bing. 2022. Unsupervised global and local homography estimation with motion basis learning. IEEE Transactions on Pattern Analysis and Machine Intelligence. Published Online, November 21, 2022.Google Scholar
Liu Shuaicheng, Ye Nianjin, Wang Chuan, Luo Kunming, Wang Jue, and Sun Jian. 2023b. Content-aware unsupervised deep homography estimation and its extensions. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 3 (2023), 2849–2863.Google Scholar
Liu Shuaicheng, Yuan Lu, Tan Ping, and Sun Jian. 2013. Bundled camera paths for video stabilization. ACM Transactions on Graphics 32, 4 (2013), 1–10.Google ScholarDigital Library
Liu Xihui, Park Dong Huk, Azadi Samaneh, Zhang Gong, Chopikyan Arman, Hu Yuxiao, Shi Humphrey, Rohrbach Anna, and Darrell Trevor. 2023a. More control for free! Image synthesis with semantic diffusion guidance. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 289–299.Google ScholarCross Ref
Liu Zhen, Lin Wenjie, Li Xinpeng, Rao Qing, Jiang Ting, Han Mingyan, Fan Haoqiang, Sun Jian, and Liu Shuaicheng. 2021b. ADNet: Attention-guided deformable convolutional network for high dynamic range imaging. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 463–470.Google ScholarCross Ref
Lowe David G.. 2004. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 2 (2004), 91–110.Google ScholarDigital Library
Lu Cheng, Zhou Yuhao, Bao Fan, Chen Jianfei, Li Chongxuan, and Zhu Jun. 2022. DPM-Solver: A fast ODE solver for diffusion probabilistic model sampling in around 10 steps. arXiv preprint arXiv:2206.00927 (2022).Google Scholar
Luo Ziwei, Gustafsson Fredrik K., Zhao Zheng, Sjölund Jens, and Schön Thomas B.. 2023. Image restoration with mean-reverting stochastic differential equations. arXiv preprint arXiv:2301.11699 (2023).Google Scholar
Menze Moritz and Geiger Andreas. 2015. Object scene flow for autonomous vehicles. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3061–3070.Google ScholarCross Ref
Mildenhall Ben, Srinivasan Pratul P., Tancik Matthew, Barron Jonathan T., Ramamoorthi Ravi, and Ng Ren. 2021. NeRF: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM 65, 1 (2021), 99–106.Google ScholarDigital Library
Mur-Artal Raul, Montiel Jose Maria Martinez, and Tardos Juan D.. 2015. ORB-SLAM: A versatile and accurate monocular SLAM system. IEEE Transactions on Robotics 31, 5 (2015), 1147–1163.Google ScholarDigital Library
Nguyen Ty, Chen Steven W., Shivakumar Shreyas S., Taylor Camillo Jose, and Kumar Vijay. 2018. Unsupervised deep homography: A fast and robust homography estimation model. IEEE Robotics and Automation Letters 3, 3 (2018), 2346–2353.Google ScholarCross Ref
Park Keunhong, Sinha Utkarsh, Barron Jonathan T., Bouaziz Sofien, Goldman Dan B., Seitz Steven M., and Martin-Brualla Ricardo. 2021. Nerfies: Deformable neural radiance fields. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 5865–5874.Google ScholarCross Ref
Pumarola Albert, Corona Enric, Pons-Moll Gerard, and Moreno-Noguer Francesc. 2021. D-NeRF: Neural radiance fields for dynamic scenes. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10318– 10327.Google ScholarCross Ref
Raistrick Alexander, Lipson Lahav, Ma Zeyu, Mei Lingjie, Wang Mingzhe, Zuo Yiming, Kayan Karhan, Wen Hongyu, Han Beining, Wang Yihan, Alejandro Newell, Hei Law, Ankit Goyal, Kaiyu Yang, and Jia Deng. 2023. Infinite photorealistic worlds using procedural generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12630–12641.Google ScholarCross Ref
Rombach Robin, Blattmann Andreas, Lorenz Dominik, Esser Patrick, and Ommer Björn. 2022. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10684–10695.Google ScholarCross Ref
Rublee Ethan, Rabaud Vincent, Konolige Kurt, and Bradski Gary. 2011. ORB: An efficient alternative to SIFT or SURF. In Proceedings of the IEEE International Conference on Computer Vision. 2564–2571.Google ScholarDigital Library
Saharia Chitwan, Ho Jonathan, Chan William, Salimans Tim, Fleet David J., and Norouzi Mohammad. 2022. Image super-resolution via iterative refinement. IEEE Transactions on Pattern Analysis and Machine Intelligence. Published Online, September 12, 2022.Google ScholarDigital Library
Salimans Tim and Ho Jonathan. 2022. Progressive distillation for fast sampling of diffusion models. arXiv preprint arXiv:2202.00512 (2022).Google Scholar
Sarlin Paul-Edouard, DeTone Daniel, Malisiewicz Tomasz, and Rabinovich Andrew. 2020. SuperGlue: Learning feature matching with graph neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4938–4947.Google ScholarCross Ref
Saurer Olivier, Fraundorfer Friedrich, and Pollefeys Marc. 2012. Homography based visual odometry with known vertical direction and weak Manhattan world assumption. In Proceedings of the Vicomor Workshop at IROS, Vol. 2012.Google Scholar
Shao Ruizhi, Wu Gaochang, Zhou Yuemei, Fu Ying, Fang Lu, and Liu Yebin. 2021. LocalTrans: A multiscale local transformer network for cross-resolution homography estimation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 14890–14899.Google Scholar
Sohl-Dickstein Jascha, Weiss Eric, Maheswaranathan Niru, and Ganguli Surya. 2015. Deep unsupervised learning using nonequilibrium thermodynamics. In Proceedings of the International Conference on Machine Learning. 2256–2265.Google Scholar
Song Jiaming, Meng Chenlin, and Ermon Stefano. 2020a. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502 (2020).Google Scholar
Song Yang and Ermon Stefano. 2019. Generative modeling by estimating gradients of the data distribution. Advances in Neural Information Processing Systems 32 (2019), 1–13.Google Scholar
Song Yang, Shen Liyue, Xing Lei, and Ermon Stefano. 2021. Solving inverse problems in medical imaging with score-based generative models. arXiv preprint arXiv:2111.08005 (2021).Google Scholar
Song Yang, Sohl-Dickstein Jascha, Kingma Diederik P., Kumar Abhishek, Ermon Stefano, and Poole Ben. 2020b. Score-based generative modeling through stochastic differential equations. arXiv preprint arXiv:2011.13456 (2020).Google Scholar
Sun Jiaming, Shen Zehong, Wang Yuang, Bao Hujun, and Zhou Xiaowei. 2021. LoFTR: Detector-free local feature matching with transformers. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 8922–8931.Google ScholarCross Ref
Tevet Guy, Raab Sigal, Gordon Brian, Shafir Yonatan, Cohen-Or Daniel, and Bermano Amit H.. 2022. Human motion diffusion model. arXiv preprint arXiv:2209.14916 (2022).Google Scholar
Tian Yurun, Yu Xin, Fan Bin, Wu Fuchao, Heijnen Huub, and Balntas Vassileios. 2019. SOSNet: Second order similarity regularization for local descriptor learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 11016–11025.Google ScholarCross Ref
Truong Prune, Danelljan Martin, and Timofte Radu. 2020. GLU-Net: Global-local universal network for dense flow and correspondences. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 6258–6268.Google ScholarCross Ref
Truong Prune, Danelljan Martin, Gool Luc Van, and Timofte Radu. 2021. Learning accurate dense correspondences and when to trust them. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5714– 5724.Google ScholarCross Ref
Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Łukasz, and Polosukhin Illia. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30 (2017), 1–11.Google Scholar
Wang Yinhuai, Yu Jiwen, and Zhang Jian. 2022. Zero-shot image restoration using denoising diffusion null-space model. arXiv preprint arXiv:2212.00490 (2022).Google Scholar
Wu Shangzhe, Xu Jiarui, Tai Yu-Wing, and Tang Chi-Keung. 2018. Deep high dynamic range imaging with large foreground motions. In Proceedings of the European Conference on Computer Vision. 117–132.Google ScholarDigital Library
Ye Nianjin, Wang Chuan, Fan Haoqiang, and Liu Shuaicheng. 2021. Motion basis learning for unsupervised deep homography estimation with subspace projection. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 13117–13125.Google Scholar
Yi Kwang Moo, Trulls Eduard, Lepetit Vincent, and Fua Pascal. 2016. LIFT: Learned Invariant Feature Transform. In Proceedings of the European Conference on Computer Vision. 467–483.Google ScholarCross Ref
Yu Jason J., Harley Adam W., and Derpanis Konstantinos G.. 2016. Back to basics: Unsupervised learning of optical flow via brightness constancy and motion smoothness. In Computer Vision—ECCV 2016 Workshops. Lecture Notes in Computer Science, Vol. 9915. Springer, 3–10.Google Scholar
Zhang Jirong, Wang Chuan, Liu Shuaicheng, Jia Lanpeng, Ye Nianjin, Wang Jue, Zhou Ji, and Sun Jian. 2020. Content-aware unsupervised deep homography estimation. In Proceedings of the European Conference on Computer Vision. 653–669.Google ScholarDigital Library

Index Terms

DMHomo: Learning Homography with Diffusion Models
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Image and video acquisition
        Computational photography

Recommendations

Content-Aware Unsupervised Deep Homography Estimation
Computer Vision – ECCV 2020
Abstract
Homography estimation is a basic image alignment method in many applications. It is usually conducted by extracting and matching sparse feature points, which are error-prone in low-light and low-texture images. On the other hand, previous deep ...
Read More
An adaptive particle filter tracking method based on homography and common FOV
RACS '12: Proceedings of the 2012 ACM Research in Applied Computation Symposium

In object tracking, methods based on a particle filter are widely used, but the technique alone often fails in various situations. Sometimes multi-camera systems using homography are tried to solve problems like occlusion. We propose an adaptive ...
Read More
Homography-based block motion estimation for video coding of PTZ cameras

We propose a homography-based search (HBS) algorithm for block motion estimation.We use optical flow tracking algorithm to obtain homography between two frames.Adaptive thresholds are adopted in our method to classify different kinds of blocks. Due to ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Graphics Volume 43, Issue 3
June 2024
160 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/3613683
Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 9 April 2024
- Online AM: 11 March 2024
- Accepted: 27 February 2024
- Revised: 20 February 2024
- Received: 11 October 2023
Published in tog Volume 43, Issue 3

Check for updates
Author Tags
Homography
diffusion models
image alignment
datasets
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 719
  Total Downloads
- Downloads (Last 12 months)719
- Downloads (Last 6 weeks)321
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

DMHomo: Learning Homography with Diffusion Models

ACM Transactions on Graphics

Abstract

Supplemental Material

REFERENCES

Cited By

Index Terms

Recommendations

Content-Aware Unsupervised Deep Homography Estimation

An adaptive particle filter tracking method based on homography and common FOV

Homography-based block motion estimation for video coding of PTZ cameras

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Full Text

Caption

DMHomo: Learning Homography with Diffusion Models

ACM Transactions on Graphics

Abstract

Supplemental Material

REFERENCES

Cited By

Index Terms

Recommendations

Content-Aware Unsupervised Deep Homography Estimation

An adaptive particle filter tracking method based on homography and common FOV

Homography-based block motion estimation for video coding of PTZ cameras

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Full Text

Share this Publication link

Share on Social Media