TopoTag: A Robust and Scalable Topological Fiducial Marker System

Guangdong Virtual Reality Technology Co., Ltd. (aka. Ximmerse), Guangzhou

IEEE Transactions on Visualization and Computer Graphics (IEEE TVCG 2021)

Abstract

Fiducial markers have been playing an important role in augmented reality (AR), robot navigation, and general applications where the relative pose between a camera and an object is required. Here we introduce TopoTag, a robust and scalable topological fiducial marker system, which supports reliable and accurate pose estimation from a single image. TopoTag uses topological and geometrical information in marker detection to achieve higher robustness. Topological information is extensively used for 2D marker detection, and further corresponding geometrical information for ID decoding. Robust 3D pose estimation is achieved by taking advantage of all TopoTag vertices. Without sacrificing bits for higher recall and precision like previous systems, TopoTag can use full bits for ID encoding. TopoTag supports tens of thousands unique IDs and easily extends to millions of unique tags resulting in massive scalability. We collected a large test dataset including in total 169,713 images for evaluation, involving in-plane and out-of-plane rotation, image blur, different distances and various backgrounds, etc. Experiments on the dataset and real indoor and outdoor scene tests with a rolling shutter camera both show that TopoTag significantly outperforms previous fiducial marker systems in terms of various metrics, including detection accuracy, vertex jitter, pose jitter and accuracy, etc. In addition, TopoTag supports occlusion as long as the main tag topological structure is maintained and allows for flexible shape design where users can customize internal and external marker shapes. Code for our marker design/generation, marker detection, and dataset are available at https://herohuyongtao.github.io/research/publications/topo-tag/.

Paper

Preprint (arXiv) (PDF, 8.9 MB)
Supplemental material (PDF, 0.5 MB)

Code / Tools

TopoTag Generator (EXE, 4 MB)
TopoTag Detector (EXE, 5 MB)
TopoTag Detector (C++ Library, hosted on GitHub)

Result

Comparison of dictionary size, tracking range and detection accuracy

Comparison of pose error and jitter under laboratory test with a global shutter industrial camera (best results are shown in bold and underlined)

Comparison of pose jitter under real scene test with a rolling shutter camera (“---” means detection failure, best results are shown in bold and underlined)

Comparison of pose jitter of real scene test

Dataset

Parameters of used camera (TXT)
Groundtruth of robot positions (CSV)
Collection setup: [Seq-#1 (MP4, 4 MB), Seq-#2 (MP4, 5 MB)]
Samples for TopoTag_4x4_Square: [Seq-#1 (AVI, 13 MB), Seq-#2 (AVI, 16 MB), Seq-#3 (AVI, 99 MB)] (Note: images maybe compressed in video)
Full dataset (TeraBox | Baidu Drive) [single folder, 247 GB, Baidu Drive password: ehep] in images for all tags. You use following links if you want to download data for specific tags from Baidu Drive or corresponding sub-folders from above TeraBox link.
- AprilTag (16h5) (ZIP, 15.4G, password: ja7o)
- AprilTag (25h7) (ZIP, 15.4G, password: fi53)
- AprilTag (25h9) (ZIP, 15.4G, password: hqp9)
- AprilTag (36h9) (ZIP, 15.5G, password: vidm)
- AprilTag (36h11) (ZIP, 15.4G, password: kssb)
- ARToolKit (ZIP, 15.5G, password: ve1f)
- ARToolKitPlus (ZIP, 15.4G, password: c3tm)
- ArUco (16h3) (ZIP, 15.3G, password: 9ig0)
- ArUco (25h7) (ZIP, 15.4G, password: f73x)
- ArUco (36h12) (ZIP, 15.5G, password: ratx)
- ChromaTag (ZIP, 15.5G, password: hhv6)
- RuneTag (ZIP, 15.6G, password: qqz9)
- TopoTag (3x3), Square (ZIP, 15.4G, password: 6tbd)
- TopoTag (4x4), Square (ZIP, 15.5G, password: 39t7)
- TopoTag (3x3), Circle (ZIP, 15.4G, password: 6klu)
- TopoTag (4x4), Circle (ZIP, 15.4G, password: wr9r)

Bibtex

@article{yu2021topotag,
  title={{TopoTag: A Robust and Scalable Topological Fiducial Marker System}},
  author={Yu, Guoxing and Hu, Yongtao and Dai, Jingwen},
  journal={IEEE Transactions on Visualization and Computer Graphics (TVCG)},
  volume={27},
  number={9},
  pages={3769-3780},
  year={2021},
  publisher={IEEE}
}