Visual Servoing Platform  version 3.6.1 under development (2024-12-17)
Tutorial: Markerless generic model-based tracking using AprilTag for

initialization (use case)

Introduction

As stated in Tutorial: Markerless generic model-based tracking using a color camera ViSP allows the tracking of a markerless object using the knowledge of its CAD model while providing its 3D localization when a calibrated camera is used [8], [49].

In this tutorial we will experiment live tracking of a cube observed by an USB camera (webcam) or an Intel Realsense RGB-D camera (SR300 or D400 series).

To automate generic model-based tracker initialization avoiding user click, we use here an AprilTag glued on a cube face. The pose of the tag is used to initialize the tracker; see Tutorial: AprilTag marker detection.

Thanks to a confidence indicator associated to the tracker we are able to detect a tracking failure and continue waiting for an AprilTag new detection to initialize again the tracker in an infinite loop.

Getting started

Build your own cube

The object considered in this tutorial is a cube made of cardboard with an AprilTag sticked on a face. The default length of one side of the cube is 12.5 cm, while the default AprilTag external black square is 8 cm large. As explained in Using an USB camera and in Using an Intel Realsense camera you can build your own object with other sizes and set your cube and tag dimensions as command line parameters.

To print your own tag, follow instructions given in Print an AprilTag marker. Of course, you will need a webcam to run the tutorial on your computer.

Here is an image of the cube we built:

Cube used for the tutorial

Considered classes

This tutorial focuses on vpMbGenericTracker class that was introduced in ViSP 3.1.0 and on vpDetectorAprilTag for initialization of the tracker.

In this tutorial, we will show how to use vpMbGenericTracker class in order to track an object from images acquires by a monocular color camera using moving edges and keypoints as visual features, and how to initialize the tracker using vpDetectorAprilTag class.

Note that all the material (source code and videos) described in this tutorial is part of ViSP source code (in tutorial/tracking/model-based/generic-apriltag folder) and could be found in https://github.com/lagadic/visp/tree/master/tutorial/tracking/model-based/generic-apriltag.

Considered third-parties

Depending on your use case the following optional third-parties may be used by the tracker. Make sure ViSP was build with the appropriate 3rd parties:

  • OpenCV: Essential if you want to acquire images on non Linux platforms (Windows, OSX) and use the keypoints as visual features that are detected and tracked thanks to the KLT tracker.

Pseudo code

The algorithm that is implemented in order to detect the cube location and use this location to initialize the model-based tracker is the following:

main()
cube_size, tag_size, detector_settings, tracker_settings = read_command_line_options()
create_cube_cad_model(cube_size, tag_size)
I = start_image_grabber()
detector = initialize_apriltag_detector(detector_settings)
tracker = initialize_tracker(cad_model, tracker_settings)
state = state_detection
while(state != state_quit)
I = acquire()
display(I);
if (state == state_detection)
state = detector.detect_tag(I)
if (state == state_tracking)
cMo = detector.get_pose()
tracker.set_pose(cMo)
if (state == state_tracking)
state = tracker.track(I)
if (state == state_tracking)
cMo = tracker.get_pose()
display(cMo, cad_model)
if (display(mouse_click))
state = state_quit

If you have an USB camera, this algorithm is implemented in tutorial-mb-generic-tracker-apriltag-webcam.cpp. If you have rather an Intel Realsense RGB-D camera, this algorithm is implemented in tutorial-mb-generic-tracker-apriltag-rs2.cpp

Using an USB camera

If you have an USB camera like a webcam or a laptop camera, you can try to track your cube using tutorial-mb-generic-tracker-apriltag-webcam.cpp source code that is part of ViSP. This example allows to track a cubic box with an AprilTag on a face using either moving edges or keypoints as visual features. This example is an extension of the one explained in Tutorial: Markerless generic model-based tracking using a color camera. To grab images from an USB camera we use either Video 4 Linux driver thanks to vpV4l2Grabber class (a wrapper over libv4l), or OpenCV VideoCapture class if vpV4l2Grabber is not available (on Windows or OSX for example). To install libv4l third-party, follow Recommended 3rd parties. To install OpenCV, depending on your platform follow one of the tutorials given in Installation from source code.

Binary usage

Once build, to see the options that are available, just run:

$ ./tutorial-mb-generic-tracker-apriltag-webcam --help
Usage: ./tutorial-mb-generic-tracker-apriltag-webcam [--input <camera id>] [--cube_size <size in m>] [--tag_size <size in m>] [--quad_decimate <decimation>] [--nthreads <nb>] [--intrinsic <xml intrinsic file>] [--camera_name <camera name in xml file>] [--tag_family <0: TAG_36h11, 1: TAG_36h10, 2: TAG_36ARTOOLKIT, 3: TAG_25h9, 4: TAG_25h7, 5: TAG_16h5>] [--display_off] [--texture] [--projection_error <30 - 100>] [--help]

To test the tracker on a 12.5 cm wide cube that has an AprilTag of size 8 by 8 cm, and enable moving-edges and keypoints as visual features run:

$ ./tutorial-mb-generic-tracker-apriltag-webcam --texture

To test the tracker on a 12.5 cm wide cube that has an AprilTag of size 8 by 8 cm, with only the moving-edges as visual features run:

$ ./tutorial-mb-generic-tracker-apriltag-webcam

By default, the following settings are used:

  • Default camera has id=0. To specify another one, run for example:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --input 1
  • The default size of the cube is 0.125 meter large. To use rather a 0.20 meter large cube, run:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --cube_size 0.20
  • The AprilTag size is 0.08 by 0.08 meters. To change the tag size to let say 0.10 meter square, use:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --tag_size 0.10
  • The AprilTag detection of quads is set to 1.0 by default. You can change it to 2 to speed up the detection using:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --quad_decimate 2
  • The AprilTag default number of threads used is 1. To increase the number of threads used during detection to 2, run:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --nthreads 2
  • The AprilTag default tag family is 36h11. It is possible to use other tags specifying the tag id (0 to 5): 0=TAG_36h11, 1=TAG_36h10, 2=TAG_36ARTOOLKIT, 3=TAG_25h9, 4=TAG_25h7, 5=TAG_16h5). To use tag family 36h10, run:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --tag_family 1
  • The default camera parameters used are: px=600, py=600, and principal point is set at image center. To specify your own parameters, you can provide for example a camera.xml parameters file with a camera named myCam in it, by running:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --intrinsic camera.xml --camera_name myCam
    See Tutorial: Camera intrinsic calibration to see how to generate camera.xml file by calibration your camera.
  • The display is turned on by default, but you can disable it with:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --display_off
  • Tracking failure detection is achieved thresholding the projection error indicator provided by the tracker. The default value of this threshold is set to 40 degrees. To decrease this threshold to 30 degrees (meaning that we accept less projection error and thus trigger a new AprilTag detection more often) you may run:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --projection_error 30

Expected results

  • Tracking using only edges: Running $ ./tutorial-mb-generic-tracker-apriltag-rs2 you should be able to get results similar to these:



Cube tracking result with an USB camera using only moving edges

  • Tracking using and hybrid scheme with edges and keypoints: Enabling keypoints as visual features will lead to an hybrid model-based tracking scheme using $ ./tutorial-mb-generic-tracker-apriltag-rs2 --texture. The corresponding results would rather look like these:



Cube tracking result with an USB camera using moving edges and keypoints

Using an Intel Realsense camera

If you have an Intel Realsense RDB-D camera like an SR300 or a D435, you can try to track your cube using tutorial-mb-generic-tracker-apriltag-rs2.cpp source code that is part of ViSP. This example allows to track a cubic box with an AprilTag on a face using either moving edges, keypoints or depth as visual features. This example is an extension of the one explained in Tutorial: Markerless generic model-based tracking using a RGB-D camera. To grab images from an Intel RealSense device we use vpRealSense2 class. This class needs librealsense 2.x third-party which installation instructions are provided in the tutorials available from Installation from source code.

Binary usage

Once build, to see the options that are available, just run:

$ ./tutorial-mb-generic-tracker-apriltag-rs2 --help
Usage: ./tutorial-mb-generic-tracker-apriltag-rs2 [--cube_size <size in m>] [--tag_size <size in m>] [--quad_decimate <decimation>] [--nthreads <nb>] [--tag_family <0: TAG_36h11, 1: TAG_36h10, 2: TAG_36ARTOOLKIT, 3: TAG_25h9, 4: TAG_25h7, 5: TAG_16h5>] [--display_off] [--texture] [--depth] [--projection_error <30 - 100>] [--help]

To test the tracker on a 12.5 cm wide cube that has an AprilTag of size 8 by 8 cm, and enable moving-edges, keypoints and depth as visual features run:

$ ./tutorial-mb-generic-tracker-apriltag-rs2 --texture --depth

To test the tracker on a 12.5 cm wide cube that has an AprilTag of size 8 by 8 cm, and enable only moving-edges and keypoints as visual features run:

$ ./tutorial-mb-generic-tracker-apriltag-rs2 --texture

To test the tracker on a 12.5 cm wide cube that has an AprilTag of size 8 by 8 cm, using only the moving-edges as visual features run rather:

$ ./tutorial-mb-generic-tracker-apriltag-rs2

By default, the following settings are used:

  • Default size of the cube is 0.125 meter large. To use rather a 0.20 meter large cube, run:
    $ ./tutorial-mb-generic-tracker-apriltag-rs2 --cube_size 0.20
  • The AprilTag size is 0.08 by 0.08 meters. To change the tag size to let say 0.10 meter square, use:
    $ ./tutorial-mb-generic-tracker-apriltag-rs2 --tag_size 0.10
  • The AprilTag detection of quads is set to 1.0 by default. You can change it to 2 to speed up the detection using:
    $ ./tutorial-mb-generic-tracker-apriltag-rs2 --quad_decimate 2
  • The AprilTag default number of threads used is 1. To increase the number of threads used during detection to 2, run:
    $ ./tutorial-mb-generic-tracker-apriltag-rs2 --nthreads 2
  • The AprilTag default tag family is 36h11. It is possible to use other tags specifying the tag id (0 to 5): 0=TAG_36h11, 1=TAG_36h10, 2=TAG_36ARTOOLKIT, 3=TAG_25h9, 4=TAG_25h7, 5=TAG_16h5). To use tag family 36h10, run:
    $ ./tutorial-mb-generic-tracker-apriltag-rs2 --tag_family 1
  • The display is turned on by default, but you can disable it with:
    $ ./tutorial-mb-generic-tracker-apriltag-rs2 --display_off
  • Tracking failure detection is achieved thresholding the projection error indicator provided by the tracker. The default value of this threshold is set to 40 degrees. To decrease this threshold to 30 degrees (meaning that we accept less projection error and thus trigger a new AprilTag detection more often) you may run:
    $ ./tutorial-mb-generic-tracker-apriltag-webcam --projection_error 30

Expected results

  • Tracking using only edges: Running $ ./tutorial-mb-generic-tracker-apriltag-rs2 you should be able to get results similar to these:



Cube tracking result with an Intel Realsense camera using only moving edges

  • Tracking using and hybrid scheme with edges and keypoints: Enabling keypoints as visual features will lead to an hybrid model-based tracking scheme using $ ./tutorial-mb-generic-tracker-apriltag-rs2 --texture. The corresponding results would rather look like these:



Cube tracking result with an Intel Realsense camera using moving edges and keypoints

  • Tracking using and hybrid scheme with edges, keypoints and depth: Enabling keypoints and depth as visual features will lead to an other hybrid model-based tracking scheme using $ ./tutorial-mb-generic-tracker-apriltag-rs2 --texture --depth. The corresponding results would rather look like these:



Cube tracking result with an Intel Realsense camera using moving edges, keypoints and depth

Conclusion

Playing with the above binaries you will see that:

  • using an RGB-D camera and thus depth as visual features makes the tracker more robust
  • using an USB camera and enabling keypoints shows also that the tracker is also more robust than just using moving-edges as visual features

Some recommendations:

  • if possible use rather a textured object of exploit hybrid tracker performance using keypoints as visual features
  • depending on the camera frame rate and thus the displacement of the object between two successive images adapt the range parameter to be sure its value is a little bit larger than the effective displacement in pixels (see Moving-edges settings). This could be achieved modifying in tutorial-mb-generic-tracker-apriltag-webcam.cpp or in tutorial-mb-generic-tracker-apriltag-rs2.cpp the line:
    me.setRange(12);
    setting for example 15 pixels as range (instead of 12).

Hereafter, you have the tracking result using a textured cube (we just glue images over the cube faces):


Textured cube tracking result with an Intel Realsense camera using moving edges, keypoints and depth

Next tutorial

You can follow Tutorial: Object detection and localization to learn how to initialize the tracker without user click, by learning the object to track using keypoints when the object is textured.

There is also Tutorial: Markerless generic model-based tracking using a stereo camera if you want to know how to extend the tracker to use a stereo camera. There is also this other Tutorial: Template tracking.