Markerless model-based trackers

The model-based tracker can consider moving-edges behind the lines of the model (see section Model-based edges tracker). It can also consider keypoints that are detected and tracked on each visible face of the model (see section Model-based keypoint tracker). The tracker can also handle moving-edges and keypoints in an hybrid scheme (see section Model-based hybrid tracker).

While the Model-based edges tracker is appropriate to track untextured object, the Model-based keypoint tracker is more designed to exploit textured objects with edges that are not really visible. The Model-based hybrid tracker is appropriate to track textured objects with visible edges.

In the following sections, we consider the tracking of a tea box modeled in cao format.

Model-based edges tracker

The following example that comes from tutorial-mb-edge-tracker.cpp allows to track the tea box using vpMbEdgeTracker class.

#include <visp/vpDisplayGDI.h>
#include <visp/vpDisplayOpenCV.h>
#include <visp/vpDisplayX.h>
#include <visp/vpImageIo.h>
#include <visp/vpIoTools.h>
#include <visp/vpMbEdgeTracker.h>
#include <visp/vpVideoReader.h>
int main(int argc, char** argv)
{
#if defined(VISP_HAVE_OPENCV) && (VISP_HAVE_OPENCV_VERSION >= 0x020100) || defined(VISP_HAVE_FFMPEG)
  try {
    std::string videoname = "teabox.mpg";
    for (int i=0; i<argc; i++) {
      if (std::string(argv[i]) == "--name")
        videoname = std::string(argv[i+1]);
      else if (std::string(argv[i]) == "--help") {
        std::cout << "\nUsage: " << argv[0] << " [--name <video name>] [--help]\n" << std::endl;
        return 0;
      }
    }
    std::string parentname = vpIoTools::getParent(videoname);
    std::string objectname = vpIoTools::getNameWE(videoname);
    if(! parentname.empty())
       objectname = parentname + "/" + objectname;
    std::cout << "Video name: " << videoname << std::endl;
    std::cout << "Tracker requested config files: " << objectname
              << ".[init,"
#ifdef VISP_HAVE_XML2
              << "xml,"
#endif
              << "cao or wrl]" << std::endl;
    std::cout << "Tracker optional config files: " << objectname << ".[ppm]" << std::endl;
    vpImage<unsigned char> I;
    vpCameraParameters cam;
    vpHomogeneousMatrix cMo;
    vpVideoReader g;
    g.setFileName(videoname);
    g.open(I);
#if defined(VISP_HAVE_X11)
    vpDisplayX display;
#elif defined(VISP_HAVE_GDI)
    vpDisplayGDI display;
#elif defined(VISP_HAVE_OPENCV)
    vpDisplayOpenCV display;
#else
    std::cout << "No image viewer is available..." << std::endl;
    return 0;
#endif
    display.init(I, 100, 100,"Model-based edge tracker");
    vpMbEdgeTracker tracker;
    bool usexml = false;
#ifdef VISP_HAVE_XML2
    if(vpIoTools::checkFilename(objectname + ".xml")) {
      tracker.loadConfigFile(objectname + ".xml");
      usexml = true;
    }
#endif
    if (! usexml) {
      vpMe me;
      me.setMaskSize(5);
      me.setMaskNumber(180);
      me.setRange(8);
      me.setThreshold(10000);
      me.setMu1(0.5);
      me.setMu2(0.5);
      me.setSampleStep(4);
      me.setNbTotalSample(250);
      tracker.setMovingEdge(me);
      cam.initPersProjWithoutDistortion(839, 839, 325, 243);
      tracker.setCameraParameters(cam);
      tracker.setAngleAppear( vpMath::rad(70) );
      tracker.setAngleDisappear( vpMath::rad(80) );
      tracker.setNearClippingDistance(0.1);
      tracker.setFarClippingDistance(100.0);
      tracker.setClipping(tracker.getClipping() | vpMbtPolygon::FOV_CLIPPING);
    }
    tracker.setOgreVisibilityTest(false);
    if(vpIoTools::checkFilename(objectname + ".cao"))
      tracker.loadModel(objectname + ".cao");
    else if(vpIoTools::checkFilename(objectname + ".wrl"))
      tracker.loadModel(objectname + ".wrl");
    tracker.setDisplayFeatures(true);
    tracker.initClick(I, objectname + ".init", true);
    while(! g.end()){
      g.acquire(I);
      vpDisplay::display(I);
      tracker.track(I);
      tracker.getPose(cMo);
      tracker.getCameraParameters(cam);
      tracker.display(I, cMo, cam, vpColor::red, 2);
      vpDisplay::displayFrame(I, cMo, cam, 0.025, vpColor::none, 3);
      vpDisplay::displayText(I, 10, 10, "A click to exit...", vpColor::red);
      vpDisplay::flush(I);
      if (vpDisplay::getClick(I, false))
        break;
    }
    vpDisplay::getClick(I);
#ifdef VISP_HAVE_XML2
    vpXmlParser::cleanup();
#endif
#if defined(VISP_HAVE_COIN) && (COIN_MAJOR_VERSION == 3)
    SoDB::finish();
#endif
  }
  catch(vpException e) {
    std::cout << "Catch an exception: " << e << std::endl;
  }
#else
  (void)argc;
  (void)argv;
  std::cout << "Install OpenCV or ffmpeg and rebuild ViSP to use this example." << std::endl;
#endif
}

The video below shows the result of the tea box model-based edges tracking.

Hereafter is the description of the new lines introduced in this example.

#include <visp/vpMbEdgeTracker.h>

Here we include the header of the vpMbEdgeTracker class that allows to track an object from its cad model using moving-edges. The tracker will use image I and the intrinsic camera parameters cam as input.

vpImage<unsigned char> I;

vpCameraParameters cam;

As output, it will estimate cMo, the pose of the object in the camera frame.

vpHomogeneousMatrix cMo;

Once the input image teabox.pgm is loaded in I, a window is created and initialized with image I. Then we create an instance of the tracker.

vpMbEdgeTracker tracker;

There are then two different ways to initialize the tracker.

The first one, if libxml2 is available, is to read the settings from teabox.xml input file if the file exists.
#ifdef VISP_HAVE_XML2

if(vpIoTools::checkFilename(objectname + ".xml")) {

tracker.loadConfigFile(objectname + ".xml");

usexml = true;

}

#endif

The content of the xml file is the following:
<?xml version="1.0"?>

<conf>

<ecm>

<mask>

<size>5</size>

<nb_mask>180</nb_mask>

</mask>

<range>

<tracking>8</tracking>

</range>

<contrast>

<edge_threshold>10000</edge_threshold>

<mu1>0.5</mu1>

<mu2>0.5</mu2>

</contrast>

</ecm>

<sample>

<step>4</step>

<nb_sample>250</nb_sample>

</sample>

<camera>

<u0>325.66776</u0>

<v0>243.69727</v0>

<px>839.21470</px>

<py>839.44555</py>

</camera>

<face>

<angle_appear>70</angle_appear>

<angle_disappear>80</angle_disappear>

<near_clipping>0.1</near_clipping>

<far_clipping>100</far_clipping>

<fov_clipping>1</fov_clipping>

</face>

</conf>
The second one consists in initializing the parameters directly in the source code:
vpMe me;

me.setMaskSize(5);

me.setMaskNumber(180);

me.setRange(8);

me.setThreshold(10000);

me.setMu1(0.5);

me.setMu2(0.5);

me.setSampleStep(4);

me.setNbTotalSample(250);

tracker.setMovingEdge(me);

cam.initPersProjWithoutDistortion(839, 839, 325, 243);

tracker.setCameraParameters(cam);

tracker.setAngleAppear( vpMath::rad(70) );

tracker.setAngleDisappear( vpMath::rad(80) );

tracker.setNearClippingDistance(0.1);

tracker.setFarClippingDistance(100.0);

tracker.setClipping(tracker.getClipping() | vpMbtPolygon::FOV_CLIPPING);

An important setting concerns the visibility test that is used to determine if a face is visible. Note that moving-edges are tracked only on visible faces. Two different visibility tests are implemented; with or without Ogre ray tracing. The default test is the one without Ogre. The function vpMbEdgeTracker::setOgreVisibilityTest() allow to select which test is to use.

Let us now highlight how the visibility test works. As illustrated in the following figure, the angle $\alpha$ between the normal of the face and the line going from the camera to the center of gravity of the face is used to determine if the face is visible. If we consider two parameters; the angle to determine if a face is appearing $\alpha_{appear}$ , and the angle to determine if the face is disappearing $\alpha_{disappear}$ , a face will be considered as visible if $\alpha \leq \alpha_{disappear}$ . We consider also that a new face is appearing if $\alpha \geq \alpha_{appear}$ . These two parameters can be set either in the xml file:

<conf>
  ...
  <face>
    <angle_appear>70</angle_appear> 
    <angle_disappear>80</angle_disappear> 
  </face>

or in the code:

tracker.setAngleAppear( vpMath::rad(70) );

tracker.setAngleDisappear( vpMath::rad(80) );

Note: When these two angle parameters are not set, their default values set to 89 degrees are used.

Principle of the visibility test used to determine if a face is visible.

When Ogre visibility test is disabled (we recall that this is the default behavior), the algorithm that computes the normal of the face is very simple. It makes the assumption that faces are convex and oriented counter clockwise. Here the face is considered as appearing if $\alpha < 70$ degrees, and disappearing if $\alpha > 80$ degrees. When only moving-edges are used (nor keypoints) and when the object to track is simple like a single box, we suggest as here to disable Ogre visibility test.
tracker.setOgreVisibilityTest(false);
When Ogre visibility test is enabled, the algorithm used to determine the visibility of a face is the same than previously except that once visible faces are detected thanks to their normal, we add an other test to reject faces that are partially occluded by an other one. This additional test is performed using Ogre ray-tracing capability.
tracker.setOgreVisibilityTest(true);

Additionally to the visibility test described above, it is also possible to use clipping. Firstly, the algorithm removes the faces that are not visibles, according to the visibility test used, then it will also remove the faces or parts of the faces that are out of the clipping planes. As illustrated in the following figure, different clipping planes can be enabled.

Camera field of view and clipping planes.

Let's consider two plane categories: the ones belonging to the field of view or FOV (Left, Right, Up and Down), and the Near and Far clipping planes. The FOV planes can be enabled by:

tracker.setClipping(tracker.getClipping() | vpMbtPolygon::FOV_CLIPPING);

which is equivalent to:

tracker.setClipping(vpMbtPolygon::LEFT_CLIPPING 
                  | vpMbtPolygon::RIGHT_CLIPPING
                  | vpMbtPolygon::UP_CLIPPING 
                  | vpMbtPolygon::DOWN_CLIPPING);

Of course, if the user just wants to activate Right and Up clipping, he will use:

tracker.setClipping(vpMbtPolygon::RIGHT_CLIPPING | vpMbtPolygon::UP_CLIPPING);

For the Near and Far clipping it is quite different. Indeed, thoses planes require clipping distances. Here there are two choices, either the user uses default values and activate them with:

tracker.setClipping(vpMbtPolygon::NEAR_CLIPPING | vpMbtPolygon::FAR_CLIPPING);

or the user can specify the distances in meters, which will automatically activate the clipping for thoses planes:

tracker.setNearClippingDistance(0.1);

tracker.setFarClippingDistance(100.0);

It is also possible to enable them in the xml file. This is done with the following lines:

<conf>
  ...
  <face>
    ...
    <near_clipping>0.1</near_clipping>
    <far_clipping>100.0</far_clipping>
    <fov_clipping>0</fov_clipping>
  </face>

Here for simplicity, the user just has the possibility to either activate all the FOV clipping planes or none of them (fov_clipping requires a boolean).

Note: When clipping parameters are not set in the xml file, nor in the code, clipping is not used. Usually clipping is not helpful when the oject to track is simple.

Now we are ready to load the cad model of the object. ViSP supports cad model in cao format or in vrml format. The cao format is a particular format only supported by ViSP. It doesn't require an additional 3rd party rather then vrml format that require Coin 3rd party. We load the cad model in cao format with:

if(vpIoTools::checkFilename(objectname + ".cao"))

tracker.loadModel(objectname + ".cao");

The file teabox.cao describes first the vertices of the box, then the edges that corresponds to the faces. A more complete description of this file is provided in teabox.cao example. The next figure gives the index of the vertices that are defined in teabox.cao.

To load the cad model in vrml the user has to replace the previous line by the following:

else if(vpIoTools::checkFilename(objectname + ".wrl"))

tracker.loadModel(objectname + ".wrl");

As for the cao format, teabox.wrl describes first the vertices of the box, then the edges that corresponds to the faces. A more complete description of this file is provided in teabox.wrl example.

Index of the vertices used to model the tea box in cao format.

Once the model of the object to track is loaded, with the next line the display in the image window of additional drawings in overlay such as the moving edges positions, is then enabled by:

tracker.setDisplayFeatures(true);

Now we have to initialize the tracker. With the next line we choose to use a user interaction.

tracker.initClick(I, objectname + ".init", true);

The user has to click in the image on four vertices with their 3D coordinates defined in the "teabox.init" file. The following image shows where the user has to click.

Image "teabox.ppm" used to help the user to initialize the tracker.

Matched 2D and 3D coordinates are then used to compute an initial pose used to initialize the tracker. Note also that the third optional argument "true" is used here to enable the display of an image that may help the user for the initialization. The name of this image is the same as the "*.init" file except the extension that sould be ".ppm". In our case it will be "teabox.ppm".

The content of teabox.init file that defines 3D coordinates of some points of the model used during user intialization is provided hereafter. Note that all the characters after character '#' are considered as comments.

 4                  # Number of points
 0     0      0     # Point 0
 0.165 0      0     # Point 3
 0.165 0     -0.08  # Point 2
 0.165 0.068 -0.08  # Point 5

We give now the signification of each line of this file:

line 1: Number of 3D points that should be defined in this file. At least 4 points are required. Four is the minimal number of points requested to compute a pose.
line 2: Each point is defined by its 3D coordinates. Here we define the first point with coordinates (0,0,0). In the previous figure it corresponds to vertex 0 of the tea box. This point is also the origin of the frame in which all the points are defined.
line 3: 3D coordinates of vertex 3.
line 4: 3D coordinates of vertex 2.
line 5: 3D coordinates of vertex 5.

Here the user has to click on vertex 0, 3, 2 and 5 in the window that displays image I. From the 3D coordinates defined in teabox.init and the corresponding 2D coordinates of the vertices obtained by user interaction a pose is computed that is than used to initialize the tracker.

Next, in the infinite while loop, after displaying the next image, we track the object on a new image I.

tracker.track(I);

The result of the tracking is a pose cMo that could be obtained by:

tracker.getPose(cMo);

Next lines are used first to retrieve the camera parameters used by the tracker, then to display the visible part of the cad model using red lines with 2 as thickness, and finally to display the object frame at the estimated position cMo. Each axis of the frame are 0.025 meters long. Using vpColor::none indicates that x-axis is displayed in red, y-axis in green, while z-axis in blue. The thickness of the axis is 3.

tracker.getCameraParameters(cam);

tracker.display(I, cMo, cam, vpColor::red, 2);

The last lines are here to free the memory allocated by libxml2 or Coin 3rd party libraries:

#ifdef VISP_HAVE_XML2
    vpXmlParser::cleanup();
#endif
#if defined(VISP_HAVE_COIN) && (COIN_MAJOR_VERSION == 3)
    SoDB::finish();
#endif

Model-based keypoint tracker

The following example that comes from tutorial-mb-klt-tracker.cpp allows to track the tea box using vpMbKltTracker class.

#include <visp/vpDisplayGDI.h>
#include <visp/vpDisplayOpenCV.h>
#include <visp/vpDisplayX.h>
#include <visp/vpImageIo.h>
#include <visp/vpIoTools.h>
#include <visp/vpMbKltTracker.h>
#include <visp/vpVideoReader.h>
int main(int argc, char** argv)
{
#if defined(VISP_HAVE_OPENCV) && (VISP_HAVE_OPENCV_VERSION >= 0x020100)
  try {
    std::string videoname = "teabox.mpg";
    for (int i=0; i<argc; i++) {
      if (std::string(argv[i]) == "--name")
        videoname = std::string(argv[i+1]);
      else if (std::string(argv[i]) == "--help") {
        std::cout << "\nUsage: " << argv[0] << " [--name <video name>] [--help]\n" << std::endl;
        return 0;
      }
    }
    std::string parentname = vpIoTools::getParent(videoname);
    std::string objectname = vpIoTools::getNameWE(videoname);
    if(! parentname.empty())
       objectname = parentname + "/" + objectname;
    std::cout << "Video name: " << videoname << std::endl;
    std::cout << "Tracker requested config files: " << objectname
              << ".[init,"
#ifdef VISP_HAVE_XML2
              << "xml,"
#endif
              << "cao or wrl]" << std::endl;
    std::cout << "Tracker optional config files: " << objectname << ".[ppm]" << std::endl;
    vpImage<unsigned char> I;
    vpCameraParameters cam;
    vpHomogeneousMatrix cMo;
    vpVideoReader g;
    g.setFileName(videoname);
    g.open(I);
#if defined(VISP_HAVE_X11)
    vpDisplayX display;
#elif defined(VISP_HAVE_GDI)
    vpDisplayGDI display;
#elif defined(VISP_HAVE_OPENCV)
    vpDisplayOpenCV display;
#else
    std::cout << "No image viewer is available..." << std::endl;
    return 0;
#endif
    display.init(I, 100, 100,"Model-based keypoint tracker");
    vpMbKltTracker tracker;
    bool usexml = false;
#ifdef VISP_HAVE_XML2
    if(vpIoTools::checkFilename(objectname + ".xml")) {
      tracker.loadConfigFile(objectname + ".xml");
      usexml = true;
    }
#endif
    if (! usexml) {
      tracker.setMaskBorder(5);
      vpKltOpencv klt_settings;
      klt_settings.setMaxFeatures(300);
      klt_settings.setWindowSize(5);
      klt_settings.setQuality(0.015);
      klt_settings.setMinDistance(8);
      klt_settings.setHarrisFreeParameter(0.01);
      klt_settings.setBlockSize(3);
      klt_settings.setPyramidLevels(3);
      tracker.setKltOpencv(klt_settings);
      cam.initPersProjWithoutDistortion(839, 839, 325, 243);
      tracker.setCameraParameters(cam);
      tracker.setAngleAppear( vpMath::rad(70) );
      tracker.setAngleDisappear( vpMath::rad(80) );
      tracker.setNearClippingDistance(0.1);
      tracker.setFarClippingDistance(100.0);
      tracker.setClipping(tracker.getClipping() | vpMbtPolygon::FOV_CLIPPING);
    }
    tracker.setOgreVisibilityTest(true);
    tracker.loadModel(objectname + "-triangle.cao");
    tracker.setDisplayFeatures(true);
    tracker.initClick(I, objectname + ".init", true);
    while(! g.end()){
      g.acquire(I);
      vpDisplay::display(I);
      tracker.track(I);
      tracker.getPose(cMo);
      tracker.getCameraParameters(cam);
      tracker.display(I, cMo, cam, vpColor::red, 2, true);
      vpDisplay::displayFrame(I, cMo, cam, 0.025, vpColor::none, 3);
      vpDisplay::displayText(I, 10, 10, "A click to exit...", vpColor::red);
      vpDisplay::flush(I);
      if (vpDisplay::getClick(I, false))
        break;
    }
    vpDisplay::getClick(I);
#ifdef VISP_HAVE_XML2
    vpXmlParser::cleanup();
#endif
#if defined(VISP_HAVE_COIN) && (COIN_MAJOR_VERSION == 3)
    SoDB::finish();
#endif
  }
  catch(vpException e) {
    std::cout << "Catch an exception: " << e << std::endl;
  }
#else
  (void)argc;
  (void)argv;
  std::cout << "Install OpenCV and rebuild ViSP to use this example." << std::endl;
#endif
}

The video below shows the result of the tea box model-based KLT tracking where keypoints are used as input features.

This example is very similar to the one presented in Model-based edges tracker except that here we use vpMbKltTracker class to track the tea box.

As previously, there are two different ways to initialize the tracker.

The first one, if libxml2 is available, consists in reading the settings from an xml file.
#ifdef VISP_HAVE_XML2

if(vpIoTools::checkFilename(objectname + ".xml")) {

tracker.loadConfigFile(objectname + ".xml");

usexml = true;

}

#endif

The teabox.xml file used here contains the following:
<?xml version="1.0"?>

<conf>

<klt>

<mask_border>5</mask_border>

<max_features>300</max_features>

<window_size>5</window_size>

<quality>0.015</quality>

<min_distance>8</min_distance>

<harris>0.01</harris>

<size_block>3</size_block>

<pyramid_lvl>3</pyramid_lvl>

</klt>

<camera>

<u0>325.66776</u0>

<v0>243.69727</v0>

<px>839.21470</px>

<py>839.44555</py>

</camera>

<face>

<angle_appear>70</angle_appear>

<angle_disappear>80</angle_disappear>

<near_clipping>0.1</near_clipping>

<far_clipping>100</far_clipping>

<fov_clipping>1</fov_clipping>

</face>

</conf>
The second one consists in initializing the parameters directly in the source code:
tracker.setMaskBorder(5);

vpKltOpencv klt_settings;

klt_settings.setMaxFeatures(300);

klt_settings.setWindowSize(5);

klt_settings.setQuality(0.015);

klt_settings.setMinDistance(8);

klt_settings.setHarrisFreeParameter(0.01);

klt_settings.setBlockSize(3);

klt_settings.setPyramidLevels(3);

tracker.setKltOpencv(klt_settings);

cam.initPersProjWithoutDistortion(839, 839, 325, 243);

tracker.setCameraParameters(cam);

tracker.setAngleAppear( vpMath::rad(70) );

tracker.setAngleDisappear( vpMath::rad(80) );

tracker.setNearClippingDistance(0.1);

tracker.setFarClippingDistance(100.0);

tracker.setClipping(tracker.getClipping() | vpMbtPolygon::FOV_CLIPPING);

Note also that in this example we can model the tea box with triangles:
tracker.loadModel("teabox-triangle.cao");

The file teabox-triangle.cao describes first the vertices of the box, then the triangular faces. A more complete description of this file is provided in teabox-triangle.cao example).

Note that this is the only tracker for which lines of the model are not necessary edges of the object.

Model-based hybrid tracker

The following example that comes from tutorial-mb-hybrid-tracker.cpp allows to track the tea box using vpMbEdgeKltTracker class.

#include <visp/vpDisplayGDI.h>
#include <visp/vpDisplayOpenCV.h>
#include <visp/vpDisplayX.h>
#include <visp/vpImageIo.h>
#include <visp/vpIoTools.h>
#include <visp/vpMbEdgeKltTracker.h>
#include <visp/vpVideoReader.h>
int main(int argc, char** argv)
{
#if defined(VISP_HAVE_OPENCV) && (VISP_HAVE_OPENCV_VERSION >= 0x020100)
  try {
    std::string videoname = "teabox.mpg";
    for (int i=0; i<argc; i++) {
      if (std::string(argv[i]) == "--name")
        videoname = std::string(argv[i+1]);
      else if (std::string(argv[i]) == "--help") {
        std::cout << "\nUsage: " << argv[0] << " [--name <video name>] [--help]\n" << std::endl;
        return 0;
      }
    }
    std::string parentname = vpIoTools::getParent(videoname);
    std::string objectname = vpIoTools::getNameWE(videoname);
    if(! parentname.empty())
       objectname = parentname + "/" + objectname;
    std::cout << "Video name: " << videoname << std::endl;
    std::cout << "Tracker requested config files: " << objectname
              << ".[init,"
#ifdef VISP_HAVE_XML2
              << "xml,"
#endif
              << "cao or wrl]" << std::endl;
    std::cout << "Tracker optional config files: " << objectname << ".[ppm]" << std::endl;
    vpImage<unsigned char> I;
    vpCameraParameters cam;
    vpHomogeneousMatrix cMo;
    vpVideoReader g;
    g.setFileName(videoname);
    g.open(I);
#if defined(VISP_HAVE_X11)
    vpDisplayX display(I,100,100,"Model-based hybrid tracker");;
#elif defined(VISP_HAVE_GDI)
    vpDisplayGDI display(I,100,100,"Model-based hybrid tracker");;
#elif defined(VISP_HAVE_OPENCV)
    vpDisplayOpenCV display(I,100,100,"Model-based hybrid tracker");;
#else
    std::cout << "No image viewer is available..." << std::endl;
#endif
    vpMbEdgeKltTracker tracker;
    bool usexml = false;
#ifdef VISP_HAVE_XML2
    if(vpIoTools::checkFilename(objectname + ".xml")) {
      tracker.loadConfigFile(objectname + ".xml");
      usexml = true;
    }
#endif
    if (! usexml) {
      vpMe me;
      me.setMaskSize(5);
      me.setMaskNumber(180);
      me.setRange(8);
      me.setThreshold(10000);
      me.setMu1(0.5);
      me.setMu2(0.5);
      me.setSampleStep(4);
      me.setNbTotalSample(250);
      tracker.setMovingEdge(me);
      tracker.setMaskBorder(5);
      vpKltOpencv klt_settings;
      klt_settings.setMaxFeatures(300);
      klt_settings.setWindowSize(5);
      klt_settings.setQuality(0.015);
      klt_settings.setMinDistance(8);
      klt_settings.setHarrisFreeParameter(0.01);
      klt_settings.setBlockSize(3);
      klt_settings.setPyramidLevels(3);
      tracker.setKltOpencv(klt_settings);
      cam.initPersProjWithoutDistortion(839, 839, 325, 243);
      tracker.setCameraParameters(cam);
      tracker.setAngleAppear( vpMath::rad(70) );
      tracker.setAngleDisappear( vpMath::rad(80) );
      tracker.setNearClippingDistance(0.1);
      tracker.setFarClippingDistance(100.0);
      tracker.setClipping(tracker.getClipping() | vpMbtPolygon::FOV_CLIPPING);
    }
    tracker.setOgreVisibilityTest(true);
    tracker.loadModel(objectname + ".cao");
    tracker.setDisplayFeatures(true);
    tracker.initClick(I, objectname + ".init", true);
    while(! g.end()){
      g.acquire(I);
      vpDisplay::display(I);
      tracker.track(I);
      tracker.getPose(cMo);
      tracker.getCameraParameters(cam);
      tracker.display(I, cMo, cam, vpColor::red, 2, true);
      vpDisplay::displayFrame(I, cMo, cam, 0.025, vpColor::none, 3);
      vpDisplay::displayText(I, 10, 10, "A click to exit...", vpColor::red);
      vpDisplay::flush(I);
      if (vpDisplay::getClick(I, false))
        break;
    }
    vpDisplay::getClick(I);
#ifdef VISP_HAVE_XML2
    vpXmlParser::cleanup();
#endif
#if defined(VISP_HAVE_COIN) && (COIN_MAJOR_VERSION == 3)
    SoDB::finish();
#endif
  }
  catch(vpException e) {
    std::cout << "Catch an exception: " << e << std::endl;
  }
#else
  (void)argc;
  (void)argv;
  std::cout << "Install OpenCV and rebuild ViSP to use this example." << std::endl;
#endif
}

The video below shows the result of the tea box model-based hybrid tracking where moving-edges and keypoints are used as input features.

The source code is very similar to the one described in Model-based edges tracker and Model-based keypoint tracker. It doesn't require additional line by line explanation. We provide just hereafter the content of the teabox.xml file:

<?xml version="1.0"?>
<conf>
  <ecm>
    <mask>
      <size>5</size>
      <nb_mask>180</nb_mask>
    </mask>
    <range>
      <tracking>8</tracking>
    </range>
    <contrast>
      <edge_threshold>10000</edge_threshold>
      <mu1>0.5</mu1>
      <mu2>0.5</mu2>
    </contrast>
  </ecm>
  <sample>
    <step>4</step>
    <nb_sample>250</nb_sample>
  </sample>
  <klt>
    <mask_border>5</mask_border> 
    <max_features>300</max_features> 
    <window_size>5</window_size> 
    <quality>0.015</quality> 
    <min_distance>8</min_distance> 
    <harris>0.01</harris>
    <size_block>3</size_block> 
    <pyramid_lvl>3</pyramid_lvl> 
  </klt>
  <camera>
    <u0>325.66776</u0> 
    <v0>243.69727</v0> 
    <px>839.21470</px> 
    <py>839.44555</py> 
  </camera>
  <face>
    <angle_appear>70</angle_appear> 
    <angle_disappear>80</angle_disappear> 
    <near_clipping>0.1</near_clipping>
    <far_clipping>100</far_clipping>
    <fov_clipping>1</fov_clipping>
  </face>
</conf>

How to model the objects to track

ViSP supports two different ways to describe CAD models, either in cao or in vrml format.

cao format is specific to ViSP. It allows to describe the CAD model of an object using a text file with extension .cao.
vrml format is supported only if Coin 3rd party is installed. This format allows to describe the CAD model of an object using a text file with extension .wrl.

teabox.cao example

The content of the file teabox.cao used in the Model-based edges tracker and Model-based hybrid tracker examples is given here:

 V1
 # 3D Points
 8                  # Number of points
 0     0      0     # Point 0: X Y Z
 0     0     -0.08
 0.165 0     -0.08
 0.165 0      0
 0.165 0.068  0
 0.165 0.068 -0.08
 0     0.068 -0.08
 0     0.068  0     # Point 7
 # 3D Lines
 0                  # Number of lines
 # Faces from 3D lines
 0                  # Number of faces
 # Faces from 3D points
 6                  # Number of faces
 4 0 1 2 3          # Face 0: [number of points] [index of the 3D points]...
 4 1 6 5 2
 4 4 5 6 7
 4 0 3 4 7
 4 5 4 3 2
 4 0 7 6 1          # Face 5
 # 3D cylinders
 0                  # Number of cylinders
 # 3D circles
 0                  # Number of circles

This file describes the model of the tea box corresponding to the next image:

Index of the vertices used to model the tea box in cao format.

We make the choice to describe the faces of the box from the 3D points that correspond to the vertices. We provide now a line by line description of the file. Notice that the characters after the '#' are considered as comments.

line 1: Header of the .cao file
line 3: The model is defined by 8 3D points. Here the 8 points correspond to the 8 vertices of the tea box presented in the previous figure. Thus, next 8 lines define the 3D points coordinates.
line 4: 3D point with coordinate (0,0,0) corresponding to vertex 0 of the tea box. This point is also the origin of the frame in which all the 3D points are defined.
line 5: 3D point with coordinate (0,0,-0.08) corresponding to vertex 1.
line 6 to 11: The other 3D points corresponding to vertices 2 to 7 respectively.
line 13: Number of 3D lines defined from two 3D points. It is possible to introduce 3D lines and then use these lines to define faces from these 3D lines. This is particularly useful to define faces from non-closed polygons. For instance, it can be used to specify the tracking of only 3 edges of a rectangle. Notice also that a 3D line that doesn't belong to a face is always visible and consequently always tracked.
line 15: Number of faces defined from 3D lines. In our teabox example we decide to define all the faces from 3D points, that is why this value is set to 0.
line 17: The number of faces defined by a set of 3D points. Here our teabox has 6 faces. Thus, next 6 lines describe each face from the 3D points defined previously line 4 to 11. Notice here that all the faces defined from 3D points corresponds to closed polygons.
line 18: First face defined by 4 3D points, respectively vertices 0,1,2,3. The orientation of the face is counter clockwise by going from vertex 0 to vertex 1, then 2 and 3. This fixes the orientation of the normal of the face going outside the object.
line 19: Second face also defined by 4 points, respectively vertices 1,6,5,2 to have a counter clockwise orientation.
line 20 to 23: The four other faces of the box.
line 25: Number of 3D cylinders describing the model. Since we model a simple box, the number of cylinders is 0.
line 27: Number of 3D circles describing the model. For the same reason, the number of circles is 0.

teabox-triangle.cao example

The content of the file teabox-triangle.cao used in the Model-based keypoint tracker example is given here:

 V1
 # 3D Points
 8                  # Number of points
 0     0      0     # Point 0: X Y Z
 0     0     -0.08
 0.165 0     -0.08
 0.165 0      0
 0.165 0.068  0
 0.165 0.068 -0.08
 0     0.068 -0.08
 0     0.068  0     # Point 7
 # 3D Lines
 0                  # Number of lines 
 # Faces from 3D lines
 0                  # Number of faces
 # Faces from 3D points
 12                 # Number of faces
 3 0 1 2            # Face 0: [number of points] [index of the 3D points]...
 3 0 2 3
 3 0 3 7
 3 3 4 7
 3 4 5 6 
 3 4 6 7
 3 1 6 5 
 3 1 5 2
 3 5 3 2
 3 5 4 3
 3 7 6 1
 3 7 1 0            # Face 11
 # 3D cylinders
 0                  # Number of cylinders
 # 3D circles
 0                  # Number of circles

This file describes the model of the tea box corresponding to the next image:

Index of the vertices used to model the tea box in cao format with triangles.

Until line 15, the content of this file is similar to the one described in teabox.cao example. Line 17 we specify that the model contains 12 faces. Each face is then described as a triangle.

teabox.wrl example

The content of the teabox.wrl file used in the Model-based edges tracker is given hereafter. This content is to make into relation with teabox.cao described in teabox.cao example.

 #VRML V2.0 utf8
 
 DEF fst_0 Group {
 children [
 
 # Object "teabox"
 Shape {
 
 geometry DEF cube IndexedFaceSet {
 
 coord Coordinate { 
 point [
 0     0      0   ,
 0     0     -0.08,
 0.165 0     -0.08,
 0.165 0      0   ,
 0.165 0.068  0   ,
 0.165 0.068 -0.08,
 0     0.068 -0.08,
 0     0.068  0    ]
 }
 
 coordIndex [
  0,1,2,3,-1,
  1,6,5,2,-1,
  4,5,6,7,-1,
  0,3,4,7,-1,
  5,4,3,2,-1,
  0,7,6,1,-1]}
 }
 
 ]
 }

This file describes the model of the tea box corresponding to the next image:

Index of the vertices used to model the tea box in vrml format.

We provide now a line by line description of the file where the faces of the box are defined from the vertices:

line 1 to 10: Header of the .wrl file.
line 13 to 20: 3D coordinates of the 8 tea box vertices.
line 34 to 29: Each line describe a face. In this example, a face is defined by 4 vertices. For example, the first face join vertices 0,1,2,3. The orientation of the face is counter clockwise by going from vertex 0 to vertex 1, then 2 and 3. This fixes the orientation of the normal of the face going outside the object.

Advanced: How to manipulate the model

The following code shows how to access to the CAD model

to check if a face is visible,
to get the name of the face (only with models in .cao format for the moment)
to check if the level of detail is enable/disable (only with models in .cao format for the moment)
to access to the coordinates of the 3D points used to model a face
from the pose cMo estimated by the tracker to compute the coordinates of the 3D points in the image

vpMbHiddenFaces<vpMbtPolygon> &faces = tracker.getFaces();
std::cout << "Number of faces: " << faces.size() << std::endl;
for (unsigned int i=0; i < faces.size(); i++) {
  std::vector<vpMbtPolygon*> &poly = faces.getPolygon();
  std::cout << "face " << i << " with index: " << poly[i]->getIndex()
      << (poly[i]->getName().empty() ? "" : (" with name: " + poly[i]->getName()))
      << " is " << (poly[i]->isVisible() ? "visible" : "not visible")
      << " and has " << poly[i]->getNbPoint() << " points" 
      << " and LOD is" << (poly[i]->useLod ? "enabled" : "disabled") << std::endl;
      
  for (unsigned j=0; j<poly[i]->getNbPoint(); j++) {
    vpPoint P = poly[i]->getPoint(j);
    P.project(cMo);
    std::cout << " P obj " << j << ": " << P.get_oX() << " " << P.get_oY() << " " << P.get_oZ() << std::endl;
    std::cout << " P cam" << j << ": " << P.get_X() << " " << P.get_Y() << " " << P.get_Z() << std::endl;
    vpImagePoint iP;
    vpMeterPixelConversion::convertPoint(cam, P.get_x(), P.get_y(), iP);
    std::cout << " iP " << j << ": " << iP.get_u() << " " << iP.get_v() << std::endl;
  }
}

Advanced: Level of detail (LOD)

The level of detail (LOD) consists in introducing additional constraints to the visibility check to determine if the features of a face have to be tracked or not. Two parameters are used

the line length (in pixel)
the area of the face (in pixel²), that could be closed or not (you can define an open face by adding all the segments without the last one which closes the face)

The tracker allows to enable/disable the level of detail concept using vpMbTracker::setLod() function. This example permits to set LOD settings to all elements :

tracker.setLod(true);
tracker.setMinLineLengthThresh(40.0);
tracker.setMinPolygonAreaThresh(500.0);

This example permits to set LOD settings to specific elements denominated by his name.

tracker.setLod(false);
tracker.setLod(true, "Left line");
tracker.setLod(true, "Front face");
tracker.setMinLineLengthThresh(35.0, "Left line");
tracker.setMinPolygonAreaThresh(120.0, "Front face");

Furthermore, to set a name to a face see How to set a name to a face.

Advanced: CAD model in cao format

How to model faces from lines

The first thing to do is to declare the differents points. Then you define each segment of the face with the index of the start point and with the index of the end point. Finally, you define the face with the index of the segments which constitute the face.

Note: The way you declare the face segments (clockwise or counter clockwise) will determine the direction of the normal of the face and so will influe on the visibility of the face.

V1
# Left wing model
6                               # Number of points
# 3D points
-4     -3.8     0.7
-6     -8.8     0.2
-12   -21.7    -1.2
-9    -21.7    -1.2
 0.8   -8.8     0.2
 4.6   -3.8     0.7
# 3D lines
6                               # Number of lines
0 1                             # line 0
1 2
2 3
3 4
4 5
5 0                             # line 5
# Faces from 3D lines
1                               # Number of faces defined by lines
6 0 1 2 3 4 5                   # face 0: [number of lines] [index of the lines]...
# Faces from 3D points
0
# 3D cylinders
0
# 3D circles
0

How to model cylinders

The first thing to do is to declare the two points defining the cylinder axis of revolution. Then you declare the cylinder with the index of the points that define the cylinder axis of revolution and with the cylinder radius.

Note: For the level of detail, in a case of a cylinder, this is taking into account by using the length of the axis of revolution to determine the visibility.

Example of a cylinder.

V1
# Cylinder model
2                 # Number of points
# 3D points
16.9 0 0.5        # point 0
-20  0 0.5        # point 1
# 3D lines
0
# Faces from 3D lines
0
# Faces from 3D points
0
# 3D cylinders
1                 # Number of cylinders
0 1 2.4           # cylinder 0: [1st point on revolution axis] [2nd point on revolution axis] [radius]
# 3D circles
0

How to model circles

The first thing to do is to declare three points: one point for the center of the circle and two points on the circle plane (i.e. not necessary located on the perimeter of the circle but on the plane of the circle). Then you declare your circle with the radius and with index of the three points.

Note: The way you declare the two points on the circle plane (clockwise or counter clockwise) will determine the direction of the normal of the circle and so will influe on the visibility of the circle. For the level of detail, in a case of a circle, this is taking into account by using the area of the bounding box of the circle to determine the visibility.

Example of a circle.

V1
# Circle model
3                    # Number of points
# 3D points
-3.4    14.6    1.1  # point 0
-3.4    15.4    1.1
-3.4    14.6    1.8  # point 2
# 3D lines
0
# Faces from 3D lines
0
# Faces from 3D points
0
# 3D cylinders
0
# 3D circles
1                    # Number of circles
0.8 0 2 1            # circle 0: [radius] [circle center] [1st point on circle plane] [2nd point on circle plane]

How to create an hierarchical model

It could be useful to define a complex model instead of using one big model file with all the declaration with different sub-models, each one representing a specific part of the complex model in a specific model file. To create an hierarchical model, the first step is to define all the elementary parts and then regroup them.

Example of a possible hierarchical modelling of a plane.

For example, if we want to have a model of a plane, we could represent as elementary parts the left and right wings, the tailplane (which is constituted of some other parts) and a cylinder for the plane fuselage. The following lines represent the top model of the plane.

V1
# header
# load the different parts of the plane
load("wings.cao")       # load the left and right wings
load("tailplane.cao")
# 3D points
2                       # Number of points
16.9 0 0.5
-20  0 0.5
# 3D lines
0
# Faces from 3D lines
0
# Faces from 3D points
0
# 3D cylinders
1                       # Number of cylinders
0 1 2.4                 # define the plane fuselage as a cylinder
# 3D circles
0

Note: The path to include another model can be expressed as an absolute or a relative path (relative to the file which includes the model).

How to set a name to a face

To exploit the name of a face in the code, see Advanced: Level of detail (LOD).

It could be useful to give a name for a face in a model in order to easily modify his LOD parameters for example, or just for debuging purpose. This is done directly in the model file :

V1
# header
# load the different parts of the plane
load("wings.cao")
load("tailplane.cao")
# 3D points
5                                    # Number of points
16.9    0   0.5
-20     0   0.5
-3.4    14.6    1.1
-3.4    15.4    1.1
-3.4    14.6    1.8
# 3D lines
0
# Faces from 3D lines
0
# Faces from 3D points
0
# 3D cylinders
1                                    # Number of cylinders
0 1 2.4     name=plane_fuselage
# 3D circles
1                                    # Number of circles
0.8 2 4 3   name="right reactor"

Note: If the name contains space characters, it must be surrounded by quotes. You can give a name to all the elements excepts for points.

How to tune the level of detail

As explained in section Advanced: Level of detail (LOD) the parameters of the lod can be set in the source code. They can also be set directly in the configuration file or in the CAD model in cao format.

The following lines show the content of the configuration file :

<?xml version="1.0"?>
<conf>
  <lod>
    <use_lod>1</use_lod>
    <min_line_length_threshold>40</min_line_length_threshold>
    <min_polygon_area_threshold>150</min_polygon_area_threshold>
  </lod>
</conf>

In CAD model file, you can specify the LOD settings to the desired elements :

V1
# header
# load the different parts of the plane
load("wings.cao")
load("tailplane.cao")
# 3D points
5               # number of points
16.9    0   0.5
-20     0   0.5
-3.4    14.6    1.1
-3.4    15.4    1.1
-3.4    14.6    1.8
# 3D lines
0
# Faces from 3D lines
0
# Faces from 3D points
0
# 3D cylinders
1                               # Number of cylinders
0 1 2.4 name=plane_fuselage useLod=true minLineLengthThreshold=100.0
# 3D circles
1                               # Number of circles
0.8 2 4 3   name="right reactor" useLod=true minPolygonAreaThreshold=40.0

Note: The order you call the methods to load the configuration file and to load the CAD model in the code will modify the result of the LOD parameters. Basically, the LOD settings expressed in configuration file will have effect on all the elements in the CAD model while the LOD settings expressed in CAD model will be specific to an element. The natural order would be to load first the configuration file and after the CAD model.

You are now ready to see the next Tutorial: Template tracking.

Table of Contents

Markerless model-based trackers

Model-based edges tracker

Model-based keypoint tracker

Model-based hybrid tracker

How to model the objects to track

teabox.cao example

teabox-triangle.cao example

teabox.wrl example

Advanced: How to manipulate the model

Advanced: Level of detail (LOD)

Advanced: CAD model in cao format

How to model faces from lines

How to model cylinders

How to model circles

How to create an hierarchical model

How to set a name to a face

How to tune the level of detail