Introduction to programming with OpenCV

Gady Agam
Department of Computer Science
Illinois Institute of Technology
January 27, 2006

Abstract:
The purpose of this document is to get you started quickly with OpenCV without having to go through lengthy reference manuals. Once you understand these basics you will be able to consult the OpenCV manuals as needed.

Contents

Introduction
  o Description of OpenCV
  o Resources
  o OpenCV naming conventions
  o Compilation instructions
  o Example C Program

GUI commands
  o Window management
  o Input handling

Basic OpenCV data structures
  o Image data structure
  o Matrices and vectors
  o Other data structures

Working with images
  o Allocating and releasing images
  o Reading and writing images
  o Accessing image elements
  o Image conversion
  o Drawing commands

Working with matrices
  o Allocating and releasing matrices
  o Accessing matrix elements
  o Matrix/vector operations

Working with video sequences
  o Capturing a frame from a video sequence
  o Getting/setting frame information
  o Saving a video file

Introduction
Description of OpenCV

General description:
  o Open source computer vision library in C/C++.
  o Optimized and intended for real-time applications.
  o OS/hardware/window-manager independent.
  o Generic image/video loading, saving, and acquisition.
  o Both low and high level API.
  o Provides interface to Intel's Integrated Performance Primitives (IPP) with processor specific optimization (Intel processors).

Features:
  o Image data manipulation (allocation, release, copying, setting, conversion).
  o Image and video I/O (file and camera based input, image/video file output).
  o Matrix and vector manipulation and linear algebra routines (products, solvers, eigenvalues, SVD).
  o Various dynamic data structures (lists, queues, sets, trees, graphs).
  o Basic image processing (filtering, edge detection, corner detection, sampling and interpolation, color conversion, morphological operations, histograms, image pyramids).
  o Structural analysis (connected components, contour processing, distance transform, various moments, template matching, Hough transform, polygonal approximation, line fitting, ellipse fitting, Delaunay triangulation).
  o Camera calibration (finding and tracking calibration patterns, calibration, fundamental matrix estimation, homography estimation, stereo correspondence).
  o Motion analysis (optical flow, motion segmentation, tracking).
  o Object recognition (eigen-methods, HMM).
  o Basic GUI (display image/video, keyboard and mouse handling, scroll-bars).
  o Image labeling (line, conic, polygon, text drawing).

OpenCV modules:
  o cv - Main OpenCV functions.
  o cvaux - Auxiliary (experimental) OpenCV functions.
  o cxcore - Data structures and linear algebra support.
  o highgui - GUI functions.

Resources

Reference manuals:
o <opencv-root>/docs/index.htm

Web resources:
  o Official webpage: http://www.intel.com/technology/computing/opencv/
  o Software download: http://sourceforge.net/projects/opencvlibrary/

Books:
  o Open Source Computer Vision Library by Gary R. Bradski, Vadim Pisarevsky, and Jean-Yves Bouguet, Springer, 1st ed. (June, 2006).

Sample programs for video processing (in <opencv-root>/samples/c/):
  o color tracking: camshiftdemo
  o point tracking: lkdemo
  o motion segmentation: motempl
  o edge detection: laplace

Sample programs for image processing (in <opencv-root>/samples/c/):
  o edge detection: edge
  o segmentation: pyramid_segmentation
  o morphology: morphology
  o histogram: demhist
  o distance transform: distrans
  o ellipse fitting: fitellipse

OpenCV naming conventions

Function naming conventions:

    cvActionTargetMod(...)

    Action = the core functionality (e.g. set, create)
    Target = the target image area (e.g. contour, polygon)
    Mod    = optional modifiers (e.g. argument type)

Matrix data types:

    CV_<bit_depth>(S|U|F)C<number_of_channels>

    S = Signed integer
    U = Unsigned integer
    F = Float

    E.g.: CV_8UC1 means an 8-bit unsigned single-channel matrix,
          CV_32FC2 means a 32-bit float matrix with two channels.

Image data types:

    IPL_DEPTH_<bit_depth>(S|U|F)

    E.g.: IPL_DEPTH_8U means an 8-bit unsigned image.
          IPL_DEPTH_32F means a 32-bit float image.

Header files:
    #include <cv.h>
    #include <cvaux.h>
    #include <highgui.h>
    #include <cxcore.h>   // unnecessary - included in cv.h

Compilation instructions

Linux:
    g++ hello-world.cpp -o hello-world \
        -I /usr/local/include/opencv -L /usr/local/lib \
        -lm -lcv -lhighgui -lcvaux

Windows:
In the project preferences set the path to the OpenCV header files and the path to the OpenCV library files.

Example C Program
    ////////////////////////////////////////////////////////////////////////
    //
    // hello-world.cpp
    //
    // This is a simple, introductory OpenCV program. The program reads an
    // image from a file, inverts it, and displays the result.
    //
    ////////////////////////////////////////////////////////////////////////
    #include <stdlib.h>
    #include <stdio.h>
    #include <math.h>
    #include <cv.h>
    #include <highgui.h>

    int main(int argc, char *argv[])
    {
      IplImage* img = 0;
      int height,width,step,channels;
      uchar *data;
      int i,j,k;

      if(argc<2){
        printf("Usage: main <image-file-name>\n\7");
        exit(0);
      }

      // load an image
      img=cvLoadImage(argv[1]);
      if(!img){
        printf("Could not load image file: %s\n",argv[1]);
        exit(0);
      }

      // get the image data
      height    = img->height;
      width     = img->width;
      step      = img->widthStep;
      channels  = img->nChannels;
      data      = (uchar *)img->imageData;
      printf("Processing a %dx%d image with %d channels\n",height,width,channels);

      // create a window
      cvNamedWindow("mainWin", CV_WINDOW_AUTOSIZE);
      cvMoveWindow("mainWin", 100, 100);

      // invert the image
      for(i=0;i<height;i++) for(j=0;j<width;j++) for(k=0;k<channels;k++)
        data[i*step+j*channels+k]=255-data[i*step+j*channels+k];

      // show the image
      cvShowImage("mainWin", img );

      // wait for a key
      cvWaitKey(0);

      // release the image
      cvReleaseImage(&img );
      return 0;
    }

GUI commands
Window management

Create and position a window:

    cvNamedWindow("win1", CV_WINDOW_AUTOSIZE);
    cvMoveWindow("win1", 100, 100); // offset from the UL corner of the screen

Load an image:
    IplImage* img=0;
    img=cvLoadImage(fileName);
    if(!img) printf("Could not load image file: %s\n",fileName);

Display an image:
cvShowImage("win1",img);

Can display a color or grayscale byte/float image. A byte image is assumed to have values in the range [0..255]. A float image is assumed to have values in the range [0..1]. A color image is assumed to have data in BGR order.

Close a window:
cvDestroyWindow("win1");

Resize a window:
    cvResizeWindow("win1",100,100); // new width/height in pixels

Input handling

Handle mouse events:
  o Define a mouse handler:

    void mouseHandler(int event, int x, int y, int flags, void* param)
    {
      switch(event){
        case CV_EVENT_LBUTTONDOWN:
          if(flags & CV_EVENT_FLAG_CTRLKEY)
            printf("Left button down with CTRL pressed\n");
          break;

        case CV_EVENT_LBUTTONUP:
          printf("Left button up\n");
          break;
      }
    }

    x,y:   pixel coordinates with respect to the UL corner

    event: CV_EVENT_LBUTTONDOWN,   CV_EVENT_RBUTTONDOWN,   CV_EVENT_MBUTTONDOWN,
           CV_EVENT_LBUTTONUP,     CV_EVENT_RBUTTONUP,     CV_EVENT_MBUTTONUP,
           CV_EVENT_LBUTTONDBLCLK, CV_EVENT_RBUTTONDBLCLK, CV_EVENT_MBUTTONDBLCLK,
           CV_EVENT_MOUSEMOVE

    flags: CV_EVENT_FLAG_CTRLKEY, CV_EVENT_FLAG_SHIFTKEY, CV_EVENT_FLAG_ALTKEY,
           CV_EVENT_FLAG_LBUTTON, CV_EVENT_FLAG_RBUTTON,  CV_EVENT_FLAG_MBUTTON

  o Register the handler:

    int mouseParam=5;
    cvSetMouseCallback("win1",mouseHandler,&mouseParam);

Handle keyboard events:
  o The keyboard does not have an event handler.
  o Get keyboard input without blocking:

    int key;
    key=cvWaitKey(10); // wait 10ms for input

  o Get keyboard input with blocking:

    int key;
    key=cvWaitKey(0); // wait indefinitely for input

  o The main keyboard event loop:

    while(1){
      key=cvWaitKey(10);
      if(key==27) break;

      switch(key){
        case 'h':
          ...
          break;
        case 'i':
          ...
          break;
      }
    }

Handle trackbar events:
  o Define a trackbar handler:

    void trackbarHandler(int pos)
    {
      printf("Trackbar position: %d\n",pos);
    }

  o Register the handler:

    int trackbarVal=25;
    int maxVal=100;
    cvCreateTrackbar("bar1", "win1", &trackbarVal ,maxVal , trackbarHandler);

  o Get the current trackbar position:

    int pos = cvGetTrackbarPos("bar1","win1");

  o Set the trackbar position:

    cvSetTrackbarPos("bar1", "win1", 25);

Basic OpenCV data structures

Image data structure

IPL image:
    IplImage
      |-- int  nChannels;     // Number of color channels (1,2,3,4)
      |-- int  depth;         // Pixel depth in bits:
      |                       //   IPL_DEPTH_8U, IPL_DEPTH_8S,
      |                       //   IPL_DEPTH_16U,IPL_DEPTH_16S,
      |                       //   IPL_DEPTH_32S,IPL_DEPTH_32F,
      |                       //   IPL_DEPTH_64F
      |-- int  width;         // image width in pixels
      |-- int  height;        // image height in pixels
      |-- char* imageData;    // pointer to aligned image data
      |                       // Note that color images are stored in BGR order
      |-- int  dataOrder;     // 0 - interleaved color channels,
      |                       // 1 - separate color channels
      |                       // cvCreateImage can only create interleaved images
      |-- int  origin;        // 0 - top-left origin,
      |                       // 1 - bottom-left origin (Windows bitmaps style)
      |-- int  widthStep;     // size of aligned image row in bytes
      |-- int  imageSize;     // image data size in bytes = height*widthStep
      |-- struct _IplROI *roi;// image ROI. when not NULL specifies image
      |                       // region to be processed.
      |-- char *imageDataOrigin; // pointer to the unaligned origin of image data
      |                          // (needed for correct image deallocation)
      |
      |-- int  align;         // Alignment of image rows: 4 or 8 byte alignment
      |                       // OpenCV ignores this and uses widthStep instead
      |-- char colorModel[4]; // Color model - ignored by OpenCV
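The relation imageSize = height*widthStep can be illustrated without OpenCV. The following is a minimal sketch, assuming each row is padded up to a multiple of 4 bytes; the helpers aligned_row_bytes and image_size_bytes are hypothetical names, not OpenCV functions, and in real code the row size should always be read from widthStep rather than recomputed.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not an OpenCV function): number of bytes in one
   padded image row, assuming rows are rounded up to a multiple of
   `align` bytes. */
size_t aligned_row_bytes(size_t width, size_t channels,
                         size_t bytes_per_channel, size_t align)
{
    assert(align > 0);
    size_t raw = width * channels * bytes_per_channel;
    return (raw + align - 1) / align * align;
}

/* Hypothetical helper: total data size = height * padded row size,
   mirroring imageSize = height*widthStep. */
size_t image_size_bytes(size_t height, size_t row_bytes)
{
    return height * row_bytes;
}
```

Under this assumption a 641-pixel-wide, 3-channel byte image has 1923 bytes of pixel data per row but a padded row of 1924 bytes, which is why pixel addressing must use the row step and not width*channels.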

Matrices and vectors

Matrices:
    CvMat                      // 2D array
      |-- int   type;          // elements type (uchar,short,int,float,double) and flags
      |-- int   step;          // full row length in bytes
      |-- int   rows, cols;    // dimensions
      |-- int   height, width; // alternative dimensions reference
      |-- union data;
          |-- uchar*  ptr;     // data pointer for an unsigned char matrix
          |-- short*  s;       // data pointer for a short matrix
          |-- int*    i;       // data pointer for an integer matrix
          |-- float*  fl;      // data pointer for a float matrix
          |-- double* db;      // data pointer for a double matrix

    CvMatND                    // N-dimensional array
      |-- int   type;          // elements type (uchar,short,int,float,double) and flags
      |-- int   dims;          // number of array dimensions
      |-- union data;
      |   |-- uchar*  ptr;     // data pointer for an unsigned char matrix
      |   |-- short*  s;       // data pointer for a short matrix
      |   |-- int*    i;       // data pointer for an integer matrix
      |   |-- float*  fl;      // data pointer for a float matrix
      |   |-- double* db;      // data pointer for a double matrix
      |
      |-- struct dim[];        // information for each dimension
          |-- size;            // number of elements in a given dimension
          |-- step;            // distance between elements in a given dimension

    CvSparseMat                // SPARSE N-dimensional array

Generic arrays:
    CvArr*     // Used only as a function parameter to specify that the
               // function accepts arrays of more than a single type, such
               // as: IplImage*, CvMat* or even CvSeq*. The particular array
               // type is determined at runtime by analyzing the first 4
               // bytes of the header of the actual array.

Scalars:
CvScalar |-- double val[4]; //4D vector

Initializer function:
CvScalar s = cvScalar(double val0, double val1=0, double val2=0, double val3=0);

Example:
    CvScalar s = cvScalar(20.0);
    s.val[0]=10.0;

Note that the initializer function has the same name as the data structure, except that it starts with a lower case character. It is not a C++ constructor.

Other data structures

Points:
    CvPoint      p = cvPoint(int x, int y);
    CvPoint2D32f p = cvPoint2D32f(float x, float y);
    CvPoint3D32f p = cvPoint3D32f(float x, float y, float z);

    E.g.:
    p.x=5.0;
    p.y=5.0;

Rectangular dimensions:
    CvSize       r = cvSize(int width, int height);
    CvSize2D32f  r = cvSize2D32f(float width, float height);

Rectangular dimensions with offset:

CvRect r = cvRect(int x, int y, int width, int height);

Working with images

Allocating and releasing images

Allocate an image:
    IplImage* cvCreateImage(CvSize size, int depth, int channels);

    size:     cvSize(width,height);

    depth:    pixel depth in bits: IPL_DEPTH_8U, IPL_DEPTH_8S, IPL_DEPTH_16U,
              IPL_DEPTH_16S, IPL_DEPTH_32S, IPL_DEPTH_32F, IPL_DEPTH_64F

    channels: Number of channels per pixel. Can be 1, 2, 3 or 4. The channels
              are interleaved. The usual data layout of a color image is
              b0 g0 r0 b1 g1 r1 ...

Examples:
    // Allocate a 1-channel byte image
    IplImage* img1=cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,1);

    // Allocate a 3-channel float image
    IplImage* img2=cvCreateImage(cvSize(640,480),IPL_DEPTH_32F,3);

Release an image:
    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,1);
    cvReleaseImage(&img);

Clone an image:
    IplImage* img1=cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,1);
    IplImage* img2;
    img2=cvCloneImage(img1);

Set/get the region of interest:

    void  cvSetImageROI(IplImage* image, CvRect rect);
    void  cvResetImageROI(IplImage* image);
    CvRect cvGetImageROI(const IplImage* image);

The majority of OpenCV functions support ROI.

Set/get the channel of interest:

    void cvSetImageCOI(IplImage* image, int coi); // 0=all
    int  cvGetImageCOI(const IplImage* image);

The majority of OpenCV functions do NOT support COI.

Reading and writing images

Reading an image from a file:

    IplImage* img=0;
    img=cvLoadImage(fileName);
    if(!img) printf("Could not load image file: %s\n",fileName);

    Supported image formats: BMP, DIB, JPEG, JPG, JPE, PNG, PBM, PGM, PPM,
                             SR, RAS, TIFF, TIF

By default, the loaded image is forced to be a 3-channel color image. This default can be modified by using:
    img=cvLoadImage(fileName,flag);

    flag: >0 the loaded image is forced to be a 3-channel color image
          =0 the loaded image is forced to be a 1 channel grayscale image
          <0 the loaded image is loaded as is (with number of channels in the file).

Writing an image to a file:

if(!cvSaveImage(outFileName,img)) printf("Could not save: %s\n",outFileName);

The output file format is determined based on the file name extension.

Accessing image elements

Assume that you need to access the k-th channel of the pixel at the i-th row and j-th column. The row index i is in the range [0, height-1]. The column index j is in the range [0, width-1]. The channel index k is in the range [0, nChannels-1].

Indirect access: (General, but inefficient, access to any type image)
  o For a single-channel byte image:

    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,1);
    CvScalar s;
    s=cvGet2D(img,i,j); // get the (i,j) pixel value
    printf("intensity=%f\n",s.val[0]);
    s.val[0]=111;
    cvSet2D(img,i,j,s); // set the (i,j) pixel value

  o For a multi-channel float (or byte) image:

    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_32F,3);
    CvScalar s;
    s=cvGet2D(img,i,j); // get the (i,j) pixel value
    printf("B=%f, G=%f, R=%f\n",s.val[0],s.val[1],s.val[2]);
    s.val[0]=111;
    s.val[1]=111;
    s.val[2]=111;
    cvSet2D(img,i,j,s); // set the (i,j) pixel value

Direct access: (Efficient access, but error prone)
  o For a single-channel byte image:

    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,1);
    ((uchar *)(img->imageData + i*img->widthStep))[j]=111;

  o For a multi-channel byte image:

    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,3);
    ((uchar *)(img->imageData + i*img->widthStep))[j*img->nChannels + 0]=111; // B
    ((uchar *)(img->imageData + i*img->widthStep))[j*img->nChannels + 1]=112; // G
    ((uchar *)(img->imageData + i*img->widthStep))[j*img->nChannels + 2]=113; // R

  o For a multi-channel float image:

    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_32F,3);
    ((float *)(img->imageData + i*img->widthStep))[j*img->nChannels + 0]=111; // B
    ((float *)(img->imageData + i*img->widthStep))[j*img->nChannels + 1]=112; // G
    ((float *)(img->imageData + i*img->widthStep))[j*img->nChannels + 2]=113; // R

Direct access using a pointer: (Simplified and efficient access under limiting assumptions)
  o For a single-channel byte image:

    IplImage* img  = cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,1);
    int height     = img->height;
    int width      = img->width;
    int step       = img->widthStep/sizeof(uchar);
    uchar* data    = (uchar *)img->imageData;
    data[i*step+j] = 111;

  o For a multi-channel byte image:

    IplImage* img  = cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,3);
    int height     = img->height;
    int width      = img->width;
    int step       = img->widthStep/sizeof(uchar);
    int channels   = img->nChannels;
    uchar* data    = (uchar *)img->imageData;
    data[i*step+j*channels+k] = 111;

  o For a multi-channel float image (assuming a 4-byte alignment):

    IplImage* img  = cvCreateImage(cvSize(640,480),IPL_DEPTH_32F,3);
    int height     = img->height;
    int width      = img->width;
    int step       = img->widthStep/sizeof(float);
    int channels   = img->nChannels;
    float * data   = (float *)img->imageData;
    data[i*step+j*channels+k] = 111;
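The index arithmetic i*step+j*channels+k used above can be tried on a plain buffer without OpenCV. The sketch below is an illustration only: pixel_index and invert_bytes are hypothetical helper names, and the buffer layout (interleaved channels, rows `step` elements apart) is assumed to match the byte-image case described in this section.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not part of OpenCV): index of channel k of the
   pixel at row i, column j in an interleaved buffer whose rows start
   `step` elements apart. */
size_t pixel_index(size_t i, size_t j, size_t k, size_t step, size_t channels)
{
    assert(k < channels);
    return i * step + j * channels + k;
}

/* Invert every sample of an 8-bit interleaved image in place.
   Padding elements at the end of each row are left untouched. */
void invert_bytes(unsigned char *data, size_t height, size_t width,
                  size_t step, size_t channels)
{
    for (size_t i = 0; i < height; i++)
        for (size_t j = 0; j < width; j++)
            for (size_t k = 0; k < channels; k++) {
                size_t idx = pixel_index(i, j, k, step, channels);
                data[idx] = (unsigned char)(255 - data[idx]);
            }
}
```

With a 2x2 three-channel image stored with step = 8, each row holds 6 pixel bytes followed by 2 padding bytes; iterating with the step rather than width*channels is what keeps the second row aligned.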

Direct access using a c++ wrapper: (Simple and efficient access)
  o Define a c++ wrapper for single-channel byte images, multi-channel byte images, and multi-channel float images:

    template<class T> class Image
    {
      private:
      IplImage* imgp;
      public:
      Image(IplImage* img=0) {imgp=img;}
      ~Image(){imgp=0;}
      void operator=(IplImage* img) {imgp=img;}
      inline T* operator[](const int rowIndx) {
        return ((T *)(imgp->imageData + rowIndx*imgp->widthStep));}
    };

    typedef struct{
      unsigned char b,g,r;
    } RgbPixel;

    typedef struct{
      float b,g,r;
    } RgbPixelFloat;

    typedef Image<RgbPixel>       RgbImage;
    typedef Image<RgbPixelFloat>  RgbImageFloat;
    typedef Image<unsigned char>  BwImage;
    typedef Image<float>          BwImageFloat;

  o For a single-channel byte image:

    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,1);
    BwImage imgA(img);
    imgA[i][j] = 111;

  o For a multi-channel byte image:

    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_8U,3);
    RgbImage  imgA(img);
    imgA[i][j].b = 111;
    imgA[i][j].g = 111;
    imgA[i][j].r = 111;

  o For a multi-channel float image:

    IplImage* img=cvCreateImage(cvSize(640,480),IPL_DEPTH_32F,3);
    RgbImageFloat imgA(img);
    imgA[i][j].b = 111;
    imgA[i][j].g = 111;
    imgA[i][j].r = 111;

Image conversion

Convert to a grayscale or color byte-image:

    cvConvertImage(src, dst, flags=0);

    src   = float/byte grayscale/color image
    dst   = byte grayscale/color image
    flags = CV_CVTIMG_FLIP     (flip vertically)
            CV_CVTIMG_SWAP_RB  (swap the R and B channels)

Convert a color image to grayscale:

Using the OpenCV conversion:

cvCvtColor(cimg,gimg,CV_BGR2GRAY); // cimg -> gimg

Using a direct conversion:

    for(i=0;i<cimg->height;i++) for(j=0;j<cimg->width;j++)
      gimgA[i][j]= (uchar)(cimgA[i][j].b*0.114 +
                           cimgA[i][j].g*0.587 +
                           cimgA[i][j].r*0.299);
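The weighted sum in the direct conversion can be checked in isolation. The following sketch applies the same weights (0.114, 0.587, 0.299) to one BGR pixel and to a row of interleaved BGR bytes. The function names are hypothetical, and one deliberate difference from the loop above is assumed here: the result is rounded to nearest instead of truncated, so that pure white maps to exactly 255 regardless of floating-point rounding.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: luminance of one BGR pixel using the weights from
   the direct conversion (B*0.114 + G*0.587 + R*0.299), rounded to
   nearest (an assumption; the loop in the text truncates). */
unsigned char bgr_to_gray(unsigned char b, unsigned char g, unsigned char r)
{
    double y = 0.114 * b + 0.587 * g + 0.299 * r;
    return (unsigned char)(y + 0.5);
}

/* Convert one row of `width` interleaved BGR pixels to grayscale. */
void bgr_row_to_gray(const unsigned char *bgr, unsigned char *gray,
                     size_t width)
{
    assert(bgr != NULL && gray != NULL);
    for (size_t j = 0; j < width; j++)
        gray[j] = bgr_to_gray(bgr[3 * j + 0], bgr[3 * j + 1], bgr[3 * j + 2]);
}
```

Because the weights sum to 1, a gray input pixel (b = g = r) keeps its value, and the green channel contributes the most to the result.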

Convert between color spaces:

    cvCvtColor(src,dst,code); // src -> dst

    code    = CV_<X>2<Y>
    <X>/<Y> = RGB, BGR, GRAY, HSV, YCrCb, XYZ, Lab, Luv, HLS

    e.g.: CV_BGR2GRAY, CV_BGR2HSV, CV_BGR2Lab

Drawing commands

Draw a box:
    // draw a box with red lines of width 1 between (100,100) and (200,200)
    // (colors are given in BGR order, so red is cvScalar(0,0,255))
    cvRectangle(img, cvPoint(100,100), cvPoint(200,200), cvScalar(0,0,255), 1);

Draw a circle:
    // draw a circle at (100,100) with a radius of 20. Use green lines of width 1
    cvCircle(img, cvPoint(100,100), 20, cvScalar(0,255,0), 1);

Draw a line segment:

    // draw a green line of width 1 between (100,100) and (200,200)
    cvLine(img, cvPoint(100,100), cvPoint(200,200), cvScalar(0,255,0), 1);

Draw a set of polylines:

    CvPoint  curve1[]={10,10,  10,100,  100,100,  100,10};
    CvPoint  curve2[]={30,30,  30,130,  130,130,  130,30,  150,10};
    CvPoint* curveArr[2]={curve1, curve2};
    int      nCurvePts[2]={4,5};
    int      nCurves=2;
    int      isCurveClosed=1;
    int      lineWidth=1;

    cvPolyLine(img,curveArr,nCurvePts,nCurves,isCurveClosed,cvScalar(0,255,255),lineWidth);

Draw a set of filled polygons:

cvFillPoly(img,curveArr,nCurvePts,nCurves,cvScalar(0,255,255));

Add text:
    CvFont font;
    double hScale=1.0;
    double vScale=1.0;
    int    lineWidth=1;
    cvInitFont(&font,CV_FONT_HERSHEY_SIMPLEX|CV_FONT_ITALIC, hScale,vScale,0,lineWidth);

    cvPutText (img,"My comment",cvPoint(200,400), &font, cvScalar(255,255,0));

Other possible fonts:

    CV_FONT_HERSHEY_SIMPLEX, CV_FONT_HERSHEY_PLAIN,
    CV_FONT_HERSHEY_DUPLEX, CV_FONT_HERSHEY_COMPLEX,
    CV_FONT_HERSHEY_TRIPLEX, CV_FONT_HERSHEY_COMPLEX_SMALL,
    CV_FONT_HERSHEY_SCRIPT_SIMPLEX, CV_FONT_HERSHEY_SCRIPT_COMPLEX

Working with matrices

Allocating and releasing matrices

General:
  o OpenCV has a C interface to matrix operations. There are many alternatives that have a C++ interface (which is more convenient) and are as efficient as OpenCV.
  o Vectors are obtained in OpenCV as matrices having one of their dimensions as 1.
  o Matrices are stored row by row where each row has a 4 byte alignment.

Allocate a matrix:

    CvMat* cvCreateMat(int rows, int cols, int type);

    type: Type of the matrix elements. Specified in the form
      CV_<bit_depth>(S|U|F)C<number_of_channels>. E.g.: CV_8UC1 means an
      8-bit unsigned single-channel matrix, CV_32SC2 means a 32-bit signed
      matrix with two channels.

    Example:
    CvMat* M = cvCreateMat(4,4,CV_32FC1);

Release a matrix:
    CvMat* M = cvCreateMat(4,4,CV_32FC1);
    cvReleaseMat(&M);

Clone a matrix:
    CvMat* M1 = cvCreateMat(4,4,CV_32FC1);
    CvMat* M2;
    M2=cvCloneMat(M1);

Initialize a matrix:
    double a[] = { 1,  2,  3,  4,
                   5,  6,  7,  8,
                   9, 10, 11, 12 };

    CvMat Ma=cvMat(3, 4, CV_64FC1, a);

Alternatively:
    CvMat Ma;
    cvInitMatHeader(&Ma, 3, 4, CV_64FC1, a);

Initialize a matrix to identity:

    CvMat* M = cvCreateMat(4,4,CV_32FC1);
    cvSetIdentity(M); // does not seem to be working properly

Accessing matrix elements

Assume that you need to access the (i,j) cell of a 2D float matrix.

Indirect matrix element access:

    cvmSet(M,i,j,2.0); // Set M(i,j)
    t = cvmGet(M,i,j); // Get M(i,j)

Direct matrix element access assuming a 4-byte alignment:

    CvMat* M    = cvCreateMat(4,4,CV_32FC1);
    int n       = M->cols;
    float *data = M->data.fl;

    data[i*n+j] = 3.0;

Direct matrix element access assuming possible alignment gaps:

    CvMat* M    = cvCreateMat(4,4,CV_32FC1);
    int   step  = M->step/sizeof(float);
    float *data = M->data.fl;

    (data+i*step)[j] = 3.0;

Direct matrix element access of an initialized matrix:

    double a[16];
    CvMat Ma = cvMat(3, 4, CV_64FC1, a);
    a[i*4+j] = 2.0; // Ma(i,j)=2.0;

Matrix/vector operations

Matrix-matrix operations:
    CvMat *Ma, *Mb, *Mc;
    cvAdd(Ma, Mb, Mc);      // Ma+Mb   -> Mc
    cvSub(Ma, Mb, Mc);      // Ma-Mb   -> Mc
    cvMatMul(Ma, Mb, Mc);   // Ma*Mb   -> Mc

Elementwise matrix operations:

    CvMat *Ma, *Mb, *Mc;
    cvMul(Ma, Mb, Mc);      // Ma.*Mb  -> Mc
    cvDiv(Ma, Mb, Mc);      // Ma./Mb  -> Mc
    cvAddS(Ma, cvScalar(-10.0), Mc); // Ma.-10 -> Mc

Vector products:
    double va[] = {1, 2, 3};
    double vb[] = {0, 0, 1};
    double vc[3];

    CvMat Va=cvMat(3, 1, CV_64FC1, va);
    CvMat Vb=cvMat(3, 1, CV_64FC1, vb);
    CvMat Vc=cvMat(3, 1, CV_64FC1, vc);

    double res=cvDotProduct(&Va,&Vb); // dot product:   Va . Vb -> res
    cvCrossProduct(&Va, &Vb, &Vc);    // cross product: Va x Vb -> Vc

Note that Va, Vb, Vc, must be 3 element vectors in a cross product.
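To see what values the two products yield for the vectors used above, the arithmetic can be written out in plain C. This is a sketch for checking results by hand, not a replacement for the OpenCV calls; dot3 and cross3 are hypothetical names.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: dot product of two 3-element vectors. */
double dot3(const double a[3], const double b[3])
{
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2];
}

/* Hypothetical helper: cross product c = a x b of two 3-element vectors.
   The output must not alias either input. */
void cross3(const double a[3], const double b[3], double c[3])
{
    assert(c != a && c != b);
    c[0] = a[1] * b[2] - a[2] * b[1];
    c[1] = a[2] * b[0] - a[0] * b[2];
    c[2] = a[0] * b[1] - a[1] * b[0];
}
```

For va = {1, 2, 3} and vb = {0, 0, 1}, the dot product is 3 and the cross product is {2, -1, 0}.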

Single matrix operations:

    CvMat *Ma, *Mb;
    cvTranspose(Ma, Mb);      // transpose(Ma) -> Mb (cannot transpose onto self)
    CvScalar t = cvTrace(Ma); // trace(Ma) -> t.val[0]
    double d = cvDet(Ma);     // det(Ma) -> d
    cvInvert(Ma, Mb);         // inv(Ma) -> Mb
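For small matrices the trace and determinant are easy to verify by hand. The sketch below computes both for a 3x3 matrix stored row by row in a flat array, the same layout as the a[i*4+j] indexing used earlier for an initialized matrix. The names trace3 and det3 are hypothetical, and the cofactor expansion shown is simply one standard way to compute a 3x3 determinant; it is not claimed to be how cvDet works internally.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: trace of a 3x3 row-major matrix (sum of diagonal). */
double trace3(const double m[9])
{
    assert(m != NULL);
    return m[0] + m[4] + m[8];
}

/* Hypothetical helper: determinant of a 3x3 row-major matrix by cofactor
   expansion along the first row. */
double det3(const double m[9])
{
    assert(m != NULL);
    return m[0] * (m[4] * m[8] - m[5] * m[7])
         - m[1] * (m[3] * m[8] - m[5] * m[6])
         + m[2] * (m[3] * m[7] - m[4] * m[6]);
}
```

A determinant of zero signals a singular matrix, for which an inverse does not exist.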

Inhomogeneous linear system solver:

    CvMat* A  = cvCreateMat(3,3,CV_32FC1);
    CvMat* x  = cvCreateMat(3,1,CV_32FC1);
    CvMat* b  = cvCreateMat(3,1,CV_32FC1);
    cvSolve(A, b, x);    // solve (Ax=b) for x

Eigen analysis (of a symmetric matrix):

    CvMat* A  = cvCreateMat(3,3,CV_32FC1);
    CvMat* E  = cvCreateMat(3,3,CV_32FC1);
    CvMat* l  = cvCreateMat(3,1,CV_32FC1);
    cvEigenVV(A, E, l);  // l = eigenvalues of A (descending order)
                         // E = corresponding eigenvectors (rows)

Singular value decomposition:

    CvMat* A  = cvCreateMat(3,3,CV_32FC1);
    CvMat* U  = cvCreateMat(3,3,CV_32FC1);
    CvMat* D  = cvCreateMat(3,3,CV_32FC1);
    CvMat* V  = cvCreateMat(3,3,CV_32FC1);
    cvSVD(A, D, U, V, CV_SVD_U_T|CV_SVD_V_T); // A = U D V^T

The flags cause U and V to be returned transposed (does not work well without the transpose flags).

Working with video sequences

Capturing a frame from a video sequence

OpenCV supports capturing images from a camera or a video file (AVI).

Initializing capture from a camera:

    CvCapture* capture = cvCaptureFromCAM(0); // capture from video device #0

Initializing capture from a file:

CvCapture* capture = cvCaptureFromAVI("infile.avi");

Capturing a frame:
    IplImage* img = 0;
    if(!cvGrabFrame(capture)){              // capture a frame
      printf("Could not grab a frame\n\7");
      exit(0);
    }
    img=cvRetrieveFrame(capture);           // retrieve the captured frame

To obtain images from several cameras simultaneously, first grab an image from each camera. Retrieve the captured images after the grabbing is complete.

Releasing the capture source:

cvReleaseCapture(&capture);

Note that the image captured by the device is allocated/released by the capture function. There is no need to release it explicitly.

Getting/setting frame information

Get capture device properties:

    cvQueryFrame(capture); // this call is necessary to get correct
                           // capture properties
    int frameH    = (int) cvGetCaptureProperty(capture, CV_CAP_PROP_FRAME_HEIGHT);
    int frameW    = (int) cvGetCaptureProperty(capture, CV_CAP_PROP_FRAME_WIDTH);
    int fps       = (int) cvGetCaptureProperty(capture, CV_CAP_PROP_FPS);
    int numFrames = (int) cvGetCaptureProperty(capture, CV_CAP_PROP_FRAME_COUNT);

The total frame count is relevant for video files only. It does not seem to be working properly.

Get frame information:

    float posMsec   =       cvGetCaptureProperty(capture, CV_CAP_PROP_POS_MSEC);
    int   posFrames = (int) cvGetCaptureProperty(capture, CV_CAP_PROP_POS_FRAMES);
    float posRatio  =       cvGetCaptureProperty(capture, CV_CAP_PROP_POS_AVI_RATIO);

Get the position of the captured frame in [msec] with respect to the first frame, or get its index where the first frame starts with an index of 0. The relative position (ratio) is 0 in the first frame and 1 in the last frame. This ratio is valid only for capturing images from a file.
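The three position measures are related by simple arithmetic when the frame rate is constant. The sketch below is an assumption-laden illustration of that relationship, not a description of how any OpenCV capture backend computes these properties; frame_to_msec and frame_to_ratio are hypothetical names, and the ratio formula assumes the first frame has index 0 and the last has index num_frames-1.

```c
#include <assert.h>

/* Hypothetical helper: time offset in milliseconds of a frame index,
   assuming a constant frame rate and frame 0 at time 0. */
double frame_to_msec(int frame_index, double fps)
{
    assert(fps > 0.0);
    return 1000.0 * frame_index / fps;
}

/* Hypothetical helper: relative position of a frame index, 0 for the
   first frame and 1 for the last frame of a num_frames-long file. */
double frame_to_ratio(int frame_index, int num_frames)
{
    assert(num_frames > 1);
    return (double)frame_index / (num_frames - 1);
}
```

For a 25 fps file, frame 25 lies one second (1000 msec) after the first frame; in a 100-frame file, frame 99 has a relative position of 1.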

Set the index of the first frame to capture:

    // start capturing from a relative position of 0.9 of a video file
    cvSetCaptureProperty(capture, CV_CAP_PROP_POS_AVI_RATIO, (double)0.9);

This only applies for capturing from a file. It does not seem to be working properly.

Saving a video file

Initializing a video writer:

    CvVideoWriter *writer = 0;
    int isColor = 1;
    int fps     = 25;  // or 30
    int frameW  = 640; // 744 for firewire cameras
    int frameH  = 480; // 480 for firewire cameras
    writer=cvCreateVideoWriter("out.avi",CV_FOURCC('P','I','M','1'),
                               fps,cvSize(frameW,frameH),isColor);

Other possible codec codes:

    CV_FOURCC('P','I','M','1')    = MPEG-1 codec
    CV_FOURCC('M','J','P','G')    = motion-jpeg codec (does not work well)
    CV_FOURCC('M', 'P', '4', '2') = MPEG-4.2 codec
    CV_FOURCC('D', 'I', 'V', '3') = MPEG-4.3 codec
    CV_FOURCC('D', 'I', 'V', 'X') = MPEG-4 codec
    CV_FOURCC('U', '2', '6', '3') = H263 codec
    CV_FOURCC('I', '2', '6', '3') = H263I codec
    CV_FOURCC('F', 'L', 'V', '1') = FLV1 codec

A codec code of -1 will open a codec selection window (on Windows).
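A four-character codec code is a 32-bit integer with one character per byte, the first character in the lowest byte. The sketch below shows that packing in plain C so the codes in the list above can be inspected or printed; fourcc and fourcc_to_chars are hypothetical helper names used for illustration and are not part of OpenCV.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: pack four characters into a 32-bit code,
   first character in the least significant byte. */
uint32_t fourcc(char c1, char c2, char c3, char c4)
{
    return  ((uint32_t)(unsigned char)c1)
          | ((uint32_t)(unsigned char)c2 << 8)
          | ((uint32_t)(unsigned char)c3 << 16)
          | ((uint32_t)(unsigned char)c4 << 24);
}

/* Hypothetical helper: unpack a code into a NUL-terminated string.
   `out` must have room for 5 characters. */
void fourcc_to_chars(uint32_t code, char out[5])
{
    assert(out != NULL);
    for (int n = 0; n < 4; n++)
        out[n] = (char)((code >> (8 * n)) & 0xFF);
    out[4] = '\0';
}
```

Unpacking is useful when logging which codec a writer was opened with, since the raw integer is not readable.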

Writing the video file:

    IplImage* img = 0;
    int i;
    int nFrames = 50;
    for(i=0;i<nFrames;i++){
      cvGrabFrame(capture);          // capture a frame
      img=cvRetrieveFrame(capture);  // retrieve the captured frame
      cvWriteFrame(writer,img);      // add the frame to the file
    }

To view the captured frames during capture, add the following in the loop:
    cvShowImage("mainWin", img);
    key=cvWaitKey(20);           // wait 20 ms

Note that without the 20[msec] delay the captured sequence is not displayed properly.

Releasing the video writer:

cvReleaseVideoWriter(&writer);

No ratings yet
Evaluation of The Heat Transfer Coefficient at The Metal-Mould Interface During Flow
4 pages
Railway Signalling and Telecommunication: Industrial Traning Report
No ratings yet
Railway Signalling and Telecommunication: Industrial Traning Report
44 pages
ANSI B16.5 - Steel Pipe Flanges - Maximum Pressure and Temperature Ratings - Group 1
No ratings yet
ANSI B16.5 - Steel Pipe Flanges - Maximum Pressure and Temperature Ratings - Group 1
1 page
Machine Vision and Image Processing Algorithm - Machine Vision and Image Processing Algorithm Fall 2009 Mario
No ratings yet
Machine Vision and Image Processing Algorithm - Machine Vision and Image Processing Algorithm Fall 2009 Mario
47 pages
Appendix 2 Introduction To Opencv: Speaker: 黃世勳
No ratings yet
Appendix 2 Introduction To Opencv: Speaker: 黃世勳
35 pages
Introduction To Programming With OpenCV
No ratings yet
Introduction To Programming With OpenCV
16 pages
CS221 Artificial Intelligence: Principles & Techniques: Challenge Problem
No ratings yet
CS221 Artificial Intelligence: Principles & Techniques: Challenge Problem
33 pages
Opencv Tutorial: Lecturer: Amir Hossein Khalili
No ratings yet
Opencv Tutorial: Lecturer: Amir Hossein Khalili
32 pages
Designing Applications That See Designing Applications That See Lecture 8: Opencv
No ratings yet
Designing Applications That See Designing Applications That See Lecture 8: Opencv
19 pages
Opencv Beginners
100% (2)
Opencv Beginners
17 pages
Introduction To Programming With OpenCV
No ratings yet
Introduction To Programming With OpenCV
19 pages
VLL March 2009 - Geovany A. Ramirez
No ratings yet
VLL March 2009 - Geovany A. Ramirez
6 pages
Opencv: Electronics Club, Iitk
No ratings yet
Opencv: Electronics Club, Iitk
27 pages
Using Opencv in Microsoft Visual C++: Setting Up Path Environment Variable
No ratings yet
Using Opencv in Microsoft Visual C++: Setting Up Path Environment Variable
6 pages

Introduction To Programming With OpenCV

Uploaded by

Introduction To Programming With OpenCV

Uploaded by


OpenCV naming conventions

Function naming conventions:

Matrix data types:

Image data types:

// unnecessary - included in cv.h

Create and position a window:

Handle mouse events:

Define a mouse handler:

Register the handler:

Handle keyboard events:

Get keyboard input with blocking:

The main keyboard event loop:

Handle trackbar events:

Define a trackbar handler:

Register the handler:

Get the current trackbar position:

Set the trackbar position:

Basic OpenCV data structures

Image data structure

Matrices and vectors

CvSparseMat // SPARSE N-dimensional array

Other data structures

Rectangular dimensions with offset:

Working with images

Set/get the region of interest:

Set/get the channel of interest:

The majority of OpenCV functions do NOT support COI.

Reading and writing images

Reading an image from a file:

Writing an image to a file:

Accessing image elements

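Assume that you need to access the k-th channel of the pixel at the i-th row and j-th column. The row index i is in the range [0, height-1]. The column index j is in the range [0, width-1]. The channel index k is in the range [0, nchannels-1].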

For a multi-channel float (or byte) image:

For a multi-channel byte image:

For a multi-channel float image:

For a multi-channel byte image:

For a multi-channel float image (assuming a 4-byte alignment):

For a single-channel byte image:

For a multi-channel byte image:

For a multi-channel float image:
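The access patterns listed above all reduce to the same address computation: move to the start of row i using the row stride in bytes, then index the interleaved channels. The following is a minimal sketch of that arithmetic in plain C. `SketchImage`, `byte_at` and `float_at` are hypothetical names used for illustration only; the struct mirrors the `IplImage` fields `imageData`, `widthStep` and `nChannels`, but it is not the OpenCV type.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the IplImage fields used here (not the OpenCV type). */
typedef struct {
    int width, height, nChannels;
    int widthStep;    /* bytes per row, including any alignment padding */
    char *imageData;  /* pointer to the first byte of the first row */
} SketchImage;

/* Address of channel k of the pixel at row i, column j in a byte image. */
unsigned char *byte_at(const SketchImage *img, int i, int j, int k)
{
    assert(i >= 0 && i < img->height);
    assert(j >= 0 && j < img->width);
    assert(k >= 0 && k < img->nChannels);
    unsigned char *row =
        (unsigned char *)(img->imageData + (size_t)i * img->widthStep);
    return &row[j * img->nChannels + k];
}

/* Same computation for a float image: the row offset is still in bytes,
   only the element type of the row pointer changes. */
float *float_at(const SketchImage *img, int i, int j, int k)
{
    assert(i >= 0 && i < img->height);
    assert(j >= 0 && j < img->width);
    assert(k >= 0 && k < img->nChannels);
    float *row = (float *)(img->imageData + (size_t)i * img->widthStep);
    return &row[j * img->nChannels + k];
}
```

Using the byte stride for the row offset, rather than width times channels, is what keeps the access correct when rows are padded for alignment.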

Convert to a grayscale or color byte-image:

Convert a color image to grayscale:

Using the OpenCV conversion:

Using a direct conversion:
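A direct conversion can be sketched as a weighted sum of the three color channels of each pixel. The example below is written in plain C on raw buffers rather than on OpenCV images. It assumes interleaved pixels in B, G, R order and the commonly used luma weights 0.114, 0.587 and 0.299; the function name `bgr_to_gray` and its parameters are hypothetical and chosen for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Convert an interleaved 3-channel byte image (assumed B, G, R order)
   to a single-channel byte image. The step arguments are row strides
   in bytes, so padded rows are handled correctly. */
void bgr_to_gray(const unsigned char *bgr, int bgr_step,
                 unsigned char *gray, int gray_step,
                 int width, int height)
{
    assert(bgr != NULL && gray != NULL);
    assert(width >= 0 && height >= 0);
    for (int i = 0; i < height; i++) {
        const unsigned char *src = bgr + (size_t)i * bgr_step;
        unsigned char *dst = gray + (size_t)i * gray_step;
        for (int j = 0; j < width; j++) {
            /* Assumed weights: 0.114 (blue), 0.587 (green), 0.299 (red). */
            double v = 0.114 * src[3 * j]
                     + 0.587 * src[3 * j + 1]
                     + 0.299 * src[3 * j + 2];
            dst[j] = (unsigned char)(v + 0.5); /* round to nearest */
        }
    }
}
```

With an OpenCV image you would pass the image data pointer and its row stride for each buffer; that wiring is left out here to keep the sketch independent of the library.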

Convert between color spaces:

Draw a line segment:

Draw a set of polylines:

Draw a set of filled polygons:

Other possible fonts:

Working with matrices

Initialize a matrix to identity:

Accessing matrix elements

Assume that you need to access the (i,j) cell of a 2D float matrix.

Direct matrix element access assuming a 4-byte alignment:

float *data = M->data.fl; data[i*n+j] = 3.0;

Direct matrix element access assuming possible alignment gaps:
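When rows may be padded, the row stride has to come from the matrix step (bytes per row) rather than from the number of columns. The following plain C sketch shows the idea. `SketchMat` and `mat_at` are hypothetical names; the struct mirrors the `step` and float data pointer of `CvMat`, but it is not the OpenCV type, and the sketch assumes the step is a multiple of `sizeof(float)`.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the CvMat fields used here (not the OpenCV type). */
typedef struct {
    int rows, cols;
    int step;     /* bytes per row; may exceed cols * sizeof(float) */
    float *data;  /* pointer to the first element */
} SketchMat;

/* Address of cell (i, j), honoring possible gaps at the end of each row. */
float *mat_at(const SketchMat *m, int i, int j)
{
    assert(i >= 0 && i < m->rows);
    assert(j >= 0 && j < m->cols);
    assert(m->step % (int)sizeof(float) == 0); /* assumption of this sketch */
    size_t stride = (size_t)m->step / sizeof(float); /* floats per row */
    return m->data + (size_t)i * stride + j;
}
```

For example, `*mat_at(&M, i, j) = 3.0f;` writes the same cell that `data[i*n+j]` would address when there is no padding, because the stride then equals the number of columns.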

Direct matrix element access of an initialized matrix:

Elementwise matrix operations:

Single matrix operations:

Inhomogeneous linear system solver:

Eigen analysis (of a symmetric matrix):

Singular value decomposition:

Working with video sequences

Initializing capture from a file:

// retrieve the captured frame

Releasing the capture source:

Getting/setting frame information

Get capture device properties:

Get frame information:

Set the index of the first frame to capture:

Saving a video file

Initializing a video writer:

Other possible codec codes:

A codec code of -1 will open a codec selection window (on Windows).
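A codec code is a four-character code packed into a single 32-bit integer, with the first character in the lowest byte. The sketch below shows that packing in plain C. To my understanding this is the layout that the `CV_FOURCC` macro produces, but `pack_fourcc` and `unpack_fourcc` are hypothetical helpers written for illustration and are not part of OpenCV.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Pack four characters into one 32-bit code, first character in the low byte. */
uint32_t pack_fourcc(char c1, char c2, char c3, char c4)
{
    return  (uint32_t)(unsigned char)c1
         | ((uint32_t)(unsigned char)c2 << 8)
         | ((uint32_t)(unsigned char)c3 << 16)
         | ((uint32_t)(unsigned char)c4 << 24);
}

/* Recover the four characters as a NUL-terminated string (out holds 5 bytes). */
void unpack_fourcc(uint32_t code, char out[5])
{
    assert(out != NULL);
    for (int n = 0; n < 4; n++)
        out[n] = (char)((code >> (8 * n)) & 0xFF);
    out[4] = '\0';
}
```

Unpacking is handy when a codec property is read back as a number and you want to print it in readable form.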
