Section C

Projected Local Binary Pattern based Two-Wheelers Detection using Adaboost Algorithm

Yeunghak Lee1,*, Taesun Kim1, Jaechang Shim2
Author Information & Copyright
1Dept. of Avionics Eng. College of Aviation , Kyungwoon University , Kumi, S.Korea, annaturu,tskim@ikw.ac.kr
2Dept. of Computer Science, Andong University, Andong, S.Korea, jcshim@andong.ac.kr
*Corresponding Author: Yeunghak Lee, Kyungwoon University, Dept. of avionics Eng. College of Aviation, annaturu@ikw.ac.kr

© Copyright 2014 Korea Multimedia Society. This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Received: Dec 01, 2014 ; Revised: Jan 15, 2015 ; Accepted: Jan 31, 2015

Published Online: Dec 31, 2014

Abstract

We propose a bicycle detection system riding on people based on modified projected local binary pattern(PLBP) for vision based intelligent vehicles. Projection method has robustness for rotation invariant and reducing dimensionality for original image. The features of Local binary pattern(LBP) are fast to compute and simple to implement for object recognition and texture classification area. Moreover, We use uniform pattern to remove the noise. This paper suggests that modified LBP method and projection vector having different weighting values according to the local shape and area in the image. Also our system maintains the simplicity of evaluation of traditional formulation while being more discriminative. Our experimental results show that a bicycle and motorcycle riding on people detection system based on proposed PLBP features achieve higher detection accuracy rate than traditional features.

Keywords: Adaboost; Histogram of oriented gradients; Pedestrian; Projection; Two wheelers

I. INTRODUCTION

Today’s computer environment is changing fast because of the development of intelligent interface and multimedia. Especially in intelligent vehicle, but while a simple device or vehicle to move things may be in past, the shape (or style) and function is very fast and variety progressed to the support of MEMS, FPGA, embedded system and etc. in now. In the past, a vehicle was regarded as a simple tool that transports people and commodities. However, the shapes and functions of vehicles continue to develop rapidly and in diverse ways. The development of vehicles was concentrated on not only improving performance, but also to protect the drivers and passengers in a vehicle in the occurrence of a traffic accident. The degree of injury to passengers in a vehicle caused by a traffic accident is reduced gradually in advantage of many safety device techniques mounted in and outside of the vehicle. But the accidents outside of a vehicle that is caused by a driver’s carelessness or by road environments still have many problems, so it is required to detect the risk of a traffic accident in order to save lives [1].

Most of the studies relating to human safety over the past several years have mainly concentrated on increasing the detection rate of pedestrians and automobiles on a road from a still image and framework. Now, the study scope has expanded to protect vulnerable road users (VRUs) such as small automobiles [1, 2]. Because pedestrians and bicycle riders are very vulnerable road participants among VRUs, they are a hot subject for the study field in intelligent transportation system. Therefore various types of sensors are utilized for an accurate and real time detection and tracking; NIR, FIR, LIDAR, RADAR, Laser Scanner, fusion system, and etc.

Among VRUs on the road, pedestrians are the slowest while the others have the characteristics of moving speedily. Two-wheelers, different from ordinary pedestrians, are expressed with complicated shapes in combination of the rider’s dressing style, hair style, whole body attitude, forms and patterns of loading, and various types of two-wheelers, which makes the shapes even further complicated according to the viewing angle, so that it requires a stronger algorithm against changes in their shapes.

The automobile vision-based systems have mainly concentrated on recognizing pedestrians and automobiles [3]. As stated in the foregoing, detection of two-wheelers on the road is similar with that of pedestrians on road. Extracting characteristics for detecting pedestrians are classified with single characteristics, multiple characteristics, whole area characteristics, and district characteristics pursuant to the usage of extracted characteristics [4]. Especially, HOG characteristics or Improved HOG characteristics are widely utilized in the methods for recognizing pedestrians by using the automobile vision. Zhu et al. [5] applied the HOG characteristics based on variable block size to improve detection speed. Further, Watanabe et al. [6] utilized co-occurrence HOG characteristics, and Wang et al. [7] utilized HOG-LBP human detection to improve detection accuracy.

Generally speaking, the literature on pedestrian detection system is abundant [8]. Features can be distinguished into global features, local features, single features, and multiple features depending on how features are measured and used [9]. Principal component analysis (PCA) [10] operates global features. On the other hand, local features are extracted by dividing a sliding window into different subset region of image [11].

LBP has been used for texture classification and face detection to computer efficient texture feature [12, 13]. And it is very invariant for illumination and contrast to gray level changes and good anti-noise performance. Despite the great success of LBP in early applications, its practical results are not satisfactory in several fields. Hence many researchers have improved the LBP in the specific domain and achieved lots of significant research [14]

As we mentioned previously, two wheelers similar with not only the shape of pedestrian but also detection technique based on several features. Two wheelers consist of human and machine; usually a human is upper part and machine is lower part in the shape. In this paper, we define that bicyclist (BL) is a people riding the bicycle and motorcycle driver (MD) is a people riding the motorcycle [8]. So two wheelers detection system can be adapted to the pedestrian detection algorithms for features extraction, classification, and non-maxima suppression. A solution of slow performance from dense encoding scheme and multi-level scale images is to use a boosting algorithm [15] to speed up classification process. Because of above reasons, we tried to use modified LBP algorithm to select best features and Adaboost to improve detection rate. In this study, we invented new algorithm based on projected LBP value which is weighted by uniform pattern and limited area. More detail about general and modified LBP will describe in section 2. This paper proposes a system for the detection of two wheelers ride on people with both efficiency and accuracy.

This paper consists of the following: Section 2 introduces basic extracting characteristics methods with projection vectors and LBP algorithm that can increase detection rates significantly. Section 3 states the framework and training of suggested detecting two-wheelers system. The evaluation and detailed analysis for the experimental results are summarized in section 4. Section 5 states the conclusion.

II. Feature Extraction

1. Projection Vectors

In general, the projection executes the role of converting N dimensional coordinate systems into smaller dimensions than N. We can take shape information, if a projection is used in the image. If it is used as one dimension, the horizontal projection vectors can be obtained by vertical projection; the vertical projection vectors can be obtained by a horizontal projection. The binary image of a lizard with its vertical and horizontal projections is shown in Figure 1 [16].

jmis-1-2-119-g1
Fig. 1. Example of projection for binary image.
Download Original Figure

If given an image whose size is M x N and the image is I(i, j), the projection H(k) along the row and the projection V(k) along the column of binary image are given by

H(k)=j=0N1I(i,j)
(1)
V(k)=j=0M1I(i,j)
(2)

It is an advantage that the projection includes space information that can be used for recognition. But there are some problems with indexing database images which include several sizes, using these projections. Therefore it is impossible to compare projections using same length vectors because of the size of the projection vectors which are generated by image size have different sizes. Because there are a number of vertical and horizontal vectors used as the image size, in the case of it being used as a database index for searching for information, it has the disadvantage that amount of indexing data is generated, causing a delay in the production of results.

2. Local Binary Pattern

The Local Binary Pattern(LBP) texture operator, which is initially proposed by Ojala et al.[17], has been used successfully in various computer vision application, such as texture descriptors, face recognitions and etc. LBP is highly discriminative and its key advantages, namely it's invariant to gray level changes, computational efficiency and good anti-noise performance. The common version of LBP code operator is defined as follows [18].

L B P P , R ( x C , y C ) = P = 0 P 1 s ( g P g C ) 2 P s ( z ) = { 1 , z 0 0 , otherwise
(3)

where gC is the intensity value of center pixel, gP is the intensity value of neighbor pixels. P is the total number of invalid pixels and R is the radius of neighborhood. The examples of local circular neighborhood are shown in Figure 2.

jmis-1-2-119-g2
Fig. 2. The circular (4,1), (8,1) and (16,2) neighborhoods.
Download Original Figure

Given an image region of size I*J , a LBP histogram of 2P is computed for feature representation.

The calculation principle of LBP for LBP8,1 adapted equation (2) is shown in Figure 3.

jmis-1-2-119-g3
Fig. 3. Example of LBP calculation
Download Original Figure

There are some patterns that occur more frequently than others and the number of state transitions between 0 and 1 for these patterns are at most two bitwise changing [19]. And if neighborhood sampling point number increases, the binary pattern species rapidly rise also. Extracted binary patterns are classified as uniform pattern and the rest as non-uniform pattern to the original operator [17]. If P=8, the number of bins is reduce from 256 to 59. Uniform pattern is the number of bitwise transitions from 0 to 1 or vice versa when the bit pattern is considered circular. For example, the 00000000(0 bitwise change), 01110000(2 bitwise change), and 1100111(2 bitwise change) are uniform patterns, while 11010011(4 bitwise change) and 01010101(7 bitwise change) are non-uniform patterns. There are two reasons for omitting the non-uniform patterns[18]. First, most of the local binary patterns in natural images are uniform. The second reason for considering uniform patterns is the statistical robustness. Therefore, uniform patterns have 3 kinds of advantages:

  1. Rotation invariant

  2. Reduce the length of the feature

  3. Robust against variation in pose or illumination.

3. Projection Local Binary Pattern (PLBP)

Projection vectors are calculated by equation (1) for 8x8 cell area from original image. The basic LBP is used neighbor pixels with center pixel, while proposed projection LBP (PLBP) is used vertical and horizontal projected pixels with projection vectors. First, we calculate the averages of projection vectors to use threshold value. The second, the definition of proposed horizontal and vertical PLBP code for kth cell is following:

PLBP(H(k))H,i=p=0P1b(H(k)pH(k)p,avg)2p
(4)
PLBP(V(k))H,j=p=0P1b(V(k)pV(k)p,avg)2pb(z)={1,z00,otherwise
(5)

where H(k)p,avg is the average value of horizontal vectors for kth cell and V(k)p,avg is the average value of vertical vectors for kth cell. H(k)p is pth pixel projected horizontal direction within kth cell and V(k)p is pth pixel projected vertical direction within kth cell. The calculation processing is described in Figure 4.

jmis-1-2-119-g4
Fig. 4. Calculation of LBP code from local cell projection vectors.
Download Original Figure

Because a natural image includes many kinks of noise, projection vector is weighted by normalized value, according to the uniform pattern and non-uniform pattern. The definition of horizontal and vertical projection vector PRJH and PRJV is following:

PRJH={H(k)i×(LBPH,i/255),if UPH(k)i×ε ,otherwise
(6)
PRJV={V(k)i×(LBPV,i/255),if UPV(k)i×ε ,otherwise
(7)

where UP means uniform pattern and ε is small value (0.01).

To normalize the cell’s projection vectors, it should be grouped into blocks (3x3 cells). This is done by accumulating a measure of local histogram value over the blocks and the result is then used to normalize the cells in the block. Although there are four different methods for block normalization suggested by Dalal and Triggs [20], L2-norm normalization ∏H,V is implemented using equation (5), as shown in Figure 5.

jmis-1-2-119-g5
Fig. 5. Normalization processing of blocks with 3x3 cells
Download Original Figure
Π H , V = P R J H , V B H , V 2 2 + ε 2
(8)

III. Classifier

Adaboost is a simple learning algorithm that selects a small set of weak classifiers from a large number of potential features according to the weighted majority of classifiers. The training procedure of Adaboost is a greedy algorithm, which constructs an additive combination of weak classifier. Our boosting algorithm is basically the same as P. Viola’s algorithm [21]. The boosting algorithm pseudo code for Adaboost is given in Figure 5. The algorithm takes as input a training set (x1,y1),…,(xn,yn) where each xi belong to some domain X and each label yi is in some label set Y.

jmis-1-2-119-g6
Fig. 6. Adaboost learning procedure.
Download Original Figure

The final hypothesis H is a weighted majority vote of the T weak hypotheses where αt is the weight assigned to ht. Using two strong classification, in this paper suggests 2nd stage cascade method. It improves the recognition rate due to the complementary role for two feature vector of quite different type.

IV. Experimental Results

In this study, an experiment was carried out with an ordinary user computer environment consisting of a Pentium 3.1 GHz and Visual C++ 6.0 Program. To access the effectiveness of the proposed evolutionary method, the algorithm was applied to a practical proposed method. For the proposed efficient two wheeler detection approach, we describe the performance of our innovative technique using projected LBP. Since there is no public dataset for two-wheeler images, we built such a dataset by ourselves, using a digital camera and websites. An image of two-wheelers can be expressed by various angles in an automobile. For our purposes, it is hypothesized in the experiment for the following 2 cases: a two-wheeler is running in front of an automobile (rear appearance) and a two-wheeler is coming toward the automobile (front appearance). Each datasets are consists of two kinds of type according to the front view point for horizontal line: 90° and 60°. And in our study, we also experiments about mixed the 90° and 60°. Our dataset contains only front and back view with a relatively limited range of poses (60° and 90°) which are scaled to size 64x128 pixels. And the negative (non-two wheelers) samples used in our experiments were extracted randomly from general street images. All our dataset examples used in this paper shows in Figure 7. In Figure 7, P means positive and N means negative.

jmis-1-2-119-g7
Fig. 7. Examples of training samples for positive and negative
Download Original Figure

Used each training set ratio of positive and negative is 1:1. For more detail, the bicyclist training images are 340 examples for 60 degree and 845 examples for 90 degree. And motorcycle driver training images are 96 examples for 60 degree and 234 examples for 90 degree. We also train the mixed of two degree and types (bicyclist and motorcycle driver). An example of the images used in the experiment is shown in Figure. 4. In Figure 4, the first row refers to the images of two-wheelers corresponding to bicycles and the second row refers to the images corresponding to motorcycles.

In the first experiments, the performance of the traditional concatenated feature saving HOG to compare with proposed algorithm has shown in Figure 5 (a), according to the data type and degree. The ROC curves were generated as described in [22] by adjusting the threshold (θ) of the last two node of the object detector. A range of thresholds of -20 to 20 was utilized in classification, and confusion matrix, true positive rate (TPR) and false positive rate (FPR) were used for analyzing experimental results per angles for the methods, and ROC curves are shown in Figure. 5, by applying Eq. (8) below:

TPR=TPTP+FN,FPR=FPFP+TN
(8)

where “TP” is True Positive”, “FP” is False Positive”, “TN” is True Negative and “FN” is False Negative. In figure 3, “Moto” means motorcycle, “Bike” means bicycle, and “MB” is a mixture of motorcycles and bicycles, respectively.

In Figure 8 (a), it has shown that the experiment according to “MB” has the best results among these experiments, but the recognition rate is significantly low. However, in Figure 8 (b), shows that the results of MB experiments according to the general projection vector method have a higher recognition rate than general HOG method. When applied with another analytical method, as shown in Figure 8 (b) and (c), because the area of the curve of the suggested method is larger than that of the curve of the ordinary method (Projection Vector), it is known that this system has a better performance. When PLBP algorithm was applied for the other characteristic vector (Projection Vector) method, a higher recognition rate could be obtained, and the results are listed in Figure 8 (c).

jmis-1-2-119-g8
Fig. 8. The ROC curves of testing samples for traditional method and proposed method: (a) concatenated feature saving HOG, (b) projection vector, and (c) proposed PLBP method
Download Original Figure

The highest accuracies for each of the methods were calculated with equation (9) and the results are listed in Table 1.

A c c u r a c y = T P + T N T P + F P + T N + F N
(9)
Table 1. The result of recognition accuracy for different features (%)
Degree(°)\Method HOG Projection PLBP
60 Moto 61.1 88.8 95.2
Bike 71.2 90.0 92.5
MB 76.7 90.6 92.1
90 Moto 74.9 93.0 96.0
Bike 78.3 91.9 91.9
MB 76.1 90.6 91.8
90-60 Moto 77.8 94.5 94.6
Bike 75.5 90.1 90.1
MB 73.1 88.7 90.1
Download Excel Table

As shown in Table 1, Moto (motorcycle) has higher accuracies than Bike and MB for proposed method, signifying that motorcycle has a trend of better classifying characteristics than Bike. In our opinion even if a motorcycle becomes more complicated by loading baggage at the rear or by the high loading of baggage than bicycle, our proposed algorithm improved the recognition rate than others. In the experiment of mixture of the two kinds of two-wheelers, the results of suggested method accuracies also have a higher accuracy than the existing algorithm. Because two-wheelers are consist of persons, bicycles, or motorcycles together with various shapes of baggage. Therefore, the upper part of a waist is similar to a pedestrian but the lower part is diversified and complicated in shape, so that it becomes another field of challenge besides detection of pedestrians and automobiles for an intelligent automobile. The two-wheeler in the following means a combination of a person and a machine.

V. CONCLUSION

Accurately and efficiently pedestrian detection in still images is one of the most difficult works due to a variety range of poses, as well as environmental conditions and cluttered backgrounds. Classification based methods have comprised the mainstream of research and have been shown to achieve successful results in object detection via pedestrian detection.

In this study, we have introduced that a new practical implementation of the solution for weak objects (vulnerable road users) on the road using projected local binary pattern.

The underlying motivation of our approach originate from the observation that LBP and projection vector have rotation invariant, reduce the length of the feature, and robust against variation in pose or illumination. We calculated horizontal and vertical vector for each cell from original image. To remove the noise, we adapted the uniform pattern as weighting value.

It has been experimentally demonstrated that PLBP method weighted value by the uniform pattern leads to better classification results than other traditional methods from ROC. Adaboost classification have comprised the main stream of research to detection two-wheelers and the results compared with existing method have been shown that it is a highly improved system performance for the further research, we consider including occluded region, changed objected according to the weather and night environment, and etc.

REFERENCES

[1].

H. Jung, Y. Ehara, J. K. Tan, H. Kim, and S. Ishikawa, “Applying MSC-HOG Feature to the Detection of Human on a Bicycle,” International Conference on Control, pp. 514-517, Oct. 2012.

[2].

H. Cho, P. E. Rybski, and W. Zhang, “Vision-based Bicyclist Detection and Tracking for Intelligent Vehicles,” Intelligent Vehicle Symposium, pp. 454-461, June 2010.

[3].

T. Gandhi and M. M. Trivedi, “Pedestrian Protection Systems: Issues, Survey, and Challenges,” IEEE Transaction on Intelligent Transportation Systems, Vol. 8, No. 3, pp. 413-430, September, 2007.

[4].

L. Yu, F. Zhao, and Z. An, “Locally Assembled Binary Feature with Feed-forward Cascade for Pedestrian Detection in Intelligent Vehicle,” Int. Conf. on Cognitive Informatics, pp. 458-463, July, 2010

[5].

Q. Zhu, M. C. Yeh, K. T. Cheng and S. Avidan, “Fast human detection using a cascade of histograms of oriented gradients,” IEEE Conference on Computer Vision and Pattern Recognition, pp. 1491-1498, June, 2006.

[6].

T. Watanabe, S. Ito, and K. Yokoi, “Co-occurrence Histogram of Oriented Gradients for Detection,” Advances in Image and Video Technology(LNCS), pp. 37-47, 2009.

[7].

X. Y. Wang, T. X. Han, and S. Yan, “An HOG-LBP human detector with partial occlusion handling,” International Conference on Computer Vision, pp.32-39, Sept. 2009.

[8].

Y. H. Lee, S. H. Lee, T. S. Kim, and J. C. Shim, “New approach to two wheelers detection using Cell Comparison,” Journal of Multimedia Information System, Vol. 1, No. 1, pp.45-54, September, 2014.

[9].

O. Tuzel, F. Porikli and P. Meer, “Human detection via classification on Riemannian manifolds. in Proc. IEEE conf. comp. Vis. Pattern Recognition, pp. 1-8, Jun 2007.

[10].

P. F. Felzenzwalb, R. B. Girshick, D. Mcallester and D. Ramanan, “Object Detection with Discriminatively Trained Part-Based Models,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 32 no.9, pp. 1627-1645, Sep. 2010.

[11].

L. Yu, F. Zhao and Z. An, “Locally Assembled Binary Feature with Feed-forward Cascade for pedestrian Detection in Intelligent Vehicles,” in Proc. 9th Int. Conf. on Cognitive Informatics, pp. 458-463, Jul. 2010.

[12].

G. Zhenhua, Z. Lei, and D. Zhang, “A Completed Modeling of Local Binary Pattern Operator for Texture Classification,” IEEE Trans. Image Processing, Vol. 19, No. 6, pp. 1657-1663, Jun. 2010.

[13].

T. Ahonen, A. Hadid, and M. Pietikainen, “Face description with local binary patterns: Application to face recognition,” IEEE Trans. Pattern Anal., Mach. Intell., Vol 28, No. 12, pp.2037-2041, Dec. 2006.

[14].

K. Song, Y. Yan, W. Chen, and X. Zhang, “ Research and Perspective on Local Binary Pattern,” Acta Automatica Sinica, Vol. 39, No. 6, pp.730-744, Jun. 2013.

[15].

H. Cho, P. E. Rybski and W. Zhang, “Vision-based bicyclist detection and tracking for intelligent vehicles,” 2010 IEEE Intelligent Vehicles Symposium, pp. 454-461 , San Diego, CA., Jun. 2010.

[16].

R. Jain, Machine Vision, McGraw-Hill, 1995.

[17].

T. Ojala, M. Pietikäinen, and D. Harwood, “A comparative study of texture measures with classification based on featured distributions,” Pattern Recognition, Vol. 29, No. 1, pp.51-59, Jan. 1996.

[18].

M. Pietikäinen, A. Hadid, G. Zhao, and T. Ahonen, “ Local Binary Patterns for Still Images,” Computer Vision Using Local Binary Patterns, Vol. 40, pp.13-47, 2011.

[19].

A. Satpathy, X. Jiang, and H. Eng, “Human detection using Discriminative and Robust Local Binary Pattern,” Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 2376-2380, May 2013.

[20].

N. Dalal and B. Triggs, “Histogram of Oriented Gradients for Human Detection,” IEEE Computer. Vision Pattern Recognition, pp.886-893, Jun. 2005.

[21].

P. Viloa, M. Jones and M. Snow, “Detecting pedestrians using patterns of motion and appearance. The 9th ICCV, pp. 153-161, Oct. 2003.