读 \

Person Re-identification by Local Maximum Occurrence Representation and Metric Learning[^1]

Local Maximal Occurrence Feature. XQDA(CVPR 2015)

预处理

multiscale Retinex algorithm

创新点

引入SILTP

SILTP并不是作者创新,只是将之引入到LOMO的特征一部分.

SITLP是LBP/LTP的一种改进,如下图,SILTP对光照和噪音都不敏感.

其中⊕表示将$S_\tau(I_c,I_k)$得到的二进制数值串联到一起形成一个二进制字符串.

LBP 有灰度不变的性质，但对噪声不鲁棒

SILTP 通过引入尺度不变局部比较容差，提高 LBP，实现强度尺度变化的不变性和图像噪声的鲁棒性。

LOMO

采用$10\times10$的子窗口,在$128\times48$的图像中,使用步长为5的像素,对每个子窗口提取2种SILTP直方图,加上一个$8\times8\times8$的HSV直方图.

XQDA

作者提出了XQDA的一种新的度量学习的方式,原理主要是利用交叉二次判别降低子空间的维度.

通过计算广义瑞利商，通过广义特征值分解获得封闭式的解。这里得到闭式解的方法和LDA里面的方法一样.周志华那本西瓜书也有详细的推导.

其主要是基于KISSME[^2], Bayesian face recognition[^3] 改进而来的方法.

KISSME度量在我之前的文章中有详细的提到.

XQDA利用了相似的原理,它定义了两个概率密度函数分别表示同一类的样本,和不同一类的样本.

$$P(\Delta|\Omega_I)={1 \over (2\pi)^{d/2}|\Sigma_I|^{1/2}}e^{-{1\over 2}\Delta^T \Sigma_I^{-1}\Delta}$$

$$P(\Delta|\Omega_E)={1 \over (2\pi)^{d/2}|\Sigma_E|^{1/2}}e^{-{1\over 2}\Delta^T \Sigma_E^{-1}\Delta}$$

然后相除,化简得到

$$f(\Delta)=\Delta^T(\Sigma_I^{-1}-\Sigma_E^{-1})\Delta$$

Other

作者在其中一个实验结果中有点问题??得不到论文中那么好的结果??是我们做的工作有问题?

[^1]: Liao S, Hu Y, Zhu X, et al. Person re-identification by local maximal occurrence representation and metric learning[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015: 2197-2206.

[^2]: Köstinger M, Hirzer M, Wohlhart P, et al. Large scale metric learning from equivalence constraints[C]// IEEE Conference on Computer Vision and Pattern Recognition. IEEE Computer Society, 2012:2288-2295.

[^3]: Moghaddam B, Jebara T, Pentland A. Bayesian face recognition[J]. Pattern Recognition, 2000, 33(11): 1771-1782.