Research
My research is on machine learning and computer vision for video processing. Here are my recent projects and papers:
|
|
The Box Size Confidence Bias Harms Your Object Detector
J. Gilg,
T. Teepe,
F. Herzog,
and G. Rigoll
arXiv, 2022
arXiv /
code
We demonstrate how to modify the histogram binning calibration to improve performance through conditional confidence calibration.
|
|
Face Aggregation Network For Video Face Recognition
S. Hörmann,
Z. Chao,
M. Knoche,
F. Herzog,
and G. Rigoll
IEEE International Conference on Image Processing (ICIP), 2021
IEEE Article
***Accepted for ICIP 2021***
We suggest a permutation invariant U-Net architecture capable of processing an arbitrary number of frames in video face recognition.
|
|
GaitGraph: Graph Convolutional Network for Skeleton-Based Gait Recognition
T. Teepe,
A. Khan,
J. Gilg,
F. Herzog,
S. Hörmann,
and G. Rigoll
IEEE International Conference on Image Processing (ICIP), 2021
arXiv /
code
***Accepted for ICIP 2021***
A novel Graph Convolutional Network (GCN) to obtain a modern model-based approach for gait recognition.
|
|
Lightweight Multi-Branch Network for Person Re-Identification
F. Herzog,
X. Ji,
T. Teepe,
S. Hörmann,
J. Gilg, and
G. Rigoll
IEEE International Conference on Image Processing (ICIP), 2021
arXiv /
code
***Accepted for ICIP 2021***
Lightweight network that combines global, part-based, and channel features in a unified multi-branch architecture.
|
|
Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing
O. Köpüklü,
S. Hörmann,
F. Herzog,
H. Cevikalp, and
G. Rigoll
arXiv, 2020
arXiv
/
code
***Accepted for Computer Vision and Image Understanding Journal***
We propose dissected 3D CNNs to address several serious handicaps of 3D ResNet models during online operation.
|
|
Comparative Analysis of CNN-Based Spatiotemporal Reasoning in Videos
O. Köpüklü,
F. Herzog, and
G. Rigoll
ICPR International Workshops and Challenges. ICPR 2021.
arXiv
/
code
/
video
/
publication
***Accepted for ICPR International Workshops 2021***
A comparative analysis of different spatiotemporal modeling techniques for action and gesture recognition.
|
|