Publications

2024

S Marques, P Kouba, A Legrand, J Sedlar, L Disson, J Planas-Iglesias, Z Sanusi, A Kunka, J Damborský, T Pajdla, Z Prokop, S Mazurenko, J Sivic, D Bednar
CoVAMPnet: Comparative Markov State Analysis for Studying Effects of Drug Candidates on Disordered Biomolecules
JACS Au (2024)
pdf | project page

T Soucek, JB Alayrac, A Miech, I Laptev, J Sivic
Multi-Task Learning of Object States and State-Modifying Actions from Web Videos
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024)
pdf | project page | code

A Bushuiev, R Bushuiev, P Kouba, A Filkin, M Gabrielova, M Gabriel, J Sedlar, T Pluskal, J Damborsky, S Mazurenko, J Sivic
Learning to design protein-protein interactions with enhanced generalization
International Conference on Learning Representations (ICLR) (2024)
pdf | code | supp | demo | report

T Soucek, D Damen, M Wray, I Laptev, J Sivic
GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024)
pdf | project page | code

2023

P Kouba, P Kohout, F Haddadi, A Bushuiev, R Samusevich, J Sedlar, J Damborsky, T Pluskal, J Sivic, S Mazurenko
Machine Learning-Guided Protein Engineering
ACS Catalysis (2023)
bibtex | pdf | supp

J Sedlar, K Stepanova, R Skoviera, J K Behrens, M Tuna, G Sejnova, J Sivic, R Babuska
Imitrob: Imitation Learning Dataset for Training and Evaluating 6D Object Pose Estimators
IEEE Robotics and Automation Letters (RA-L), also presented at IROS (2023)
bibtex | pdf | project page | code | supp | video

D McKee, J Salamon, J Sivic, B Russell
Language-Guided Music Recommendation for Video via Prompt Analogies
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
CVPR Highlight paper
bib-short | bib-full | pdf | project page | code | video

A Yang, A Nagrani, I Laptev, J Sivic, C Schmid
VidChapters-7M: Video Chapters at Scale
Conference on Neural Information Processing Systems (NeurIPS) (2023)
bib-short | bib-full | pdf | project page | code

A Vobecky, O Siméoni, D Hurych, S Gidaris, A Bursuc, P Pérez, J Sivic
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Conference on Neural Information Processing Systems (NeurIPS) (2023)
pdf | project page | code | supp

A Yang, A Nagrani, P H Seo, A Miech, J Pont-Tuset, I Laptev, J Sivic, C Schmid
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
bibtex | pdf | code | supp

K Zorina, D Kovar, F Lamiraux, N Mansard, J Carpentier, J Sivic, V Petrik
Multi-Contact Task and Motion Planning Guided by Video Demonstration
IEEE International Conference on Robotics and Automation (ICRA) (2023)
pdf

L Montaut, Q Le Lidec, A Bambade, V Petrik, J Sivic, J Carpentier
Differentiable Collision Detection: a Randomized Smoothing Approach
IEEE International Conference on Robotics and Automation (ICRA) (2023)
pdf

2022

Y Labbé, L Manuelli, A Mousavian, S Tyree, S Birchfield, J Tremblay, J Carpentier, M Aubry, D Fox, J Sivic

MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare

Conference on Robot Learning (CoRL) (2022)

pdf | project page

A Yang, A Miech, J Sivic, I Laptev, C Schmid

Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Advances in Neural Information Processing Systems (NeurIPS) (2022)

pdf | project page | code

A Vobecky, D Hurych, O Siméoni, S Gidaris, A Bursuc, P Pérez, J Sivic

Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation

European Conference on Computer Vision (ECCV) (2022)

Oral

pdf | project page | code

A Yang, A Miech, J Sivic, I Laptev, C Schmid

Learning to Answer Visual Questions from Web Videos

IEEE Transactions on Pattern Analysis and Machine Intelligence (2022)

Special issue of TPAMI with a selection of best papers from ICCV 2021

pdf

V Petrik, M N Qureshi, J Sivic, M Tapaswi

Learning Object Manipulation Skills from Video via Approximate Differentiable Physics

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2022)

pdf | project page | code

H Cisneros, T Mikolov, J Sivic

Benchmarking Learning Efficiency in Deep Reservoir Computing

Conference on Lifelong Learning Agents (CoLLA) (2022)

pdf | code

T Soucek, J-B Alayrac, A Miech, I Laptev, J Sivic

Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022)

pdf | project page | code

G Ponimatkin, Y Labbe, B Russell, M Aubry, J Sivic

Focal Length and Object Pose Estimation via Render and Compare

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022)

pdf | project page | code

A Yang, A Miech, J Sivic, I Laptev, C Schmid

TubeDETR: Spatio-Temporal Video Grounding with Transformers

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022)

Oral

pdf | project page | code

L Montaut, Q Le Lidec, V Petrik, J Sivic, J Carpentier

Collision Detection Accelerated: An Optimization Perspective

Robotics: Science and Systems (RSS) (2022)

pdf

2021

Z Li, J Sedlar, J Carpentier, I Laptev, N Mansard, J Sivic

Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos

International Journal of Computer Vision (IJCV) (2021)

pdf | project page | supp

K Zorina, J Carpentier, J Sivic, V Petrík

Learning to Manipulate Tools by Aligning Simulation to Video Demonstration

IEEE Robotics and Automation Letters (RA-L) (2021)

Best poster prize at 2nd International Workshop on AI for Robotics

pdf | project page | code | video

J Bielcikova, R Kunnawalkam Elayavalli, G Ponimatkin, J H Putschke, J Sivic

Identifying heavy-flavor jets using vectors of locally aggregated descriptors

Journal of Instrumentation (2021)

pdf | code

S Li, Y Du, A Torralba, J Sivic, B Russell

Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions

IEEE International Conference on Computer Vision (ICCV) (2021)

project page | code | supp

A Yang, A Miech, J Sivic, I Laptev, C Schmid

Just Ask: Learning to Answer Questions from Millions of Narrated Videos

IEEE International Conference on Computer Vision (ICCV) (2021)

pdf | project page | code | demo

A Miech, J-B Alayrac, I Laptev, J Sivic, A Zisserman

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021)

bib-short | bib-full | pdf

Y Labbé, J Carpentier, M Aubry, J Sivic

Single-view robot pose and joint angle estimation via render & compare

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2021)

bib-short | bib-full | pdf | project page | code

A Vobecký, D Hurych, M Uřičář, P Pérez, J Sivic

Artificial Dummies for Urban Dataset Augmentation

35th AAAI Conference on Artificial Intelligence (2021)

bib-short | bib-full | pdf | project page | code

J Macke, J Sedlar, M Olsak, J Urban, J Sivic

Learning to solve geometric construction problems from images

Conference on Intelligent Computer Mathematics (CICM) (2021)

2020

C Toft, W Maddern, A Torii, L Hammarstrand, E Stenborg, D Safari, M Okutomi, M Pollefeys, J Sivic, T Pajdla, F Kahl, T Sattler

Long-Term Visual Localization Revisited

IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)

pdf

I Rocco, M Cimpoi, R Arandjelovic, A Torii, T Pajdla, J Sivic

NCNet: Neighbourhood Consensus Networks for Estimating Image Correspondences

IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)

pdf

Y Labbe, S Zagoruyko, I Kalevatykh, I Laptev, J Carpentier, M Aubry, J Sivic

Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning

IEEE Robotics and Automation Letters (2020)

pdf | project page

H Cisneros, J Sivic, T Mikolov

Visualizing computation in large-scale cellular automata

Artificial Life Conference Proceedings (2020)

pdf | project page | video

V Petrik, M Tapaswi, I Laptev, J Sivic

Learning Object Manipulation Skills via Approximate State Estimation from Real Videos

Conference on Robot Learning (CoRL) (2020)

pdf | project page | code | supp | video

R Strudel, A Pashevich, I Kalevatykh, I Laptev, J Sivic, C Schmid

Learning to combine primitive skills: A step towards versatile robotic manipulation

IEEE International Conference on Robotics and Automation (ICRA) (2020)

pdf | project page | code

D Zhukov, J-B Alayrac, I Laptev, J Sivic

Learning actionness via long-range temporal order verification

European Conference on Computer Vision (ECCV) (2020)

pdf | project page | supp

Y Labbe, J Carpentier, M Aubry, J Sivic

CosyPose: Consistent multi-view multi-object 6D pose estimation

European Conference on Computer Vision (ECCV) (2020)

I Rocco, R Arandjelovic, J Sivic

Efficient Neighbourhood Consensus Networks via Submanifold Sparse Convolutions

European Conference on Computer Vision (ECCV) (2020)

pdf | project page | code | supp

A Miech, J-B Alayrac, L Smaira, I Laptev, J Sivic, A Zisserman

End-to-End Learning of Visual Representations from Uncurated Instructional Videos

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)

pdf | project page | code

2019

H Cisneros, J Sivic, T Mikolov

Evolving Structures in Complex Systems

IEEE Symposium Series on Computational Intelligence (2019)

Best student paper

pdf

Z Li, J Sedlar, J Carpentier, I Laptev, N Mansard, J Sivic

Estimating 3D Motion and Forces of Person-Object Interactions From Monocular Video

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019)

Best paper finalist

pdf | project page

T Dalens, M Aubry, J Sivic

Bilinear Image Translation for Temporal Analysis of Photo Collections

IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)

pdf | project page

A Torii, H Taira, J Sivic, M Pollefeys, M Okutomi, T Pajdla, T Sattler

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)

pdf | project page

I Rocco, R Arandjelovic, J Sivic

Convolutional Neural Network Architecture for Geometric Matching

IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)

pdf | project page

J Peyre, I Laptev, C Schmid, J Sivic

Detecting Unseen Visual Relations Using Analogies

IEEE International Conference on Computer Vision (ICCV) (2019)

pdf | code

A Miech, I Laptev, J Sivic, H Wang, L Torresani, D Tran

Leveraging the Present to Anticipate the Future in Videos

IEEE Conference on Computer Vision and Pattern Recognition Workshops (2019)

pdf

A Miech, D Zhukov, J-B Alayrac, M Tapaswi, I Laptev, J Sivic

HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

IEEE International Conference on Computer Vision (ICCV) (2019)

pdf | project page | code | demo

H Taira, I Rocco, J Sedlar, M Okutomi, J Sivic, T Pajdla, T Sattler, A Torii

Is This the Right Place? Geometric-Semantic Pose Verification for Indoor Visual Localization

IEEE International Conference on Computer Vision (ICCV) (2019)

pdf | project page

M Dusmanu, I Rocco, T Pajdla, M Pollefeys, J Sivic, A Torii, T Sattler

D2-Net: A Trainable CNN for Joint Description and Detection of Local Features

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019)

pdf | project page | code | supp

D Zhukov, J-B Alayrac, R G Cinbis, D Fouhey, I Laptev, J Sivic

Cross-Task Weakly Supervised Learning From Instructional Videos

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019)

pdf | code

2018

R Arandjelovic, P Gronat, A Torii, T Pajdla, J Sivic

NetVLAD: CNN Architecture for Weakly Supervised Place Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (2018)

pdf | project page | video

A Torii, R Arandjelovic, J Sivic, M Okutomi, T Pajdla

24/7 Place Recognition by View Synthesis

IEEE Transactions on Pattern Analysis and Machine Intelligence (2018)

pdf | project page

I Rocco, R Arandjelovic, J Sivic

End-to-end Weakly-supervised Semantic Alignment

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

pdf | project page | code

T Sattler, W Maddern, C Toft, A Torii, L Hammarstrand, E Stenborg, D Safari, M Okutomi, M Pollefeys, J Sivic, F Kahl, T Pajdla

Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

pdf | project page

L A Hendricks, O Wang, E Shechtman, J Sivic, T Darrell, B Russell

Localizing moments in video with temporal language

Conference on Empirical Methods in Natural Language Processing (EMNLP) (2018)

pdf

I Rocco, M Cimpoi, R Arandjelovic, A Torii, T Pajdla, J Sivic

Neighbourhood Consensus Networks

Advances in Neural Information Processing Systems (2018)

pdf | project page | code

H Taira, M Okutomi, T Sattler, M Cimpoi, M Pollefeys, J Sivic, T Pajdla, A Torii

InLoc: Indoor Visual Localization with Dense Matching and View Synthesis

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

pdf | project page | code

2017

Peyre, J., Laptev, I., Schmid, C. and Sivic, J.

Weakly-supervised learning of visual relations

IEEE International Conference on Computer Vision (2017)

Bibtex source | Document: PDF | Project page

Alayrac, J.-B., Laptev, I., Sivic, J. and Lacoste-Julien, S.

Joint Discovery of Object States and Manipulation Actions

IEEE International Conference on Computer Vision (2017)

Bibtex source | Document: PDF | Project page

Miech, A., Alayrac, J.-B., Bojanowski, P., Laptev, I. and Sivic, J.

Learning from Video and Text via Large-Scale Discriminative Clustering

IEEE International Conference on Computer Vision (2017)

Bibtex source | Document: PDF | Project page

Hendricks, L., Wang, O., Shechtman, E., Sivic, J., Darrell, T. and Russell, B.

Localizing Moments in Video with Natural Language

IEEE International Conference on Computer Vision (2017)

Bibtex source | Document: PDF | Project page

Miech, A., Laptev, I. and Sivic, J.

Learnable pooling with Context Gating for video classification

CVPR 2017 Workshop on YouTube-8M Large-Scale Video Understanding (2017)

Bibtex source | Document: PDF | Project page

Rocco, I., Arandjelovic, R. and Sivic, J.

Convolutional Neural Network Architecture for Geometric Matching

IEEE Conference on Computer Vision and Pattern Recognition (2017)

Bibtex source | Document: PDF | Project page

Girdhar, R., Ramanan, D., Gupta, A., Sivic, J. and Russell, B.

ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification

IEEE Conference on Computer Vision and Pattern Recognition (2017)

Bibtex source | Document: PDF | Project page

Sattler, T., Torii, A., Sivic, J., Pollefeys, M., Taira, H., Okutomi, M. and Pajdla, T.

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

IEEE Conference on Computer Vision and Pattern Recognition (2017)

Bibtex source | Document: PDF | Project page

Alayrac, J.-B., Bojanowski, P., Agrawal, N., Laptev, I., Sivic, J. and Lacoste-Julien, S.

Learning from narrated instruction videos

IEEE Transactions on Pattern Analysis and Machine Intelligence (2017)

Bibtex source | Document: PDF | Link to publisher’s webpage

Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T. and Sivic, J.

NetVLAD: CNN architecture for weakly supervised place recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (2017)

Bibtex source | Document: PDF | Project page | Link to publisher’s webpage

Torii, A., Arandjelovic, R., Sivic, J., Okutomi, M. and Pajdla, T.

24/7 place recognition by view synthesis

IEEE Transactions on Pattern Analysis and Machine Intelligence (2017)

Bibtex source | Document: PDF | Project page | Link to publisher’s webpage

2016

Alayrac, J.-B., Bojanowski, P., Agrawal, N., Laptev, I., Sivic, J. and Lacoste-Julien, S.

Unsupervised learning from narrated instruction videos

IEEE Conference on Computer Vision and Pattern Recognition (2016)

Bibtex source | Document: PDF | Project page

Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T. and Sivic, J.

NetVLAD: CNN architecture for weakly supervised place recognition

IEEE Conference on Computer Vision and Pattern Recognition (2016)

Bibtex source | Document: PDF | Project page

Gronat, P., Obozinski, G., Sivic, J. and Pajdla, T.

Learning per-location classifiers for visual place recognition

International Journal of Computer Vision (2016)

Bibtex source | Document: PDF

Aubry, M., Russell, B. and Sivic, J.

Visual geo-localization of non-photographic depictions via 2D-3D alignment

In Visual Analysis and Geolocalization of Large-Scale Imagery, Springer (2016)

Bibtex source | Document: PDF | Link to publisher’s webpage

2015

Lee, S., Maisonneuve, N., Crandall, D., Efros, A. and Sivic, J.

Linking Past to Present: Discovering Style in Two Centuries of Architecture

International Conference on Computational Photography (2015)

Bibtex source | Document: PDF | Project page

Oquab, M., Bottou, L., Laptev, I. and Sivic, J.

Is object localization for free? – Weakly-supervised learning with convolutional neural networks

IEEE Conference on Computer Vision and Pattern Recognition (2015)

Bibtex source | Document: PDF | Project page

Torii, A., Arandjelovic, R., Sivic, J., Pajdla, T. and Okutomi, M.

24/7 place recognition by view synthesis

IEEE Conference on Computer Vision and Pattern Recognition (2015)

Bibtex source | Document: PDF | Project page

Chari, V., Lacoste-Julien, S., Laptev, I. and Sivic, J.

On Pairwise Costs for Network Flow Multi-Object Tracking

IEEE Conference on Computer Vision and Pattern Recognition (2015)

Bibtex source | Document: PDF | Project page

Torii, A., Sivic, J., Pajdla, T. and Okutomi, M.

Visual place recognition with repetitive structures

IEEE Transactions on Pattern Analysis and Machine Intelligence (2015)

Bibtex source | Document: PDF | Project page | Code | Link to publisher’s webpage

Seguin, G., Alahari, K., Sivic, J. and Laptev, I.

Pose Estimation and Segmentation of Multiple People in Stereoscopic Movies

IEEE Transactions on Pattern Analysis and Machine Intelligence (2015)

Bibtex source | Document: PDF | Project page | Link to publisher’s webpage

Doersch, C., Singh, S., Gupta, A., Sivic, J. and Efros, A.

What Makes Paris Look like Paris?

Communications of the ACM (2015)

Bibtex source | Document: PDF | Link to publisher’s webpage

2014

Aubry, M., Maturana, D., Efros, A., Russell, B. and Sivic, J.

Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models

IEEE Conference on Computer Vision and Pattern Recognition (2014)

Bibtex source | Document: PDF | Project page

Aubry, M., Russell, B. and Sivic, J.

Painting-to-3D Model Alignment Via Discriminative Visual Elements

ACM Transactions on Graphics (2014)

Bibtex source | Document (pre-print): PDF | Project page

Bojanowski, P., Lajugie, R., Bach, F., Laptev, I., Ponce, J., Schmid, C. and Sivic, J.

Weakly Supervised Action Labeling in Videos Under Ordering Constraints

European Conference on Computer Vision (2014)

Bibtex source | Document: PDF | Extended TR on arXiv | Project page

Fouhey, D., Delaitre, V., Gupta, A., Efros, A., Laptev, I. and Sivic, J.

People watching: human actions as a cue for single-view geometry

International Journal of Computer Vision (2014)

Bibtex source | Document (pre-print): PDF | Journal page | Project page

Oquab, M., Bottou, L., Laptev, I. and Sivic, J.

Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks

IEEE Conference on Computer Vision and Pattern Recognition (2014)

Bibtex source | Document: PDF | Project page

Vu, T.-H., Olsson, C., Laptev, I., Oliva, A. and Sivic, J.

Predicting Actions from Static Scenes

European Conference on Computer Vision (2014)

Bibtex source | Document: PDF | Project page

Whyte, O., Sivic, J. and Zisserman, A.

Deblurring Shaken and Partially Saturated Images

International Journal of Computer Vision (2014)

Bibtex source | Document: PDF | Journal page

Whyte, O., Sivic, J., Zisserman, A. and Ponce, J.

Efficient, Blind, Spatially-Variant Deblurring for Shaken Images

In Motion Deblurring: Algorithms and Systems

Cambridge University Press (2014)

Bibtex source | Link to publisher’s webpage

2013

Alahari, K., Seguin, G., Sivic, J. and Laptev, I.

Pose estimation and segmentation of people in 3D movies

IEEE International Conference on Computer Vision (2013)

Bibtex source | Document: PDF | Project page

Bojanowski, P., Bach, F., Laptev, I., Ponce, J., Schmid, C. and Sivic, J.

Finding actors and actions in movies

IEEE International Conference on Computer Vision (2013)

Bibtex source | Document: PDF | Project page

Torii, A., Sivic, J., Pajdla, T. and Okutomi, M.

Visual place recognition with repetitive structures

IEEE Conference on Computer Vision and Pattern Recognition (2013)

Bibtex source | Document: PDF | Project page | Code

Gronat, P., Obozinski, G., Sivic, J. and Pajdla, T.

Learning per-location classifiers for visual place recognition

IEEE Conference on Computer Vision and Pattern Recognition (2013)

Bibtex source | Document: PDF

2012

Delaitre, V., Fouhey, D., Laptev, I., Sivic, J., Gupta, A. and Efros, A.

Scene semantics from long-term observation of people

European Conference on Computer Vision (2012)

Bibtex source | Document: PDF | Project page

Doersch, C., Singh, S., Gupta, A., Sivic, J. and Efros, A.

What makes Paris look like Paris?

ACM Transactions on Graphics (SIGGRAPH 2012)

Bibtex source | PDF (71MB) | PDF (8MB) | Project page | Video | SIGGRAPH talk slides (pptx 238MB)

Fouhey, D., Delaitre, V., Gupta, A., Efros, A., Laptev, I. and Sivic, J.

People watching: human actions as a cue for single-view geometry

European Conference on Computer Vision (2012)

Bibtex source | Document: PDF | Project page

Rodriguez, M., Sivic, J. and Laptev, I.

Analysis of crowded scenes in video

Intelligent Video Surveillance Systems, Wiley (2012)

Bibtex source | Link to publisher’s webpage

Whyte, O., Sivic, J., Zisserman, A. and Ponce, J.

Non-uniform deblurring for shaken images

International Journal of Computer Vision (2012)

Bibtex source | Document: PDF | Journal page

2011

Delaitre, V., Sivic, J. and Laptev, I.

Learning person-object interactions for action recognition in still images

Advances in Neural Information Processing Systems (2011)

Bibtex source | Document: PDF

Rodriguez, M., Sivic, J., Laptev, I. and Audibert, J.-Y.

Data-driven crowd analysis in videos

Proceedings of the International Conference on Computer Vision (2011)

Bibtex source | Document: PDF | Project page

Rodriguez, M., Laptev, I., Sivic, J. and Audibert, J.-Y.

Density-aware person detection and tracking in crowds

Proceedings of the International Conference on Computer Vision (2011)

Bibtex source | Document: PDF | Project page

Lezama, J., Alahari, K., Sivic, J. and Laptev, I.

Track to the future: Spatio-temporal video segmentation with long-range motion cues

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2011)

Bibtex source | Document: PDF | Project page

Russell, B., Sivic, J., Ponce, J. and Dessales, H.

Automatic alignment of paintings and photographs depicting a 3D scene

3rd International IEEE Workshop on 3D Representation for Recognition (2011)

Bibtex source | Document: PDF | Project page

Torii, A., Sivic, J. and Pajdla, T.

Visual localization by linear combination of image descriptors

2nd IEEE Workshop on Mobile Vision (2011)

Bibtex source | Document: PDF

Whyte, O., Sivic, J. and Zisserman, A.

Deblurring shaken and partially saturated images

IEEE Workshop on Color and Photometry in Computer Vision (2011)

Bibtex source | Document: PDF | Project page

2010

Philbin, J., Sivic, J. and Zisserman, A.

Geometric latent Dirichlet allocation on a matching graph for large-scale image datasets

International Journal of Computer Vision (2010)

Bibtex source | Document: PDF

Knopp, J., Sivic, J. and Pajdla, T.

Avoiding confusing features in place recognition

Proceedings of the European Conference on Computer Vision (2010)

Bibtex source | Document: PDF | Project page

Philbin, J., Isard, M., Sivic, J. and Zisserman, A.

Descriptor learning for efficient retrieval

Proceedings of the European Conference on Computer Vision (2010)

Bibtex source | Document: PDF

Cherniavsky, N., Laptev, I., Sivic, J. and Zisserman, A.

Semi-supervised learning of facial attributes in video

First International Workshop on Parts and Attributes, European Conference on Computer Vision (2010)

Bibtex source | Document: PDF

Kaneva, B., Sivic, J., Torralba, A., Avidan, S. and Freeman, W. T.

Matching and predicting street level images

Workshop on Vision for Cognitive Tasks, European Conference on Computer Vision (2010)

Bibtex source | Document: PDF

Delaitre, V., Laptev, I. and Sivic, J.

Recognizing human actions in still images: a study of bag-of-features and part-based representations

Proceedings of the British Machine Vision Conference (2010)

Bibtex source | Document: PDF

Whyte, O., Sivic, J., Zisserman, A. and Ponce, J.

Non-uniform deblurring for shaken images

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2010)

Bibtex source | Document: PDF | Project page

Kaneva, B., Sivic, J., Torralba, A., Avidan, S. and Freeman, W.

Infinite Images: Creating and Exploring a Large Photorealistic Virtual Space

Proceedings of the IEEE (2010)

Bibtex source | Document: PDF

2009

Sivic, J. and Zisserman, A.

Efficient Visual Search Cast as Text Retrieval

IEEE Transactions on Pattern Analysis and Machine Intelligence (2009)

Bibtex source | Document: PDF

Sivic, J., Everingham, M. and Zisserman, A.

"Who are you?": Learning person specific classifiers from video

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2009)

Bibtex source | Document: PDF | Video: AVI

Duchenne, O., Laptev, I., Sivic, J., Bach, F. and Ponce, J.

Automatic annotation of human actions in video

Proceedings of the International Conference on Computer Vision (2009)

Bibtex source | Document: PDF | Video: AVI

Whyte, O., Sivic, J. and Zisserman, A.

Get out of my picture! Internet-based inpainting

Proceedings of the British Machine Vision Conference (2009)

Bibtex source | Document: PDF | Project page

Russell, B., Efros, A., Sivic, J., Freeman, W. and Zisserman, A.

Segmenting Scenes by Matching Image Composites

Advances in Neural Information Processing Systems (2009)

Bibtex source | Document: PDF | Project page

2008

Everingham, M., Sivic, J. and Zisserman, A.

Taking the Bite out of Automated Naming of Characters in TV Video

Image and Vision Computing (2008)

Bibtex source | Abstract | Document: PDF

Liu, C., Yuen, J., Torralba, A., Sivic, J. and Freeman, W.T.

SIFT Flow: Dense Correspondence across Different Scenes

Proceedings of the 10th European Conference on Computer Vision, Marseille, France (2008)

Bibtex source | Abstract | Document: PDF | Project page

Sivic, J., Kaneva, B., Torralba, A., Avidan, S. and Freeman, W.T.

Creating and Exploring a Large Photorealistic Virtual Space

Proceedings of the First IEEE Workshop on Internet Vision (2008)

Bibtex source | Abstract | Document: PDF

Philbin, J., Chum, O., Isard, M., Sivic, J. and Zisserman, A.

Lost in Quantization: Improving Particular Object Retrieval in Large Scale Image Databases

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2008)

Bibtex source | Abstract | Document: ps.gz PDF

Philbin, J., Sivic, J. and Zisserman, A.

Geometric LDA: A Generative Model for Particular Object Discovery

Proceedings of the British Machine Vision Conference (2008)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J., Russell, B. C., Zisserman, A., Freeman, W. T. and Efros, A. A.

Unsupervised Discovery of Visual Object Class Hierarchies

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2008)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J. and Zisserman, A.

Efficient Visual Search for Objects in Videos

Proceedings of the IEEE (2008)

Bibtex source | Abstract | Document: PDF

2007

Chum, O., Philbin, J., Sivic, J., Isard, M. and Zisserman, A.

Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval

Proceedings of the 11th International Conference on Computer Vision, Rio de Janeiro, Brazil (2007)

Bibtex source | Abstract | Document: ps.gz PDF

Philbin, J., Chum, O., Isard, M., Sivic, J. and Zisserman, A.

Object retrieval with large vocabularies and fast spatial matching

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2007)

Bibtex source | Abstract | Document: ps.gz PDF

2006

Everingham, M., Sivic, J. and Zisserman, A.

Hello! My name is... Buffy -- Automatic Naming of Characters in TV Video

Proceedings of the British Machine Vision Conference (2006)

Bibtex source | Abstract | Document: ps.gz PDF

Philbin, J., Bosch, A., Chum, O., Geusebroek, J., Sivic, J. and Zisserman, A.

Oxford TRECVID 2006 - Notebook Paper

Proceedings of the TRECVID 2006 Workshop (2006)

Bibtex source | Abstract | Document: ps.gz PDF

Russell, B. C., Efros, A. A., Sivic, J., Freeman, W. T. and Zisserman, A.

Using Multiple Segmentations to Discover Objects and their Extent in Image Collections

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2006)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J., Schaffalitzky, F. and Zisserman, A.

Object Level Grouping for Video Shots

International Journal of Computer Vision (2006)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J., Zitnick, C. L. and Szeliski, R.

Finding people in repeated shots of the same scene

Proceedings of the British Machine Vision Conference (2006)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J.

Efficient visual search of images and videos

PhD thesis (2006)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J. and Zisserman, A.

Video Google: Efficient Visual Search of Videos

Toward Category-Level Object Recognition (2006)

Bibtex source | Abstract | Document: ps.gz PDF

2005

Sivic, J., Everingham, M. and Zisserman, A.

Person spotting: video shot retrieval for face sets

International Conference on Image and Video Retrieval (CIVR 2005), Singapore (2005)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J., Russell, B. C., Efros, A. A., Zisserman, A. and Freeman, W. T.

Discovering objects and their location in images

Proceedings of the International Conference on Computer Vision (2005)

Bibtex source | Abstract | Document: ps.gz PDF

2004

Sivic, J., Schaffalitzky, F. and Zisserman, A.

Object Level Grouping for Video Shots

Proceedings of the 8th European Conference on Computer Vision, Prague, Czech Republic (2004)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J. and Zisserman, A.

Video Data Mining Using Configurations of Viewpoint Invariant Regions

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC (2004)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J., Schaffalitzky, F. and Zisserman, A.

Efficient Object Retrieval from Videos

Proceedings of the 12th European Signal Processing Conference (EUSIPCO '04), Vienna, Austria (2004)

Bibtex source | Abstract | Document: ps.gz PDF

Sivic, J. and Zisserman, A.

Efficient Visual Content Retrieval and Mining in Videos

Pacific-Rim Conference on Multimedia (PCM 2004), Tokyo, Japan (2004)

Bibtex source | Abstract | Document: ps.gz PDF

2003

Sivic, J. and Zisserman, A.

Video Google: A Text Retrieval Approach to Object Matching in Videos

Proceedings of the International Conference on Computer Vision (2003)

Bibtex source | Abstract | Document: ps.gz PDF