Link to original content: https://dblp.dagstuhl.de/pid/34/6488.xml

Du Tran

Tarun Kalluri Weiyao Wang 0001 Heng Wang Manmohan Chandraker Lorenzo Torresani Du Tran Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision. 2693-2703 2024 CVPR Workshops https://doi.org/10.1109/CVPRW63382.2024.00275 conf/cvpr/2024w db/conf/cvpr/cvprw2024.html#Kalluri0WCTT22

Tarun Kalluri Deepak Pathak Manmohan Chandraker Du Tran FLAVR: flow-free architecture for fast video frame interpolation. 83 2023 September 34 Mach. Vis. Appl. 5 https://doi.org/10.1007/s00138-023-01433-y db/journals/mva/mva34.html#KalluriPCT23

Xitong Yang Fu-Jen Chu Matt Feiszli Raghav Goyal Lorenzo Torresani Du Tran Relational Space-Time Query in Long-Form Videos. 6398-6408 2023 CVPR https://doi.org/10.1109/CVPR52729.2023.00619 conf/cvpr/2023 db/conf/cvpr/cvpr2023.html#YangCFGTT23

Tarun Kalluri Deepak Pathak Manmohan Chandraker Du Tran FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation. 2070-2081 2023 WACV https://doi.org/10.1109/WACV56688.2023.00211 conf/wacv/2023 db/conf/wacv/wacv2023.html#KalluriPCT23

Raghav Goyal Effrosyni Mavroudi Xitong Yang Sainbayar Sukhbaatar Leonid Sigal Matt Feiszli Lorenzo Torresani Du Tran MINOTAUR: Multi-task Video Grounding From Multimodal Queries. 2023 abs/2302.08063 CoRR https://doi.org/10.48550/arXiv.2302.08063 db/journals/corr/corr2302.html#abs-2302-08063

Tarun Kalluri Weiyao Wang 0001 Heng Wang Manmohan Chandraker Lorenzo Torresani Du Tran Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision. 2023 abs/2303.05503 CoRR https://doi.org/10.48550/arXiv.2303.05503 db/journals/corr/corr2303.html#abs-2303-05503

Du Tran Jitendra Malik Learning Space-Time Semantic Correspondences. 2023 abs/2306.10208 CoRR https://doi.org/10.48550/arXiv.2306.10208 db/journals/corr/corr2306.html#abs-2306-10208

Weiyao Wang 0001 Matt Feiszli Heng Wang Jitendra Malik Du Tran Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity. 4412-4422 2022 CVPR https://doi.org/10.1109/CVPR52688.2022.00438 conf/cvpr/2022 db/conf/cvpr/cvpr2022.html#0001FWMT22

Jue Wang 0001 Gedas Bertasius Du Tran Lorenzo Torresani Long-Short Temporal Contrastive Learning of Video Transformers. 13990-14000 2022 CVPR https://doi.org/10.1109/CVPR52688.2022.01362 conf/cvpr/2022 db/conf/cvpr/cvpr2022.html#WangBTT22

Weiyao Wang 0001 Matt Feiszli Heng Wang Jitendra Malik Du Tran Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity. 2022 abs/2204.06107 CoRR https://doi.org/10.48550/arXiv.2204.06107 db/journals/corr/corr2204.html#abs-2204-06107

Weiyao Wang 0001 Matt Feiszli Heng Wang Du Tran Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation. 10756-10765 2021 ICCV https://doi.org/10.1109/ICCV48922.2021.01060 conf/iccv/2021 db/conf/iccv/iccv2021.html#0001FWT21

Weiyao Wang 0001 Matt Feiszli Heng Wang Du Tran Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation. 2021 abs/2104.04691 CoRR https://arxiv.org/abs/2104.04691 db/journals/corr/corr2104.html#abs-2104-04691

Jue Wang 0001 Gedas Bertasius Du Tran Lorenzo Torresani Long-Short Temporal Contrastive Learning of Video Transformers. 2021 abs/2106.09212 CoRR https://arxiv.org/abs/2106.09212 db/journals/corr/corr2106.html#abs-2106-09212

Linchao Zhu Du Tran Laura Sevilla-Lara Yi Yang 0001 Matt Feiszli Heng Wang FASTER Recurrent Networks for Efficient Video Classification. 13098-13105 2020 AAAI https://doi.org/10.1609/aaai.v34i07.7012 conf/aaai/2020 db/conf/aaai/aaai2020.html#ZhuTSYFW20

Heng Wang Du Tran Lorenzo Torresani Matt Feiszli Video Modeling With Correlation Networks. 349-358 2020 CVPR https://openaccess.thecvf.com/content_CVPR_2020/html/Wang_Video_Modeling_With_Correlation_Networks_CVPR_2020_paper.html https://doi.org/10.1109/CVPR42600.2020.00043 conf/cvpr/2020 db/conf/cvpr/cvpr2020.html#WangTTF20

Weiyao Wang 0001 Du Tran Matt Feiszli What Makes Training Multi-Modal Classification Networks Hard? 12692-12702 2020 CVPR https://openaccess.thecvf.com/content_CVPR_2020/html/Wang_What_Makes_Training_Multi-Modal_Classification_Networks_Hard_CVPR_2020_paper.html https://doi.org/10.1109/CVPR42600.2020.01271 conf/cvpr/2020 db/conf/cvpr/cvpr2020.html#WangTF20

Humam Alwassel Dhruv Mahajan 0001 Bruno Korbar Lorenzo Torresani Bernard Ghanem Du Tran Self-Supervised Learning by Cross-Modal Audio-Video Clustering. 2020 NeurIPS https://proceedings.neurips.cc/paper/2020/hash/6f2268bd1d3d3ebaabb04d6b5d099425-Abstract.html conf/nips/2020 db/conf/nips/neurips2020.html#Alwassel0KTGT20

Tarun Kalluri Deepak Pathak Manmohan Chandraker Du Tran FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation. 2020 abs/2012.08512 CoRR https://arxiv.org/abs/2012.08512 db/journals/corr/corr2012.html#abs-2012-08512

Antoine Miech Ivan Laptev Josef Sivic Heng Wang Lorenzo Torresani Du Tran Leveraging the Present to Anticipate the Future in Videos. 2915-2922 2019 CVPR Workshops http://openaccess.thecvf.com/content_CVPRW_2019/html/Precognition/Miech_Leveraging_the_Present_to_Anticipate_the_Future_in_Videos_CVPRW_2019_paper.html https://doi.org/10.1109/CVPRW.2019.00351 conf/cvpr/2019w db/conf/cvpr/cvprw2019.html#MiechLSWTT19

Deepti Ghadiyaram Du Tran Dhruv Mahajan 0001 Large-Scale Weakly-Supervised Pre-Training for Video Action Recognition. 12046-12055 2019 CVPR http://openaccess.thecvf.com/content_CVPR_2019/html/Ghadiyaram_Large-Scale_Weakly-Supervised_Pre-Training_for_Video_Action_Recognition_CVPR_2019_paper.html https://doi.org/10.1109/CVPR.2019.01232 conf/cvpr/2019 db/conf/cvpr/cvpr2019.html#GhadiyaramTM19

Rohit Girdhar Du Tran Lorenzo Torresani Deva Ramanan DistInit: Learning Video Representations Without a Single Labeled Video. 852-861 2019 ICCV https://doi.org/10.1109/ICCV.2019.00094 conf/iccv/2019 db/conf/iccv/iccv2019.html#GirdharTTR19

Du Tran Heng Wang Matt Feiszli Lorenzo Torresani Video Classification With Channel-Separated Convolutional Networks. 5551-5560 2019 ICCV https://doi.org/10.1109/ICCV.2019.00565 conf/iccv/2019 db/conf/iccv/iccv2019.html#TranWFT19

Bruno Korbar Du Tran Lorenzo Torresani SCSampler: Sampling Salient Clips From Video for Efficient Action Recognition. 6231-6241 2019 ICCV https://doi.org/10.1109/ICCV.2019.00633 conf/iccv/2019 db/conf/iccv/iccv2019.html#KorbarTT19

Gedas Bertasius Christoph Feichtenhofer Du Tran Jianbo Shi Lorenzo Torresani Learning Temporal Pose Estimation from Sparsely-Labeled Videos. 3021-3032 2019 NeurIPS https://proceedings.neurips.cc/paper/2019/hash/7137debd45ae4d0ab9aa953017286b20-Abstract.html http://papers.nips.cc/paper/8567-learning-temporal-pose-estimation-from-sparsely-labeled-videos conf/nips/2019 db/conf/nips/nips2019.html#BertasiusFTST19

Rohit Girdhar Du Tran Lorenzo Torresani Deva Ramanan DistInit: Learning Video Representations without a Single Labeled Video. 2019 abs/1901.09244 CoRR http://arxiv.org/abs/1901.09244 db/journals/corr/corr1901.html#abs-1901-09244

Du Tran Heng Wang Lorenzo Torresani Matt Feiszli Video Classification with Channel-Separated Convolutional Networks. 2019 abs/1904.02811 CoRR http://arxiv.org/abs/1904.02811 db/journals/corr/corr1904.html#abs-1904-02811

Bruno Korbar Du Tran Lorenzo Torresani SCSampler: Sampling Salient Clips from Video for Efficient Action Recognition. 2019 abs/1904.04289 CoRR http://arxiv.org/abs/1904.04289 db/journals/corr/corr1904.html#abs-1904-04289

Deepti Ghadiyaram Matt Feiszli Du Tran Xueting Yan Heng Wang Dhruv Mahajan 0001 Large-scale weakly-supervised pre-training for video action recognition. 2019 abs/1905.00561 CoRR http://arxiv.org/abs/1905.00561 db/journals/corr/corr1905.html#abs-1905-00561

Weiyao Wang 0001 Du Tran Matt Feiszli What Makes Training Multi-Modal Networks Hard? 2019 abs/1905.12681 CoRR http://arxiv.org/abs/1905.12681 db/journals/corr/corr1905.html#abs-1905-12681

Heng Wang Du Tran Lorenzo Torresani Matt Feiszli Video Modeling with Correlation Networks. 2019 abs/1906.03349 CoRR http://arxiv.org/abs/1906.03349 db/journals/corr/corr1906.html#abs-1906-03349

Yufei Wang Du Tran Lorenzo Torresani UniDual: A Unified Model for Image and Video Understanding. 2019 abs/1906.03857 CoRR http://arxiv.org/abs/1906.03857 db/journals/corr/corr1906.html#abs-1906-03857

Gedas Bertasius Christoph Feichtenhofer Du Tran Jianbo Shi Lorenzo Torresani Learning Temporal Pose Estimation from Sparsely-Labeled Videos. 2019 abs/1906.04016 CoRR http://arxiv.org/abs/1906.04016 db/journals/corr/corr1906.html#abs-1906-04016

Linchao Zhu Laura Sevilla-Lara Du Tran Matt Feiszli Yi Yang 0001 Heng Wang FASTER Recurrent Networks for Video Classification. 2019 abs/1906.04226 CoRR http://arxiv.org/abs/1906.04226 db/journals/corr/corr1906.html#abs-1906-04226

Humam Alwassel Dhruv Mahajan 0001 Lorenzo Torresani Bernard Ghanem Du Tran Self-Supervised Learning by Cross-Modal Audio-Video Clustering. 2019 abs/1911.12667 CoRR http://arxiv.org/abs/1911.12667 db/journals/corr/corr1911.html#abs-1911-12667

Rohit Girdhar Georgia Gkioxari Lorenzo Torresani Manohar Paluri Du Tran Detect-and-Track: Efficient Pose Estimation in Videos. 350-359 2018 CVPR http://openaccess.thecvf.com/content_cvpr_2018/html/Girdhar_Detect-and-Track_Efficient_Pose_CVPR_2018_paper.html https://doi.org/10.1109/CVPR.2018.00044 https://doi.ieeecomputersociety.org/10.1109/CVPR.2018.00044 conf/cvpr/2018 db/conf/cvpr/cvpr2018.html#GirdharGTPT18

Du Tran Heng Wang Lorenzo Torresani Jamie Ray Yann LeCun Manohar Paluri A Closer Look at Spatiotemporal Convolutions for Action Recognition. 6450-6459 2018 CVPR http://openaccess.thecvf.com/content_cvpr_2018/html/Tran_A_Closer_Look_CVPR_2018_paper.html https://doi.org/10.1109/CVPR.2018.00675 https://doi.ieeecomputersociety.org/10.1109/CVPR.2018.00675 conf/cvpr/2018 db/conf/cvpr/cvpr2018.html#TranWTRLP18

Jamie Ray Heng Wang Du Tran Yufei Wang Matt Feiszli Lorenzo Torresani Manohar Paluri Scenes-Objects-Actions: A Multi-task, Multi-label Video Dataset. 660-676 2018 ECCV (14) https://doi.org/10.1007/978-3-030-01264-9_39 conf/eccv/2018-14 db/conf/eccv/eccv2018-14.html#RayWTWFTP18

Bruno Korbar Du Tran Lorenzo Torresani Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization. 7774-7785 2018 NeurIPS https://proceedings.neurips.cc/paper/2018/hash/c4616f5a24a66668f11ca4fa80525dc4-Abstract.html http://papers.nips.cc/paper/8002-cooperative-learning-of-audio-and-video-models-from-self-supervised-synchronization conf/nips/2018 db/conf/nips/nips2018.html#KorbarTT18

Bruno Korbar Du Tran Lorenzo Torresani Co-Training of Audio and Video Representations from Self-Supervised Temporal Synchronization. 2018 abs/1807.00230 CoRR http://arxiv.org/abs/1807.00230 db/journals/corr/corr1807.html#abs-1807-00230

Gedas Bertasius Christoph Feichtenhofer Du Tran Jianbo Shi Lorenzo Torresani Learning Discriminative Motion Features Through Detection. 2018 abs/1812.04172 CoRR http://arxiv.org/abs/1812.04172 db/journals/corr/corr1812.html#abs-1812-04172

Shruti Agarwal Du Tran Lorenzo Torresani Hany Farid Deciphering Severely Degraded License Plates. 138-143 2017 Media Watermarking, Security, and Forensics https://doi.org/10.2352/ISSN.2470-1173.2017.7.MWSF-337 conf/mediaforensics/2017 db/conf/mediaforensics/mediaforensics2017.html#AgarwalTTF17

Joost R. van Amersfoort Anitha Kannan Marc'Aurelio Ranzato Arthur Szlam Du Tran Soumith Chintala Transformation-Based Models of Video Sequences. 2017 abs/1701.08435 CoRR http://arxiv.org/abs/1701.08435 db/journals/corr/corr1701.html#AmersfoortKRSTC17

Du Tran Jamie Ray Zheng Shou 0001 Shih-Fu Chang Manohar Paluri ConvNet Architecture Search for Spatiotemporal Feature Learning. 2017 abs/1708.05038 CoRR http://arxiv.org/abs/1708.05038 db/journals/corr/corr1708.html#abs-1708-05038

Du Tran Heng Wang Lorenzo Torresani Jamie Ray Yann LeCun Manohar Paluri A Closer Look at Spatiotemporal Convolutions for Action Recognition. 2017 abs/1711.11248 CoRR http://arxiv.org/abs/1711.11248 db/journals/corr/corr1711.html#abs-1711-11248

Rohit Girdhar Georgia Gkioxari Lorenzo Torresani Manohar Paluri Du Tran Detect-and-Track: Efficient Pose Estimation in Videos. 2017 abs/1712.09184 CoRR http://arxiv.org/abs/1712.09184 db/journals/corr/corr1712.html#abs-1712-09184

Du Tran Representations and Models for Large-Scale Video Understanding. Dartmouth College, USA 2016 https://digitalcommons.dartmouth.edu/dissertations/53

Du Tran Lorenzo Torresani EXMOVES: Mid-level Features for Efficient Action Recognition and Video Analysis. 239-253 2016 119 Int. J. Comput. Vis. 3 https://doi.org/10.1007/s11263-016-0905-6 db/journals/ijcv/ijcv119.html#TranT16

Du Tran Lubomir D. Bourdev Rob Fergus Lorenzo Torresani Manohar Paluri Deep End2End Voxel2Voxel Prediction. 402-409 2016 CVPR Workshops https://doi.org/10.1109/CVPRW.2016.57 https://doi.ieeecomputersociety.org/10.1109/CVPRW.2016.57 conf/cvpr/2016w db/conf/cvpr/cvprw2016.html#TranBFTP16

Du Tran Manohar Paluri Lorenzo Torresani ViCom: Benchmark and Methods for Video Comprehension. 2016 abs/1606.07373 CoRR http://arxiv.org/abs/1606.07373 db/journals/corr/corr1606.html#TranPT16

Du Tran Lubomir D. Bourdev Rob Fergus Lorenzo Torresani Manohar Paluri Learning Spatiotemporal Features with 3D Convolutional Networks. 4489-4497 2015 ICCV https://doi.org/10.1109/ICCV.2015.510 https://doi.ieeecomputersociety.org/10.1109/ICCV.2015.510 conf/iccv/2015 db/conf/iccv/iccv2015.html#TranBFTP15

Du Tran Lubomir D. Bourdev Rob Fergus Lorenzo Torresani Manohar Paluri Deep End2End Voxel2Voxel Prediction. 2015 abs/1511.06681 CoRR http://arxiv.org/abs/1511.06681 db/journals/corr/corr1511.html#TranBFTP15

Du Tran Junsong Yuan 0001 David A. Forsyth Video Event Detection: From Subvolume Localization to Spatiotemporal Path Search. 404-416 2014 36 IEEE Trans. Pattern Anal. Mach. Intell. 2 https://doi.org/10.1109/TPAMI.2013.137 http://doi.ieeecomputersociety.org/10.1109/TPAMI.2013.137 https://www.wikidata.org/entity/Q46225083 db/journals/pami/pami36.html#TranYF14

Du Tran Lorenzo Torresani EXMOVES: Classifier-based Features for Scalable Action Recognition. 2014 conf/iclr/2014 ICLR (Poster) http://arxiv.org/abs/1312.5785 db/conf/iclr/iclr2014.html#TranT13

Du Tran Lubomir D. Bourdev Rob Fergus Lorenzo Torresani Manohar Paluri C3D: Generic Features for Video Analysis. 2014 abs/1412.0767 CoRR http://arxiv.org/abs/1412.0767 db/journals/corr/corr1412.html#TranBFTP14

Du Tran Junsong Yuan 0001 Max-Margin Structured Output Regression for Spatio-Temporal Action Localization. 359-367 2012 NIPS https://proceedings.neurips.cc/paper/2012/hash/9872ed9fc22fc182d371c3e9ed316094-Abstract.html http://papers.nips.cc/paper/4794-max-margin-structured-output-regression-for-spatio-temporal-action-localization conf/nips/2012 db/conf/nips/nips2012.html#TranY12

Du Tran Junsong Yuan 0001 Optimal spatio-temporal path discovery for video event detection. 3321-3328 2011 CVPR https://doi.org/10.1109/CVPR.2011.5995416 https://doi.ieeecomputersociety.org/10.1109/CVPR.2011.5995416 conf/cvpr/2011 db/conf/cvpr/cvpr2011.html#TranY11

Du Tran Alexander Sorokin Human Activity Recognition with Metric Learning. 548-561 2008 ECCV (1) https://doi.org/10.1007/978-3-540-88682-2_42 conf/eccv/2008-1 db/conf/eccv/eccv2008-1.html#TranS08

Coauthors: Shruti Agarwal, Humam Alwassel, Joost R. van Amersfoort, Gedas Bertasius, Lubomir D. Bourdev, Manmohan Chandraker (Manmohan Krishna Chandraker), Shih-Fu Chang, Soumith Chintala, Fu-Jen Chu, Hany Farid, Christoph Feichtenhofer, Matt Feiszli, Rob Fergus, David A. Forsyth, Deepti Ghadiyaram, Bernard Ghanem, Rohit Girdhar, Georgia Gkioxari, Raghav Goyal, Tarun Kalluri, Anitha Kannan, Bruno Korbar, Ivan Laptev, Yann LeCun, Dhruv Mahajan 0001, Jitendra Malik, Effrosyni Mavroudi, Antoine Miech, Manohar Paluri, Deepak Pathak, Deva Ramanan, Marc'Aurelio Ranzato, Jamie Ray, Laura Sevilla-Lara, Jianbo Shi, Zheng Shou 0001, Leonid Sigal, Josef Sivic, Alexander Sorokin, Sainbayar Sukhbaatar, Arthur Szlam, Lorenzo Torresani, Heng Wang, Jue Wang 0001, Weiyao Wang 0001, Yufei Wang, Xueting Yan, Xitong Yang, Yi Yang 0001, Junsong Yuan 0001, Linchao Zhu