publications

Contributions of RLLab members to the scientific community

Please check Mila publications page for a full list of publications of RL Lab members.

Years active: 2022, 2021, 2020, 2019, 2018, 2017, 2016, 2015, 2014, 2013, 2012, 2011, 2010, 2009, 2008, 2007, 2006, 2005, 2004, 2003, 2002, 2001, 2000, 1999, 1998

2022

Shagun Sodhani,Franziska Meier,Joelle Pineau,Amy Zhang; Block Contextual MDPs for Continual Learning. L4DC (2022)
Anthony GX-Chen,Veronica Chelu,Blake A. Richards,Joelle Pineau; A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions. CoRR (2022)
Annie Xie,Shagun Sodhani,Chelsea Finn,Joelle Pineau,Amy Zhang; Robust Policy Learning over Multiple Uncertainty Sets. CoRR (2022)
Thang Doan,Seyed-Iman Mirzadeh,Joelle Pineau,Mehrdad Farajtabar; Efficient Continual Learning Ensembles in Neural Network Subspaces. CoRR (2022)
Martin Cousineau,Vedat Verter,Susan A. Murphy,Joelle Pineau; Estimating causal effects with optimization-based methods: A review and empirical comparison. CoRR (2022)
Devendra Singh Sachan,Mike Lewis,Mandar Joshi,Armen Aghajanyan,Wen-tau Yih,Joelle Pineau,Luke Zettlemoyer; Improving Passage Retrieval with Zero-Shot Question Generation. CoRR (2022)
Raviteja Chunduru,Doina Precup; Attention Option-Critic. CoRR (2022)
Andrei Cristian Nica,Khimya Khetarpal,Doina Precup; The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning. CoRR (2022)
Scott Fujimoto,David Meger,Doina Precup,Ofir Nachum,Shixiang Shane Gu; Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error. CoRR (2022)
Amir Ardalan Kalantari,Mohammad Amini,Sarath Chandar,Doina Precup; Improving Sample Efficiency of Value Based Models Using Attention and Vision Transformers. CoRR (2022)
Veronica Chelu,Diana Borsa,Doina Precup,Hado van Hasselt; Selective Credit Assignment. CoRR (2022)
Arushi Jain,Sharan Vaswani,Reza Babanezhad,Csaba Szepesvári,Doina Precup; Towards Painless Policy Optimization for Constrained MDPs. CoRR (2022)
Jongmin Lee,Cosmin Paduraru,Daniel J. Mankowitz,Nicolas Heess,Doina Precup,Kee-Eung Kim,Arthur Guez; COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation. CoRR (2022)
Leo Schwinn,Doina Precup,Björn M. Eskofier,Dario Zanca; Behind the Machine's Gaze: Biologically Constrained Neural Networks Exhibit Human-like Visual Attention. CoRR (2022)
Gheorghe Comanici,Amelia Glaese,Anita Gergely,Daniel Toyama,Zafarali Ahmed,Tyler Jackson,Philippe Hamel,Doina Precup; Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning. CoRR (2022)
Leo Schwinn,Leon Bungert,An Nguyen,René Raab,Falk Pulsmeyer,Doina Precup,Björn M. Eskofier,Dario Zanca; Improving Robustness against Real-World and Worst-Case Distribution Shifts through Decision Region Quantification. CoRR (2022)
Kushal Arora,Layla El Asri,Hareesh Bahuleyan,Jackie Chi Kit Cheung; Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation. ACL (Findings) (2022)
Zichao Li,Prakhar Sharma,Xing Han Lu,Jackie Chi Kit Cheung,Siva Reddy; Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment. ACL (Findings) (2022)
Meng Cao,Yue Dong,Jackie Chi Kit Cheung; Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive Summarization. ACL (1) (2022)
Michaela Socolof,Jackie Chi Kit Cheung,Michael Wagner,Timothy J. O'Donnell; Characterizing Idioms: Conventionality and Contingency. ACL (1) (2022)
Yu Lu Liu,Rachel Bawden,Thomas Scaliom,Benoît Sagot,Jackie Chi Kit Cheung; MaskEval: Weighted MLM-Based Evaluation for Text Summarization and Simplification. CoRR (2022)
Borja Balle,Pascale Gourdeau,Prakash Panangaden; Bisimulation metrics and norms for real-weighted automata. Inf. Comput. (2022)
Clara Lacroce,Prakash Panangaden,Guillaume Rabusseau; Towards an AAK Theory Approach to Approximate Minimization in the Multi-Letter Case. CoRR (2022)
Yifei Li,Pratheeksha Nair,Kellin Pelrine,Reihaneh Rabbany; Extracting Person Names from User Generated Text: Named-Entity Recognition for Combating Human Trafficking. ACL (Findings) (2022)

2021

Joelle Pineau,Philippe Vincent-Lamarre,Koustuv Sinha,Vincent Larivière,Alina Beygelzimer,Florence d'Alché-Buc,Emily B. Fox,Hugo Larochelle; Improving Reproducibility in Machine Learning Research(A Report from the NeurIPS 2019 Reproducibility Program). J. Mach. Learn. Res. (2021)
Denis Yarats,Amy Zhang,Ilya Kostrikov,Brandon Amos,Joelle Pineau,Rob Fergus; Improving Sample Efficiency in Model-Free Reinforcement Learning from Images. AAAI (2021)
Koustuv Sinha,Prasanna Parthasarathi,Joelle Pineau,Adina Williams; UnNatural Language Inference. ACL/IJCNLP (1) (2021)
Joshua Romoff,Peter Henderson,David Kanaa,Emmanuel Bengio,Ahmed Touati,Pierre-Luc Bacon,Joelle Pineau; TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? AAMAS (2021)
Dora Jambor,Komal K. Teru,Joelle Pineau,William L. Hamilton; Exploring the Limits of Few-Shot Link Prediction in Knowledge Graphs. EACL (2021)
Koustuv Sinha,Robin Jia,Dieuwke Hupkes,Joelle Pineau,Adina Williams,Douwe Kiela; Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little. EMNLP (1) (2021)
Prasanna Parthasarathi,Koustuv Sinha,Joelle Pineau,Adina Williams; Sometimes We Want Ungrammatical Translations. EMNLP (Findings) (2021)
Amy Zhang,Shagun Sodhani,Khimya Khetarpal,Joelle Pineau; Learning Robust State Abstractions for Hidden-Parameter Block MDPs. ICLR (2021)
Wonseok Jeon,Chen-Yang Su,Paul Barde,Thang Doan,Derek Nowrouzezahrai,Joelle Pineau; Regularized Inverse Reinforcement Learning. ICLR (2021)
Jongmin Lee,Wonseok Jeon,Byung-Jun Lee,Joelle Pineau,Kee-Eung Kim; OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation. ICML (2021)
Shagun Sodhani,Amy Zhang,Joelle Pineau; Multi-Task Reinforcement Learning with Context-based Representations. ICML (2021)
Harsh Satija,Philip S. Thomas,Joelle Pineau,Romain Laroche; Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs. NeurIPS (2021)
Prasanna Parthasarathi,Mohamed Abdelsalam,Sarath Chandar,Joelle Pineau; A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss. SIGDIAL (2021)
Prasanna Parthasarathi,Joelle Pineau,Sarath Chandar; Do Encoder Representations of Generative Dialogue Models have sufficient summary of the Information about the task ? SIGDIAL (2021)
Sylvie Delacroix,Joelle Pineau,Jessica Montgomery; Democratising the Digital Revolution: The Role of Data Governance. Reflections on Artificial Intelligence for Humanity (2021)
Koustuv Sinha,Prasanna Parthasarathi,Joelle Pineau,Adina Williams; Unnatural Language Inference. CoRR (2021)
Anuroop Sriram,Matthew J. Muckley,Koustuv Sinha,Farah Shamout,Joelle Pineau,Krzysztof J. Geras,Lea Azour,Yindalon Aphinyanaphongs,Nafissa Yakubova,William Moore; COVID-19 Prognosis via Self-Supervised Representation Learning and Multi-Image Prediction. CoRR (2021)
Bonnie Li,Vincent François-Lavet,Thang Doan,Joelle Pineau; Domain Adversarial Reinforcement Learning. CoRR (2021)
Manan Tomar,Amy Zhang,Roberto Calandra,Matthew E. Taylor,Joelle Pineau; Model-Invariant State Abstractions for Model-Based Reinforcement Learning. CoRR (2021)
Kalesha Bullard,Douwe Kiela,Joelle Pineau,Jakob N. Foerster; Quasi-Equivalence Discovery for Zero-Shot Emergent Communication. CoRR (2021)
Lucas Caccia,Rahaf Aljundi,Tinne Tuytelaars,Joelle Pineau,Eugene Belilovsky; Reducing Representation Drift in Online Continual Learning. CoRR (2021)
Prasanna Parthasarathi,Koustuv Sinha,Joelle Pineau,Adina Williams; Sometimes We Want Translationese. CoRR (2021)
Emmanuel Bengio,Joelle Pineau,Doina Precup; Correcting Momentum in Temporal Difference Learning. CoRR (2021)
Lucas Caccia,Joelle Pineau; SPeCiaL: Self-Supervised Pretraining for Continual Learning. CoRR (2021)
Prasanna Parthasarathi,Joelle Pineau,Sarath Chandar; Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ? CoRR (2021)
David Silver,Satinder Singh,Doina Precup,Richard S. Sutton; Reward is enough. Artif. Intell. (2021)
Arushi Jain,Gandharv Patil,Ayush Jain,Khimya Khetarpal,Doina Precup; Variance Penalized On-Policy and Off-Policy Actor-Critic. AAAI (2021)
Haiping Wu,Khimya Khetarpal,Doina Precup; Self-Supervised Attention-Aware Reinforcement Learning. AAAI (2021)
Borja Balle,Clara Lacroce,Prakash Panangaden,Doina Precup,Guillaume Rabusseau; Optimal Spectral-Norm Approximate Minimization of Weighted Finite Automata. ICALP (2021)
Susan Amin,Maziar Gomrokchi,Hossein Aboutalebi,Harsh Satija,Doina Precup; Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards. ICML (2021)
Nishanth V. Anand,Doina Precup; Preferential Temporal Difference Learning. ICML (2021)
Scott Fujimoto,David Meger,Doina Precup; A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation. ICML (2021)
Haque Ishfaq,Qiwen Cui,Viet Nguyen,Alex Ayoub,Zhuoran Yang,Zhaoran Wang,Doina Precup,Lin Yang; Randomized Exploration in Reinforcement Learning with General Value Function Approximation. ICML (2021)
Mohammad Pezeshki,Sékou-Oumar Kaba,Yoshua Bengio,Aaron C. Courville,Doina Precup,Guillaume Lajoie; Gradient Starvation: A Learning Proclivity in Neural Networks. NeurIPS (2021)
Mingde Zhao,Zhen Liu,Sitao Luan,Shuyuan Zhang,Doina Precup,Yoshua Bengio; A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning. NeurIPS (2021)
Khimya Khetarpal,Zafarali Ahmed,Gheorghe Comanici,Doina Precup; Temporally Abstract Partial Models. NeurIPS (2021)
Martin Klissarov,Doina Precup; Flexible Option Learning. NeurIPS (2021)
David Abel,Will Dabney,Anna Harutyunyan,Mark K. Ho,Michael L. Littman,Doina Precup,Satinder Singh; On the Expressivity of Markov Reward. NeurIPS (2021)
Emmanuel Bengio,Moksh Jain,Maksym Korablyov,Doina Precup,Yoshua Bengio; Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation. NeurIPS (2021)
Vlad Firoiu,Eser Aygün,Ankit Anand,Zafarali Ahmed,Xavier Glorot,Laurent Orseau,Lei M. Zhang,Doina Precup,Shibl Mourad; Training a First-Order Theorem Prover from Synthetic Data. CoRR (2021)
Safa Alver,Doina Precup; What is Going on Inside Recurrent Meta Reinforcement Learning Agents? CoRR (2021)
Daniel Toyama,Philippe Hamel,Anita Gergely,Gheorghe Comanici,Amelia Glaese,Zafarali Ahmed,Tyler Jackson,Shibl Mourad,Doina Precup; AndroidEnv: A Reinforcement Learning Platform for Android. CoRR (2021)
Bogdan Mazoure,Paul Mineiro,Pavithra Srinath,Reza Sharifi Sedeh,Doina Precup,Adith Swaminathan; Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL. CoRR (2021)
Haque Ishfaq,Qiwen Cui,Viet Nguyen,Alex Ayoub,Zhuoran Yang,Zhaoran Wang,Doina Precup,Lin F. Yang; Randomized Exploration for Reinforcement Learning with General Value Function Approximation. CoRR (2021)
André Barreto,Diana Borsa,Shaobo Hou,Gheorghe Comanici,Eser Aygün,Philippe Hamel,Daniel Toyama,Jonathan J. Hunt,Shibl Mourad,David Silver,Doina Precup; The Option Keyboard: Combining Skills in Reinforcement Learning. CoRR (2021)
David Venuto,Elaine Lau,Doina Precup,Ofir Nachum; Policy Gradients Incorporating the Future. CoRR (2021)
Susan Amin,Maziar Gomrokchi,Harsh Satija,Herke van Hoof,Doina Precup; A Survey of Exploration Methods in Reinforcement Learning. CoRR (2021)
Maziar Gomrokchi,Susan Amin,Hossein Aboutalebi,Alexander Wong,Doina Precup; Where Did You Learn That From? Surprising Effectiveness of Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning. CoRR (2021)
Sitao Luan,Chenqing Hua,Qincheng Lu,Jiaqi Zhu,Mingde Zhao,Shuyuan Zhang,Xiao-Wen Chang,Doina Precup; Is Heterophily A Real Nightmare For Graph Neural Networks To Do Node Classification? CoRR (2021)
Marlos C. Machado,André Barreto,Doina Precup; Temporal Abstraction in Reinforcement Learning with the Successor Representation. CoRR (2021)
Eser Aygün,Laurent Orseau,Ankit Anand,Xavier Glorot,Vlad Firoiu,Lei M. Zhang,Doina Precup,Shibl Mourad; Proving Theorems using Incremental Learning and Hindsight Experience Replay. CoRR (2021)
Safa Alver,Doina Precup; Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates. CoRR (2021)
Samin Yeasar Arnob,Riashat Islam,Doina Precup; Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning. CoRR (2021)
Samin Yeasar Arnob,Riyasat Ohib,Sergey M. Plis,Doina Precup; Single-Shot Pruning for Offline Reinforcement Learning. CoRR (2021)
Matt Grenander,Robert Belfer,Ekaterina Kochmar,Iulian Vlad Serban,François St-Hilaire,Jackie Chi Kit Cheung; Deep Discourse Analysis for Generating Personalized Feedback in Intelligent Tutor Systems. AAAI (2021)
Yue Dong,Chandra Bhagavatula,Ximing Lu,Jena D. Hwang,Antoine Bosselut,Jackie Chi Kit Cheung,Yejin Choi; On-the-Fly Attention Modulation for Neural Generation. ACL/IJCNLP (Findings) (2021)
Peng Xu,Dhruv Kumar,Wei Yang,Wenjie Zi,Keyi Tang,Chenyang Huang,Jackie Chi Kit Cheung,Simon J. D. Prince,Yanshuai Cao; Optimizing Deeper Transformers on Small Datasets. ACL/IJCNLP (1) (2021)
Ali Emami,Ian Porada,Alexandra Olteanu,Kaheer Suleman,Adam Trischler,Jackie Chi Kit Cheung; ADEPT: An Adjective-Dependent Plausibility Task. ACL/IJCNLP (1) (2021)
Yue Dong,Andrei Romascanu,Jackie Chi Kit Cheung; Discourse-Aware Unsupervised Summarization for Long Scientific Documents. EACL (2021)
Jad Kabbara,Jackie Chi Kit Cheung; Post-Editing Extractive Summaries by Definiteness Prediction. EMNLP (Findings) (2021)
Akshatha Arodi,Jackie Chi Kit Cheung; Textual Time Travel: A Temporally Informed Approach to Theory of Mind. EMNLP (Findings) (2021)
Malik H. Altakrori,Jackie Chi Kit Cheung,Benjamin C. M. Fung; The Topic Confusion Task: A Novel Evaluation Scenario for Authorship Attribution. EMNLP (Findings) (2021)
Ian Porada,Kaheer Suleman,Adam Trischler,Jackie Chi Kit Cheung; Modeling Event Plausibility with Consistent Conceptual Abstraction. NAACL-HLT (2021)
Jiapeng Wu,Yishi Xu,Yingxue Zhang,Chen Ma,Mark Coates,Jackie Chi Kit Cheung; TIE: A Framework for Embedding-based Incremental Temporal Knowledge Graph Completion. SIGIR (2021)
Yue Dong,Chandra Bhagavatula,Ximing Lu,Jena D. Hwang,Antoine Bosselut,Jackie Chi Kit Cheung,Yejin Choi; On-the-Fly Attention Modularization for Neural Generation. CoRR (2021)
Malik H. Altakrori,Jackie Chi Kit Cheung,Benjamin C. M. Fung; The Topic Confusion Task: A Novel Scenario for Authorship Attribution. CoRR (2021)
Meng Cao,Yue Dong,Jackie Chi Kit Cheung; Inspecting the Factuality of Hallucinated Entities in Abstractive Summarization. CoRR (2021)
Ian Porada,Alessandro Sordoni,Jackie Chi Kit Cheung; Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge. CoRR (2021)
Artem Kaznatcheev,Prakash Panangaden; Weighted automata are compact and actively learnable. Inf. Process. Lett. (2021)
Giorgio Bacci,Radu Mardare,Prakash Panangaden,Gordon D. Plotkin; Tensor of Quantitative Equational Theories. CALCO (2021)
Clara Lacroce,Prakash Panangaden,Guillaume Rabusseau; Extracting Weighted Automata for Approximate Minimization in Language Modelling. ICGI (2021)
Pedro H. Azevedo de Amorim,Dexter Kozen,Radu Mardare,Prakash Panangaden,Michael Roberts; Universal Semantics for the Stochastic λ-Calculus. LICS (2021)
Radu Mardare,Prakash Panangaden,Gordon D. Plotkin; Fixed-Points for Quantitative Equational Logics. LICS (2021)
Pablo Samuel Castro,Tyler Kastner,Prakash Panangaden,Mark Rowland; MICo: Improved representations via sampling-based state similarity for Markov decision processes. NeurIPS (2021)
Benoît Valiron,Shane Mansfield,Pablo Arrighi,Prakash Panangaden; Proceedings 17th International Conference on Quantum Physics and Logic, QPL 2020, Paris, France, June 2 - 6, 2020. EPTCS (2021)
Pablo Samuel Castro,Tyler Kastner,Prakash Panangaden,Mark Rowland; MICo: Learning improved representations via sampling-based state similarity for Markov decision processes. CoRR (2021)
Robert Furber,Radu Mardare,Prakash Panangaden,Dana S. Scott; Interpreting Lambda Calculus in Domain-Valued Random Variables. CoRR (2021)
Xiaoye Ding,Shenyang Huang,Abby Leung,Reihaneh Rabbany; Incorporating dynamic flight network in SEIR to model mobility between populations. Appl. Netw. Sci. (2021)
Meng-Chieh Lee,Catalina Vajiac,Aayushi Kulshrestha,Sacha Levy,Namyong Park,Cara Jones,Reihaneh Rabbany,Christos Faloutsos; INFOSHIELD: Generalizable Information-Theoretic Human-Trafficking Detection. ICDE (2021)
Zachary Yang,Anne Imouza,Kellin Pelrine,Sacha Levy,Jiewen Liu,Gabrielle Desrosiers-Brisebois,Jean-François Godbout,André Blais,Reihaneh Rabbany; Online Partisan Polarization of COVID-19. ICDM (Workshops) (2021)
Farimah Poursafaei,Reihaneh Rabbany,Zeljko Zilic; SigTran: Signature Vectors for Detecting Illicit Activities in Blockchain Transaction Networks. PAKDD (1) (2021)
Liheng Ma,Reihaneh Rabbany,Adriana Romero-Soriano; Graph Attention Networks with Positional Embeddings. PAKDD (1) (2021)
Kellin Pelrine,Jacob Danovitch,Reihaneh Rabbany; The Surprising Performance of Simple Baselines for Misinformation Detection. WWW (2021)

2020

Iulian Vlad Serban,Chinnadhurai Sankar,Michael Pieper,Joelle Pineau,Yoshua Bengio; The Bottleneck Simulator: A Model-Based Deep Reinforcement Learning Approach. J. Artif. Intell. Res. (2020)
Nathan Peiffer-Smadja,Redwan Maatoug,François-Xavier Lescure,Eric D'ortenzio,Joelle Pineau,Jean-Rémi King; Machine Learning for COVID-19 needs global collaboration and data-sharing. Nat. Mach. Intell. (2020)
Eric Crawford,Joelle Pineau; Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking. AAAI (2020)
Qizhen Zhang,Audrey Durand,Joelle Pineau; Literature Mining for Incorporating Inductive Bias in Biomedical Prediction Tasks (Student Abstract). AAAI (2020)
Koustuv Sinha,Prasanna Parthasarathi,Jasmine Wang,Ryan Lowe,William L. Hamilton,Joelle Pineau; Learning an Unreferenced Metric for Online Dialogue Evaluation. ACL (2020)
Ekaterina Kochmar,Dung Do Vu,Robert Belfer,Varun Gupta,Iulian Vlad Serban,Joelle Pineau; Automated Personalized Feedback Improves Learning Gains in An Intelligent Tutoring System. AIED (2) (2020)
Iulian Vlad Serban,Varun Gupta,Ekaterina Kochmar,Dung Do Vu,Robert Belfer,Joelle Pineau,Aaron C. Courville,Laurent Charlin,Yoshua Bengio; A Large-Scale, Open-Domain, Mixed-Interface Dialogue-Based ITS for STEM. AIED (2) (2020)
Joelle Pineau; Building reproducible, reusable, and robust machine learning software. DEBS (2020)
Massimo Caccia,Lucas Caccia,William Fedus,Hugo Larochelle,Joelle Pineau,Laurent Charlin; Language GANs Falling Short. ICLR (2020)
Ryan Lowe,Abhinav Gupta,Jakob N. Foerster,Douwe Kiela,Joelle Pineau; On the interaction between supervision and self-play in emergent communication. ICLR (2020)
Emmanuel Bengio,Joelle Pineau,Doina Precup; Interference and Generalization in Temporal Difference Learning. ICML (2020)
Lucas Caccia,Eugene Belilovsky,Massimo Caccia,Joelle Pineau; Online Learned Continual Compression with Adaptive Quantization Modules. ICML (2020)
Harsh Satija,Philip Amortila,Joelle Pineau; Constrained Markov Decision Processes via Backward Value Functions. ICML (2020)
Amy Zhang,Clare Lyle,Shagun Sodhani,Angelos Filos,Marta Kwiatkowska,Joelle Pineau,Yarin Gal,Doina Precup; Invariant Causal Prediction for Block MDPs. ICML (2020)
Maxime Wabartha,Audrey Durand,Vincent François-Lavet,Joelle Pineau; Handling Black Swan Events in Deep Learning with Diversely Extrapolated Neural Networks. IJCAI (2020)
Vincent François-Lavet,Guillaume Rabusseau,Joelle Pineau,Damien Ernst,Raphael Fonteneau; On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract). IJCAI (2020)
Ge Yang,Amy Zhang,Ari S. Morcos,Joelle Pineau,Pieter Abbeel,Roberto Calandra; Plan2Vec: Unsupervised Representation Learning by Latent Plans. L4DC (2020)
Paul Barde,Julien Roy,Wonseok Jeon,Joelle Pineau,Chris Pal,Derek Nowrouzezahrai; Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization. NeurIPS (2020)
Ruo Yu Tao,Vincent François-Lavet,Joelle Pineau; Novelty Search in Representational Space for Sample Efficient Exploration. NeurIPS (2020)
Ahmed Touati,Amy Zhang,Joelle Pineau,Pascal Vincent; Stable Policy Optimization via Off-Policy Divergence Regularization. UAI (2020)
Bogdan Mazoure,Thang Doan,Tianyu Li,Vladimir Makarenkov,Joelle Pineau,Doina Precup,Guillaume Rabusseau; Provably efficient reconstruction of policy networks. CoRR (2020)
Peter Henderson,Jieru Hu,Joshua Romoff,Emma Brunskill,Dan Jurafsky,Joelle Pineau; Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning. CoRR (2020)
Wonseok Jeon,Paul Barde,Derek Nowrouzezahrai,Joelle Pineau; Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic. CoRR (2020)
Koustuv Sinha,Shagun Sodhani,Joelle Pineau,William L. Hamilton; Evaluating Logical Generalization in Graph Neural Networks. CoRR (2020)
Joelle Pineau,Philippe Vincent-Lamarre,Koustuv Sinha,Vincent Larivière,Alina Beygelzimer,Florence d'Alché-Buc,Emily B. Fox,Hugo Larochelle; Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program). CoRR (2020)
Ekaterina Kochmar,Dung Do Vu,Robert Belfer,Varun Gupta,Iulian Vlad Serban,Joelle Pineau; Automated Personalized Feedback Improves Learning Gains in an Intelligent Tutoring System. CoRR (2020)
Deepak Sharma,Audrey Durand,Marc-André Legault,Louis-Philippe Lemieux Perreault,Audrey Lemaçon,Marie-Pierre Dubé,Joelle Pineau; Deep interpretability for GWAS. CoRR (2020)
Joshua Romoff,Peter Henderson,David Kanaa,Emmanuel Bengio,Ahmed Touati,Pierre-Luc Bacon,Joelle Pineau; TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning? CoRR (2020)
Amy Zhang,Shagun Sodhani,Khimya Khetarpal,Joelle Pineau; Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP. CoRR (2020)
Prasanna Parthasarathi,Joelle Pineau,Sarath Chandar; How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics. CoRR (2020)
Ruo Yu Tao,Vincent François-Lavet,Joelle Pineau; Novelty Search in representational space for sample efficient exploration. CoRR (2020)
Kalesha Bullard,Franziska Meier,Douwe Kiela,Joelle Pineau,Jakob N. Foerster; Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations. CoRR (2020)
Melissa Mozifian,Amy Zhang,Joelle Pineau,David Meger; Intervention Design for Effective Sim2Real Transfer. CoRR (2020)
Tanya Nair,Doina Precup,Douglas L. Arnold,Tal Arbel; Exploring uncertainty measures in deep networks for Multiple sclerosis lesion detection and segmentation. Medical Image Anal. (2020)
André Barreto,Shaobo Hou,Diana Borsa,David Silver,Doina Precup; Fast reinforcement learning with generalized policy updates. Proc. Natl. Acad. Sci. USA (2020)
Di Wu,Boyu Wang,Doina Precup,Benoit Boulet; Multiple Kernel Learning-Based Transfer Regression for Electric Load Forecasting. IEEE Trans. Smart Grid (2020)
Vishal Jain,William Fedus,Hugo Larochelle,Doina Precup,Marc G. Bellemare; Algorithmic Improvements for Deep Reinforcement Learning Applied to Interactive Fiction. AAAI (2020)
Khimya Khetarpal,Martin Klissarov,Maxime Chevalier-Boisvert,Pierre-Luc Bacon,Doina Precup; Options of Interest: Temporal Abstraction with Interest Functions. AAAI (2020)
Andrei Lupu,Doina Precup; Gifting in Multi-Agent Reinforcement Learning (Student Abstract). AAAI (2020)
David Abel,Nate Umbanhowar,Khimya Khetarpal,Dilip Arumugam,Doina Precup,Michael L. Littman; Value Preserving State-Action Abstractions. AISTATS (2020)
Tianyu Li,Bogdan Mazoure,Doina Precup,Guillaume Rabusseau; Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning. AISTATS (2020)
Philip Amortila,Doina Precup,Prakash Panangaden,Marc G. Bellemare; A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms. AISTATS (2020)
Andrei Lupu,Doina Precup; Gifting in Multi-Agent Reinforcement Learning. AAMAS (2020)
Mingde Zhao,Sitao Luan,Ian Porada,Xiao-Wen Chang,Doina Precup; META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation. AAMAS (2020)
Jhelum Chakravorty,Patrick Nadeem Ward,Julien Roy,Maxime Chevalier-Boisvert,Sumana Basu,Andrei Lupu,Doina Precup; Option-Critic in Cooperative Multi-agent Systems. AAMAS (2020)
Faizy Ahsan,Alexandre Drouin,François Laviolette,Doina Precup,Mathieu Blanchette; Phylogenetic Manifold Regularization: A semi-supervised approach to predict transcription factor binding sites. BIBM (2020)
Ivana Kajic,Eser Aygün,Doina Precup; Learning to cooperate: Emergent communication in multi-agent navigation. CogSci (2020)
Doina Precup; Keynote Lecture - Building Knowledge For AI AgentsWith Reinforcement Learning. ICCP (2020)
Khimya Khetarpal,Zafarali Ahmed,Gheorghe Comanici,David Abel,Doina Precup; What can I do here? A Theory of Affordances in Reinforcement Learning. ICML (2020)
Zilun Peng,Ahmed Touati,Pascal Vincent,Doina Precup; SVRG for Policy Evaluation with Fewer Gradient Evaluations. IJCAI (2020)
Veronica Chelu,Doina Precup,Hado van Hasselt; Forethought and Hindsight in Credit Assignment. NeurIPS (2020)
Scott Fujimoto,David Meger,Doina Precup; An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay. NeurIPS (2020)
Arthur Guez,Fabio Viola,Theophane Weber,Lars Buesing,Steven Kapturowski,Doina Precup,David Silver,Nicolas Heess; Value-driven Hindsight Modelling. NeurIPS (2020)
Martin Klissarov,Doina Precup; Reward Propagation Using Graph Convolutional Networks. NeurIPS (2020)
Zheng Wen,Doina Precup,Morteza Ibrahimi,André Barreto,Benjamin Van Roy,Satinder Singh; On Efficiency in Hierarchical Reinforcement Learning. NeurIPS (2020)
David Venuto,Jhelum Chakravorty,Léonard Boussioux,Junhao Wang,Gavin McCracken,Doina Precup; oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions. CoRR (2020)
Jean Harb,Tom Schaul,Doina Precup,Pierre-Luc Bacon; Policy Evaluation Networks. CoRR (2020)
Safa Alver,Doina Precup; A Brief Look at Generalization in Visual Meta-Reinforcement Learning. CoRR (2020)
Eser Aygün,Zafarali Ahmed,Ankit Anand,Vlad Firoiu,Xavier Glorot,Laurent Orseau,Doina Precup,Shibl Mourad; Learning to Prove from Synthetic Theorems. CoRR (2020)
Sitao Luan,Mingde Zhao,Xiao-Wen Chang,Doina Precup; Training Matters: Unlocking Potentials of Deeper Graph Convolutional Neural Networks. CoRR (2020)
Sitao Luan,Mingde Zhao,Chenqing Hua,Xiao-Wen Chang,Doina Precup; Complete the Missing Half: Augmenting Aggregation Filtering with Diversification for Graph Convolutional Networks. CoRR (2020)
Charles C. Onu,Jacob E. Miller,Doina Precup; A Fully Tensorized Recurrent Neural Network. CoRR (2020)
Tianyu Li,Doina Precup,Guillaume Rabusseau; Connecting Weighted Automata, Tensor Networks and Recurrent Neural Networks through Spectral Learning. CoRR (2020)
Gavin McCracken,Colin Daniels,Rosie Zhao,Anna Brandenberger,Prakash Panangaden,Doina Precup; A Study of Policy Gradient on a Class of Exactly Solvable Models. CoRR (2020)
Anand Kamat,Doina Precup; Diversity-Enriched Option-Critic. CoRR (2020)
Khimya Khetarpal,Matthew Riemer,Irina Rish,Doina Precup; Towards Continual Reinforcement Learning: A Review and Perspectives. CoRR (2020)
Kushal Arora,Aishik Chakraborty,Jackie Chi Kit Cheung; Learning Lexical Subspaces in a Distributional Vector Space. Trans. Assoc. Comput. Linguistics (2020)
Jingyi He,K. C. Tsiolis,Kian Kenyon-Dean,Jackie Chi Kit Cheung; Learning Efficient Task-Specific Meta-Embeddings with Word Prisms. COLING (2020)
Ali Emami,Kaheer Suleman,Adam Trischler,Jackie Chi Kit Cheung; An Analysis of Dataset Overlap on Winograd-Style Tasks. COLING (2020)
Jiapeng Wu,Meng Cao,Jackie Chi Kit Cheung,William L. Hamilton; TeMP: Temporal Message Passing for Temporal Knowledge Graph Completion. EMNLP (1) (2020)
Meng Cao,Yue Dong,Jiapeng Wu,Jackie Chi Kit Cheung; Factual Error Correction for Abstractive Summarization Models. EMNLP (1) (2020)
Clément Jumel,Annie Louis,Jackie Chi Kit Cheung; TESA: A Task in Entity Semantic Aggregation for Abstractive Summarization. EMNLP (1) (2020)
Kian Kenyon-Dean,Edward Newell,Jackie Chi Kit Cheung; Deconstructing word embedding algorithms. EMNLP (1) (2020)
Yue Dong,Shuohang Wang,Zhe Gan,Yu Cheng,Jackie Chi Kit Cheung,Jingjing Liu; Multi-Fact Correction in Abstractive Text Summarization. EMNLP (1) (2020)
Peng Xu,Jackie Chi Kit Cheung,Yanshuai Cao; On Variational Learning of Controllable Representations for Text without Supervision. ICML (2020)
Abhilasha Ravichander,Eduard H. Hovy,Kaheer Suleman,Adam Trischler,Jackie Chi Kit Cheung; On the Systematicity of Probing Contextualized Word Representations: The Case of Hypernymy in BERT. *SEM@COLING (2020)
Manuel Sage,Pietro Cruciata,Raed Abdo,Jackie Chi Kit Cheung,Yaoyao Fiona Zhao; Investigating the Influence of Selected Linguistic Features on Authorship Attribution using German News Articles. SwissText/KONVENS (2020)
Yue Dong,Andrei Romascanu,Jackie Chi Kit Cheung; HipoRank: Incorporating Hierarchical and Positional Information into Graph-based Unsupervised Long Document Extractive Summarization. CoRR (2020)
Peng Xu,Wei Yang,Wenjie Zi,Keyi Tang,Chengyang Huang,Jackie Chi Kit Cheung,Yanshuai Cao; Optimizing Deeper Transformers on Small Datasets: An Application on Text-to-SQL Semantic Parsing. CoRR (2020)
Avishek Joey Bose,Ariella Smofsky,Renjie Liao,Prakash Panangaden,William L. Hamilton; Latent Variable Modelling with Hyperbolic Normalizing Flows. ICML (2020)
Linan Chen,Florence Clerc,Prakash Panangaden; Towards a Classification of Behavioural Equivalences in Continuous-time Markov Processes. MFPS (2020)
Nick Bezhanishvili,Marcello M. Bonsangue,Helle Hvid Hansen,Dexter Kozen,Clemens Kupke,Prakash Panangaden,Alexandra Silva; Minimisation in Logical Form. CoRR (2020)
Pedro H. Azevedo de Amorim,Dexter Kozen,Radu Mardare,Prakash Panangaden,Michael Roberts; Universal Semantics for the Stochastic Lambda-Calculus. CoRR (2020)
Kellin Pelrine,Jacob Danovitch,Albert Orozco Camacho,Reihaneh Rabbany; ComplexDataLab at W-NUT 2020 Task 2: Detecting Informative COVID-19 Tweets by Attending over Linked Documents. W-NUT@EMNLP (2020)
Shenyang Huang,Yasmeen Hitti,Guillaume Rabusseau,Reihaneh Rabbany; Laplacian Change Point Detection for Dynamic Graphs. KDD (2020)
Xiaoye Ding,Shenyang Huang,Abby Leung,Reihaneh Rabbany; Incorporating Dynamic Flight Network in SEIR to Model Mobility between Populations. CoRR (2020)
Abby Leung,Xiaoye Ding,Shenyang Huang,Reihaneh Rabbany; Contact Graph Epidemic Modelling of COVID-19 for Transmission and Intervention Strategies. CoRR (2020)

2019

Vincent François-Lavet,Guillaume Rabusseau,Joelle Pineau,Damien Ernst,Raphael Fonteneau; On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability. J. Artif. Intell. Res. (2019)
Eric Crawford,Joelle Pineau; Spatially Invariant Unsupervised Object Detection with Convolutional Neural Networks. AAAI (2019)
Thang Doan,João Monteiro,Isabela Albuquerque,Bogdan Mazoure,Audrey Durand,Joelle Pineau,R. Devon Hjelm; On-Line Adaptative Curriculum Learning for GANs. AAAI (2019)
Vincent François-Lavet,Yoshua Bengio,Doina Precup,Joelle Pineau; Combined Reinforcement Learning via Abstract Representations. AAAI (2019)
Boyu Wang,Hejia Zhang,Peng Liu,Zebang Shen,Joelle Pineau; Multitask Metric Learning: Theory and Algorithm. AISTATS (2019)
Ryan Lowe,Jakob N. Foerster,Y-Lan Boureau,Joelle Pineau,Yann N. Dauphin; On the Pitfalls of Measuring Emergent Communication. AAMAS (2019)
Bogdan Mazoure,Thang Doan,Audrey Durand,Joelle Pineau,R. Devon Hjelm; Leveraging exploration in off-policy algorithms via normalizing flows. CoRL (2019)
Abhinav Gupta,Ryan Lowe,Jakob N. Foerster,Douwe Kiela,Joelle Pineau; Seeded self-play for language learning. LANTERN@EMNLP-IJCNLP (2019)
Koustuv Sinha,Shagun Sodhani,Jin Dong,Joelle Pineau,William L. Hamilton; CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text. EMNLP/IJCNLP (1) (2019)
Abhishek Das,Théophile Gervet,Joshua Romoff,Dhruv Batra,Devi Parikh,Mike Rabbat,Joelle Pineau; TarMAC: Targeted Multi-Agent Communication. ICML (2019)
Joshua Romoff,Peter Henderson,Ahmed Touati,Yann Ollivier,Joelle Pineau,Emma Brunskill; Separable value functions across time-scales. ICML (2019)
Lucas Caccia,Herke van Hoof,Aaron C. Courville,Joelle Pineau; Deep Generative Modeling of LiDAR Data. IROS (2019)
Philip Paquette,Yuchen Lu,Steven Bocco,Max O. Smith,Satya Ortiz-Gagne,Jonathan K. Kummerfeld,Joelle Pineau,Satinder Singh,Aaron C. Courville; No-Press Diplomacy: Modeling Multi-Agent Gameplay. NeurIPS (2019)
Mahmoud Assran,Joshua Romoff,Nicolas Ballas,Joelle Pineau,Mike Rabbat; Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning. NeurIPS (2019)
Ahmed Touati,Harsh Satija,Joshua Romoff,Joelle Pineau,Pascal Vincent; Randomized Value Functions via Multiplicative Normalizing Flows. UAI (2019)
Emily Dinan,Varvara Logacheva,Valentin Malykh,Alexander H. Miller,Kurt Shuster,Jack Urbanek,Douwe Kiela,Arthur Szlam,Iulian Serban,Ryan Lowe,Shrimai Prabhumoye,Alan W. Black,Alexander I. Rudnicky,Jason Williams,Joelle Pineau,Mikhail S. Burtsev,Jason Weston; The Second Conversational Intelligence Challenge (ConvAI2). CoRR (2019)
Joshua Romoff,Peter Henderson,Ahmed Touati,Yann Ollivier,Emma Brunskill,Joelle Pineau; Separating value functions across time-scales. CoRR (2019)
Pierre Thodoroff,Nishanth Anand,Lucas Caccia,Doina Precup,Joelle Pineau; Recurrent Value Functions. CoRR (2019)
Amy Zhang,Zachary C. Lipton,Luis Pineda,Kamyar Azizzadenesheli,Anima Anandkumar,Laurent Itti,Joelle Pineau,Tommaso Furlanello; Learning Causal State Representations of Partially Observable Environments. CoRR (2019)
Philip Paquette,Yuchen Lu,Steven Bocco,Max O. Smith,Satya Ortiz-Gagne,Jonathan K. Kummerfeld,Satinder Singh,Joelle Pineau,Aaron C. Courville; No Press Diplomacy: Modeling Multi-Agent Gameplay. CoRR (2019)
Thang Doan,Bogdan Mazoure,Audrey Durand,Joelle Pineau,R. Devon Hjelm; Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning. CoRR (2019)
Scott Fujimoto,Edoardo Conti,Mohammad Ghavamzadeh,Joelle Pineau; Benchmarking Batch Deep Reinforcement Learning Algorithms. CoRR (2019)
Viswanath Sivakumar,Tim Rocktäschel,Alexander H. Miller,Heinrich Küttler,Nantas Nardelli,Mike Rabbat,Joelle Pineau,Sebastian Riedel; MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions. CoRR (2019)
Lucas Caccia,Eugene Belilovsky,Massimo Caccia,Joelle Pineau; Online Learned Continual Compression with Stacked Quantization Module. CoRR (2019)
Borja Balle,Prakash Panangaden,Doina Precup; Singular value automata and approximate minimization. Math. Struct. Comput. Sci. (2019)
Philip Amortila,Marc G. Bellemare,Prakash Panangaden,Doina Precup; Temporally Extended Metrics for Markov Decision Processes. SafeAI@AAAI (2019)
Andrei Lupu,Audrey Durand,Doina Precup; Leveraging Observations in Bandits: Between Risks and Benefits. AAAI (2019)
Khimya Khetarpal,Doina Precup; Learning Options with Interest Functions. AAAI (2019)
Guillaume Rabusseau,Tianyu Li,Doina Precup; Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning. AISTATS (2019)
Anna Harutyunyan,Will Dabney,Diana Borsa,Nicolas Heess,Rémi Munos,Doina Precup; The Termination Critic. AISTATS (2019)
Doina Precup; Building Knowledge for AI Agents with Reinforcement Learning. AAMAS (2019)
Martin Weiss,Simon Chamorro,Roger Girgis,Margaux Luck,Samira Ebrahimi Kahou,Joseph Paul Cohen,Derek Nowrouzezahrai,Doina Precup,Florian Golemo,Chris Pal; Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments. CoRL (2019)
Zafarali Ahmed,Arjun Karuvally,Doina Precup,Simon Gravel; Learning proposals for sequential importance samplers using reinforced variational inference. DeepRLStructPred@ICLR (2019)
Scott Fujimoto,David Meger,Doina Precup; Off-Policy Deep Reinforcement Learning without Exploration. ICML (2019)
Anna Harutyunyan,Peter Vrancx,Philippe Hamel,Ann Nowé,Doina Precup; Per-Decision Option Discounting. ICML (2019)
Sanjay Thakur,Herke van Hoof,Juan Camilo Gamboa Higuera,Doina Precup,David Meger; Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks. ICRA (2019)
Hossein Aboutalebi,Doina Precup,Tibor Schuster; Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials. AISafety@IJCAI (2019)
Hossein Aboutalebi,Doina Precup,Tibor Schuster; Learning Reliable Policies in the Bandit Setting with Application to Adaptive Clinical Trials. KHD@IJCAI (2019)
Charles C. Onu,Jonathan Lebensold,William L. Hamilton,Doina Precup; Neural Transfer Learning for Cry-Based Diagnosis of Perinatal Asphyxia. INTERSPEECH (2019)
Barleen Kaur,Paul Lemaître,Raghav Mehta,Nazanin Mohammadi Sepahvand,Doina Precup,Douglas L. Arnold,Tal Arbel; Improving Pathological Structure Segmentation via Transfer Learning Across Diseases. DART/MIL3ID@MICCAI (2019)
Sumana Basu,Konrad Wagstyl,Azar Zandifar,D. Louis Collins,Adriana Romero,Doina Precup; Early Prediction of Alzheimer's Disease Progression Using Variational Autoencoders. MICCAI (4) (2019)
Adrian Tousignant,Paul Lemaître,Doina Precup,Douglas L. Arnold,Tal Arbel; Prediction of Disease Progression in Multiple Sclerosis Patients using Deep Learning Analysis of MRI Data. MIDL (2019)
Sitao Luan,Mingde Zhao,Xiao-Wen Chang,Doina Precup; Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks. NeurIPS (2019)
Anna Harutyunyan,Will Dabney,Thomas Mesnard,Mohammad Gheshlaghi Azar,Bilal Piot,Nicolas Heess,Hado van Hasselt,Gregory Wayne,Satinder Singh,Doina Precup,Rémi Munos; Hindsight Credit Assignment. NeurIPS (2019)
Olivier Tieleman,Angeliki Lazaridou,Shibl Mourad,Charles Blundell,Doina Precup; Community size effect in artificial learning systems. ViGIL@NeurIPS (2019)
Charles C. Onu,Jonathan Lebensold,William L. Hamilton,Doina Precup; Neural Transfer Learning for Cry-based Diagnosis of Perinatal Asphyxia. CoRR (2019)
Srinivas Venkattaramanujam,Eric Crawford,Thang Doan,Doina Precup; Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning. CoRR (2019)
Vincent Michalski,Vikram Voleti,Samira Ebrahimi Kahou,Anthony Ortiz,Pascal Vincent,Chris Pal,Doina Precup; An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation. CoRR (2019)
Sitao Luan,Xiao-Wen Chang,Doina Precup; Revisit Policy Optimization in Matrix Form. CoRR (2019)
David Venuto,Léonard Boussioux,Junhao Wang,Rola Dali,Jhelum Chakravorty,Yoshua Bengio,Doina Precup; Avoidance Learning Using Observational Reinforcement Learning. CoRR (2019)
Shruti Mishra,Abbas Abdolmaleki,Arthur Guez,Piotr Trochim,Doina Precup; Augmenting learning using symmetry in a biologically-inspired domain. CoRR (2019)
Jonathan Lebensold,William L. Hamilton,Borja Balle,Doina Precup; Actor Critic with Differentially Private Critic. CoRR (2019)
Vishal Jain,William Fedus,Hugo Larochelle,Doina Precup,Marc G. Bellemare; Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction. CoRR (2019)
Jhelum Chakravorty,Patrick Nadeem Ward,Julien Roy,Maxime Chevalier-Boisvert,Sumana Basu,Andrei Lupu,Doina Precup; Option-critic in cooperative multi-agent systems. CoRR (2019)
Riashat Islam,Raihan Seraj,Pierre-Luc Bacon,Doina Precup; Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods. CoRR (2019)
Riashat Islam,Raihan Seraj,Samin Yeasar Arnob,Doina Precup; Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning. CoRR (2019)
Riashat Islam,Zafarali Ahmed,Doina Precup; Marginalized State Distribution Entropy Regularization in Policy Optimization. CoRR (2019)
Olivier Tieleman,Angeliki Lazaridou,Shibl Mourad,Charles Blundell,Doina Precup; Shaping representations through communication: community size effect in artificial learning systems. CoRR (2019)
Pengfei Liu,Jie Fu,Yue Dong,Xipeng Qiu,Jackie Chi Kit Cheung; Learning Multi-Task Communication with Message Passing for Sequence Learning. AAAI (2019)
Pengfei Liu,Shuaichen Chang,Xuanjing Huang,Jian Tang,Jackie Chi Kit Cheung; Contextualized Non-Local Neural Networks for Sequence Learning. AAAI (2019)
Weiwei Zhang,Jackie Chi Kit Cheung,Joel Oren; Generating Character Descriptions for Automatic Summarization of Fiction. AAAI (2019)
Peng Xu,Hamidreza Saghir,Jin Sung Kang,Teng Long,Avishek Joey Bose,Yanshuai Cao,Jackie Chi Kit Cheung; A Cross-Domain Transferable Neural Coherence Model. ACL (1) (2019)
Yue Dong,Zichao Li,Mehdi Rezagholizadeh,Jackie Chi Kit Cheung; EditNTS: An Neural Programmer-Interpreter Model for Sentence Simplification through Explicit Editing. ACL (1) (2019)
Ali Emami,Paul Trichelair,Adam Trischler,Kaheer Suleman,Hannes Schulz,Jackie Chi Kit Cheung; The KnowRef Coreference Corpus: Removing Gender and Number Cues for Difficult Pronominal Anaphora Resolution. ACL (1) (2019)
Meng Cao,Jackie Chi Kit Cheung; Referring Expression Generation Using Entity Profiles. EMNLP/IJCNLP (1) (2019)
Paul Trichelair,Ali Emami,Adam Trischler,Kaheer Suleman,Jackie Chi Kit Cheung; How Reasonable are Common-Sense Reasoning Tasks: A Case-Study on the Winograd Schema Challenge and SWAG. EMNLP/IJCNLP (1) (2019)
Matt Grenander,Yue Dong,Jackie Chi Kit Cheung,Annie Louis; Countering the Effects of Lead Bias in News Summarization via Multi-Stage Training and Auxiliary Losses. EMNLP/IJCNLP (1) (2019)
Krtin Kumar,Jackie Chi Kit Cheung; Understanding the Behaviour of Neural Abstractive Summarizers using Contrastive Examples. NAACL-HLT (1) (2019)
Jingyun Liu,Jackie Chi Kit Cheung,Annie Louis; What comes next? Extractive summarization by next-sentence prediction. CoRR (2019)
Peng Xu,Yanshuai Cao,Jackie Chi Kit Cheung; Unsupervised Controllable Text Generation with Global Variation Discovery and Disentanglement. CoRR (2019)
Teng Long,Yanshuai Cao,Jackie Chi Kit Cheung; Preventing Posterior Collapse in Sequence VAEs with Pooling. CoRR (2019)
Ian Porada,Kaheer Suleman,Jackie Chi Kit Cheung; Can a Gorilla Ride a Camel? Learning Semantic Plausibility from Text. CoRR (2019)
Edward Newell,Kian Kenyon-Dean,Jackie Chi Kit Cheung; Deconstructing and reconstructing word embedding algorithms. CoRR (2019)
Florence Clerc,Nathanaël Fijalkow,Bartek Klin,Prakash Panangaden; Expressiveness of probabilistic modal logics: A gradual approach. Inf. Comput. (2019)
Linan Chen,Florence Clerc,Prakash Panangaden; Bisimulation for Feller-Dynkin Processes. MFPS (2019)
Junhao Wang,Renhao Wang,Aayushi Kulshrestha,Reihaneh Rabbany; Anomaly Detection with Joint Representation Learning of Content and Connection. CoRR (2019)
Junhao Wang,Sacha Levy,Ren Wang,Aayushi Kulshrestha,Reihaneh Rabbany; SGP: Spotting Groups Polluting the Online Political Discourse. CoRR (2019)

2018

Iulian Vlad Serban,Ryan Lowe,Peter Henderson,Laurent Charlin,Joelle Pineau; A Survey of Available Corpora For Building Data-Driven Dialogue Systems: The Journal Version. Dialogue Discourse (2018)
Vincent François-Lavet,Peter Henderson,Riashat Islam,Marc G. Bellemare,Joelle Pineau; An Introduction to Deep Reinforcement Learning. Found. Trends Mach. Learn. (2018)
Mahmoud Ghorbel,Joelle Pineau,Richard Gourdeau,Shervin Javdani,Siddhartha S. Srinivasa; A Decision-Theoretic Approach for the Collaborative Control of a Smart Wheelchair. Int. J. Soc. Robotics (2018)
Audrey Durand,Odalric-Ambrym Maillard,Joelle Pineau; Streaming kernel regression with provably adaptive mean, variance, and regularization. J. Mach. Learn. Res. (2018)
Peter Henderson,Wei-Di Chang,Pierre-Luc Bacon,David Meger,Joelle Pineau,Doina Precup; OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning. AAAI (2018)
Peter Henderson,Riashat Islam,Philip Bachman,Joelle Pineau,Doina Precup,David Meger; Deep Reinforcement Learning That Matters. AAAI (2018)
Peter Henderson,Koustuv Sinha,Nicolas Angelard-Gontier,Nan Rosemary Ke,Genevieve Fried,Ryan Lowe,Joelle Pineau; Ethical Challenges in Data-Driven Dialogue Systems. AIES (2018)
Joshua Romoff,Peter Henderson,Alexandre Piché,Vincent François-Lavet,Joelle Pineau; Reward Estimation for Variance Reduction in Deep Reinforcement Learning. CoRL (2018)
Prasanna Parthasarathi,Joelle Pineau; Extending Neural Generative Conversational Model using External Knowledge Sources. EMNLP (2018)
Amy Zhang,Harsh Satija,Joelle Pineau; Decoupling Dynamics and Reward for Transfer Learning. ICLR (Workshop) (2018)
Nan Rosemary Ke,Konrad Zolna,Alessandro Sordoni,Zhouhan Lin,Adam Trischler,Yoshua Bengio,Joelle Pineau,Laurent Charlin,Christopher J. Pal; Focused Hierarchical RNNs for Conditional Sequence Processing. ICML (2018)
Matthew J. A. Smith,Herke van Hoof,Joelle Pineau; An Inference-Based Policy Gradient Method for Learning Options. ICML (2018)
Audrey Durand,Charis Achilleos,Demetris Iacovides,Katerina Strati,Georgios D. Mitsis,Joelle Pineau; Contextual Bandits for Adapting Treatment in a Mouse Model of de Novo Carcinogenesis. MLHC (2018)
Pierre Thodoroff,Audrey Durand,Joelle Pineau,Doina Precup; Temporal Regularization for Markov Decision Process. NeurIPS (2018)
Iulian Vlad Serban,Chinnadhurai Sankar,Mathieu Germain,Saizheng Zhang,Zhouhan Lin,Sandeep Subramanian,Taesup Kim,Michael Pieper,Sarath Chandar,Nan Rosemary Ke,Sai Rajeswar,Alexandre de Brébisson,Jose M. R. Sotelo,Dendi Suhubdy,Vincent Michalski,Alexandre Nguyen,Joelle Pineau,Yoshua Bengio; A Deep Reinforcement Learning Chatbot (Short Version). CoRR (2018)
Valentin Thomas,Emmanuel Bengio,William Fedus,Jules Pondard,Philippe Beaudoin,Hugo Larochelle,Joelle Pineau,Doina Precup,Yoshua Bengio; Disentangling the independently controllable factors of variation by interacting with the world. CoRR (2018)
Amy Zhang,Nicolas Ballas,Joelle Pineau; A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning. CoRR (2018)
Iulian Vlad Serban,Chinnadhurai Sankar,Michael Pieper,Joelle Pineau,Yoshua Bengio; The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach. CoRR (2018)
Thang Doan,João Monteiro,Isabela Albuquerque,Bogdan Mazoure,Audrey Durand,Joelle Pineau,R. Devon Hjelm; Online Adaptative Curriculum Learning for GANs. CoRR (2018)
Eric Crawford,Guillaume Rabusseau,Joelle Pineau; Sequential Coordination of Deep Models for Learning Visual Arithmetic. CoRR (2018)
Peter Henderson,Joshua Romoff,Joelle Pineau; Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods. CoRR (2018)
Pierre Thodoroff,Audrey Durand,Joelle Pineau,Doina Precup; Temporal Regularization in Markov Decision Process. CoRR (2018)
Peter Henderson,Koustuv Sinha,Nan Rosemary Ke,Joelle Pineau; Adversarial Gain. CoRR (2018)
Nicolas Gontier,Koustuv Sinha,Peter Henderson,Iulian Serban,Michael Noseworthy,Prasanna Parthasarathi,Joelle Pineau; The RLLChatbot: a solution to the ConvAI challenge. CoRR (2018)
Koustuv Sinha,Shagun Sodhani,William L. Hamilton,Joelle Pineau; Compositional Language Understanding with Text-based Relational Reasoning. CoRR (2018)
Amy Zhang,Yuxin Wu,Joelle Pineau; Natural Environment Benchmarks for Reinforcement Learning. CoRR (2018)
Pierre-Luc Bacon,Doina Precup; Constructing Temporal Abstractions Autonomously in Reinforcement Learning. AI Mag. (2018)
Yuri Grinberg,Hossein Aboutalebi,Melanie Lyman-Abramovitch,Borja Balle,Doina Precup; Learning Predictive State Representations From Non-Uniform Sampling. AAAI (2018)
Jean Harb,Pierre-Luc Bacon,Martin Klissarov,Doina Precup; When Waiting Is Not an Option: Learning Options With a Deliberation Cost. AAAI (2018)
Anna Harutyunyan,Peter Vrancx,Pierre-Luc Bacon,Doina Precup,Ann Nowé; Learning With Options That Terminate Off-Policy. AAAI (2018)
Daniel J. Mankowitz,Timothy A. Mann,Pierre-Luc Bacon,Doina Precup,Shie Mannor; Learning Robust Options. AAAI (2018)
Andrei Lupu,Doina Precup; Imitation Upper Confidence Bound for Bandits on a Graph. AAAI (2018)
Tianyu Li,Guillaume Rabusseau,Doina Precup; Nonlinear Weighted Finite Automata. AISTATS (2018)
Ayush Jain,Doina Precup; Eligibility Traces for Options. AAMAS (2018)
Andrei Lupu,Audrey Durand,Doina Precup; Leveraging Observational Learning for Exploration in Bandits. AAMAS (2018)
Lara J. Kanbar,Charles C. Onu,Wissam Shalish,Karen A. Brown,Guilherme M. Sant'Anna,Doina Precup,Robert E. Kearney; Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants. EMBC (2018)
Ahmed Touati,Pierre-Luc Bacon,Doina Precup,Pascal Vincent; Convergent TREE BACKUP and RETRACE with Function Approximation. ICML (2018)
Tanya Nair,Doina Precup,Douglas L. Arnold,Tal Arbel; Exploring Uncertainty Measures in Deep Networks for Multiple Sclerosis Lesion Detection and Segmentation. MICCAI (1) (2018)
Jessie Huang,Fa Wu,Doina Precup,Yang Cai; Learning Safe Policies with Expert Guidance. NeurIPS (2018)
Kian Kenyon-Dean,Jackie Chi Kit Cheung,Doina Precup; Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization. *SEM@NAACL-HLT (2018)
Ryan Faulkner,Doina Precup; Dyna Planning using a Feature Based Generative Model. CoRR (2018)
Arushi Jain,Khimya Khetarpal,Doina Precup; Safe Option-Critic: Learning Safety in the Option-Critic Architecture. CoRR (2018)
Khimya Khetarpal,Doina Precup; Attend Before you Act: Leveraging human visual attention for continual learning. CoRR (2018)
Charles C. Onu,Lara J. Kanbar,Wissam Shalish,Karen A. Brown,Guilherme M. Sant'Anna,Robert E. Kearney,Doina Precup; Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing. CoRR (2018)
Tom Schaul,Hado van Hasselt,Joseph Modayil,Martha White,Adam White,Pierre-Luc Bacon,Jean Harb,Shibl Mourad,Marc G. Bellemare,Doina Precup; The Barbados 2018 List of Open Issues in Continual Learning. CoRR (2018)
Khimya Khetarpal,Shagun Sodhani,Sarath Chandar,Doina Precup; Environments for Lifelong Reinforcement Learning. CoRR (2018)
Kian Kenyon-Dean,Andre Cianflone,Lucas Page-Caccia,Guillaume Rabusseau,Jackie Chi Kit Cheung,Doina Precup; Clustering-Oriented Representation Learning with Attractive-Repulsive Loss. CoRR (2018)
Andre Cianflone,Yulan Feng,Jad Kabbara,Jackie Chi Kit Cheung; Let's do it "again": A First Computational Approach to Detecting Adverbial Presupposition Triggers. ACL (1) (2018)
Koustuv Sinha,Yue Dong,Jackie Chi Kit Cheung,Derek Ruths; A Hierarchical Neural Attention-based Text Classifier. EMNLP (2018)
Ali Emami,Noelia De La Cruz,Adam Trischler,Kaheer Suleman,Jackie Chi Kit Cheung; A Knowledge Hunting Framework for Common Sense Reasoning. EMNLP (2018)
Yue Dong,Yikang Shen,Eric Crawford,Herke van Hoof,Jackie Chi Kit Cheung; BanditSum: Extractive Summarization as a Contextual Bandit. EMNLP (2018)
Edward Newell,Jackie Chi Kit Cheung; Constructing a Lexicon of Relational Nouns. LREC (2018)
Ali Emami,Adam Trischler,Kaheer Suleman,Jackie Chi Kit Cheung; A Generalized Knowledge Hunting Framework for the Winograd Schema Challenge. NAACL-HLT (Student Research Workshop) (2018)
Laura Kallmeyer,Behrang QasemiZadeh,Jackie Chi Kit Cheung; Coarse Lexical Frame Acquisition at the Syntax-Semantics Interface Using a Latent-Variable PCFG Model. *SEM@NAACL-HLT (2018)
Ebrahim Bagheri,Jackie Chi Kit Cheung; Advances in Artificial Intelligence - 31st Canadian Conference on Artificial Intelligence, Canadian AI 2018, Toronto, ON, Canada, May 8-11, 2018, Proceedings. Lecture Notes in Computer Science (2018)
Stanislaw Jastrzebski,Dzmitry Bahdanau,Seyedarian Hosseini,Michael Noukhovitch,Yoshua Bengio,Jackie Chi Kit Cheung; Commonsense mining as knowledge base completion? A study on the impact of novelty. CoRR (2018)
Ali Emami,Paul Trichelair,Adam Trischler,Kaheer Suleman,Hannes Schulz,Jackie Chi Kit Cheung; The Hard-CoRe Coreference Corpus: Removing Gender and Number Cues for Difficult Pronominal Anaphora Resolution. CoRR (2018)
Paul Trichelair,Ali Emami,Jackie Chi Kit Cheung,Adam Trischler,Kaheer Suleman,Fernando Diaz; On the Evaluation of Common-Sense Reasoning in Natural Language Understanding. CoRR (2018)
Pengfei Liu,Shuaichen Chang,Xuanjing Huang,Jian Tang,Jackie Chi Kit Cheung; Contextualized Non-local Neural Networks for Sequence Learning. CoRR (2018)
Pengfei Liu,Jie Fu,Yue Dong,Xipeng Qiu,Jackie Chi Kit Cheung; Multi-task Learning over Graph Structures. CoRR (2018)
Radu Mardare,Prakash Panangaden,Gordon D. Plotkin; Free complete Wasserstein algebras. Log. Methods Comput. Sci. (2018)
Giorgio Bacci,Robert Furber,Dexter Kozen,Radu Mardare,Prakash Panangaden,Dana S. Scott; Boolean-Valued Semantics for the Stochastic λ-Calculus. LICS (2018)
Giorgio Bacci,Radu Mardare,Prakash Panangaden,Gordon D. Plotkin; An Algebraic Theory of Markov Processes. LICS (2018)
Nicolas Gagné,Prakash Panangaden; A Categorical Characterization of Relative Entropy on Standard Borel Spaces. MFPS (2018)
Radu Mardare,Prakash Panangaden,Gordon D. Plotkin; On the Axiomatizability of Quantitative Algebras. CoRR (2018)
Reihaneh Rabbany,David Bayani,Artur Dubrawski; Active Search of Connections for Case Building and Combating Human Trafficking. KDD (2018)
Dhivya Eswaran,Reihaneh Rabbany,Artur W. Dubrawski,Christos Faloutsos; Social-Affiliation Networks: Patterns and the SOAR Model. ECML/PKDD (2) (2018)
Reihaneh Rabbany,Mansoureh Takaffoli,Justin Fagnan,Osmar R. Zaïane,Ricardo J. G. B. Campello; Relative Validity Criteria for Community Mining Algorithms. Encyclopedia of Social Network Analysis and Mining. 2nd Ed. (2018)
Justin Fagnan,Afra Abnar,Reihaneh Rabbany,Osmar R. Zaïane; Modular Networks for Validating Community Detection Algorithms. CoRR (2018)

2017

Ryan Thomas Lowe,Nissan Pow,Iulian Vlad Serban,Laurent Charlin,Chia-Wei Liu,Joelle Pineau; Training End-to-End Dialogue Systems with the Ubuntu Dialogue Corpus. Dialogue Discourse (2017)
Ali Emami,Joseph El Youssef,Remi Rabasa-Lhoret,Joelle Pineau,Jessica R. Castle,Ahmad Haidar; Modeling Glucagon Action in Patients With Type 1 Diabetes. IEEE J. Biomed. Health Informatics (2017)
Iulian Vlad Serban,Alessandro Sordoni,Ryan Lowe,Laurent Charlin,Joelle Pineau,Aaron C. Courville,Yoshua Bengio; A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues. AAAI (2017)
Ryan Lowe,Michael Noseworthy,Iulian Vlad Serban,Nicolas Angelard-Gontier,Yoshua Bengio,Joelle Pineau; Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses. ACL (1) (2017)
Matthew Smith,Laurent Charlin,Joelle Pineau; A Sparse Probabilistic Model of User Preference Data. Canadian Conference on AI (2017)
Iulian Vlad Serban,Alexander Ororbia,Joelle Pineau,Aaron C. Courville; Piecewise Latent Variables for Neural Variational Text Processing. SPNLP@EMNLP (2017)
Dzmitry Bahdanau,Philemon Brakel,Kelvin Xu,Anirudh Goyal,Ryan Lowe,Joelle Pineau,Aaron C. Courville,Yoshua Bengio; An Actor-Critic Algorithm for Sequence Prediction. ICLR (Poster) (2017)
Ryan Lowe,Michael Noseworthy,Iulian Vlad Serban,Nicolas Angelard-Gontier,Yoshua Bengio,Joelle Pineau; Towards an automatic Turing test: Learning to evaluate dialogue responses. ICLR (Workshop) (2017)
Guillaume Rabusseau,Borja Balle,Joelle Pineau; Multitask Spectral Learning of Weighted Automata. NIPS (2017)
Hoai Phuoc Truong,Prasanna Parthasarathi,Joelle Pineau; MACA: A Modular Architecture for Conversational Agents. SIGDIAL Conference (2017)
Michael Noseworthy,Jackie Chi Kit Cheung,Joelle Pineau; Predicting Success in Goal-Driven Human-Human Dialogues. SIGDIAL Conference (2017)
Emmanuel Bengio,Valentin Thomas,Joelle Pineau,Doina Precup,Yoshua Bengio; Independently Controllable Features. CoRR (2017)
Valentin Thomas,Jules Pondard,Emmanuel Bengio,Marc Sarfati,Philippe Beaudoin,Marie-Jean Meurs,Joelle Pineau,Doina Precup,Yoshua Bengio; Independently Controllable Factors. CoRR (2017)
Iulian Vlad Serban,Chinnadhurai Sankar,Mathieu Germain,Saizheng Zhang,Zhouhan Lin,Sandeep Subramanian,Taesup Kim,Michael Pieper,Sarath Chandar,Nan Rosemary Ke,Sai Mudumba,Alexandre de Brébisson,Jose Sotelo,Dendi Suhubdy,Vincent Michalski,Alexandre Nguyen,Joelle Pineau,Yoshua Bengio; A Deep Reinforcement Learning Chatbot. CoRR (2017)
Peter Henderson,Riashat Islam,Philip Bachman,Joelle Pineau,Doina Precup,David Meger; Deep Reinforcement Learning that Matters. CoRR (2017)
Peter Henderson,Wei-Di Chang,Pierre-Luc Bacon,David Meger,Joelle Pineau,Doina Precup; OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning. CoRR (2017)
Anirudh Goyal,Nan Rosemary Ke,Alex Lamb,R. Devon Hjelm,Chris Pal,Joelle Pineau,Yoshua Bengio; ACtuAL: Actor-Critic Under Adversarial Learning. CoRR (2017)
Xingwei Cao,Guillaume Rabusseau,Joelle Pineau; Tensor Regression Networks with various Low-Rank Tensor Approximations. CoRR (2017)
Pierre-Luc Bacon,Jean Harb,Doina Precup; The Option-Critic Architecture. AAAI (2017)
Negar Ghourchian,Michel Allegue-Martínez,Doina Precup; Real-Time Indoor Localization in Smart Homes Using Semi-Supervised Learning. AAAI (2017)
Charles C. Onu,Lara J. Kanbar,Wissam Shalish,Karen A. Brown,Guilherme M. Sant'Anna,Robert E. Kearney,Doina Precup; A semi-Markov chain approach to modeling respiratory patterns prior to extubation in preterm infants. EMBC (2017)
Lara J. Kanbar,Wissam Shalish,Doina Precup,Karen A. Brown,Guilherme M. Sant'Anna,Robert E. Kearney; APEX_SCOPE: A graphical user interface for visualization of multi-modal data in inter-disciplinary studies. EMBC (2017)
Teng Long,Emmanuel Bengio,Ryan Lowe,Jackie Chi Kit Cheung,Doina Precup; World Knowledge for Reading Comprehension: Rare Entity Prediction with Hierarchical LSTMs Using External Descriptions. EMNLP (2017)
Jesús Alejandro Cárdenes Cabré,Doina Precup,Ricardo Sanz; Horizontal and Vertical Self-Adaptive Cloud Controller with Reward Optimization for Resource Allocation. ICCAC (2017)
Timothy A. Mann,Shie Mannor,Doina Precup; Approximate Value Iteration with Temporally Extended Actions (Extended Abstract). IJCAI (2017)
Andrew Doyle,Doina Precup,Douglas L. Arnold,Tal Arbel; Predicting Future Disease Activity and Treatment Responders for Multiple Sclerosis Patients Using a Bag-of-Lesions Brain Representation. MICCAI (3) (2017)
Sharmin Nilufar,D. S. Wang,John Girgis,C. G. Palii,D. Yang,A. Blais,M. Brand,Doina Precup,Theodore J. Perkins; Learning-based interactive segmentation using the maximum mean cycle weight formalism. Medical Imaging: Image Processing (2017)
Di Wu,Boyu Wang,Doina Precup,Benoit Boulet; Boosting Based Multiple Kernel Learning and Transfer Regression for Electricity Load Forecasting. ECML/PKDD (3) (2017)
Charles C. Onu,Lara J. Kanbar,Wissam Shalish,Karen A. Brown,Guilherme M. Sant'Anna,Robert E. Kearney,Doina Precup; Predicting extubation readiness in extreme preterm infants based on patterns of breathing. SSCI (2017)
Doina Precup,Yee Whye Teh; Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017. Proceedings of Machine Learning Research (2017)
Peeyush Kumar,Doina Precup; Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options. CoRR (2017)
Jean Harb,Doina Precup; Investigating Recurrence and Eligibility Traces in Deep Q-Networks. CoRR (2017)
Ahmed Touati,Pierre-Luc Bacon,Doina Precup,Pascal Vincent; Convergent Tree-Backup and Retrace with Function Approximation. CoRR (2017)
Philip Bachman,Doina Precup; Variational Generative Stochastic Networks with Collaborative Shaping. CoRR (2017)
Riashat Islam,Peter Henderson,Maziar Gomrokchi,Doina Precup; Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control. CoRR (2017)
Tianyu Li,Guillaume Rabusseau,Doina Precup; Neural Network Based Nonlinear Weighted Finite Automata. CoRR (2017)
Jean Harb,Pierre-Luc Bacon,Martin Klissarov,Doina Precup; When Waiting is not an Option : Learning Options with a Deliberation Cost. CoRR (2017)
Anna Harutyunyan,Peter Vrancx,Pierre-Luc Bacon,Doina Precup,Ann Nowé; Learning with Options that Terminate Off-Policy. CoRR (2017)
Charles C. Onu,Innocent Udeogu,Eyenimi Ndiomu,Urbain Kengni,Doina Precup,Guilherme M. Sant'Anna,Edward Alikor,Peace Opara; Ubenwa: Cry-based Diagnosis of Birth Asphyxia. CoRR (2017)
Martin Klissarov,Pierre-Luc Bacon,Jean Harb,Doina Precup; Learnings Options End-to-End for Continuous Action Tasks. CoRR (2017)
Jean-Guy Meunier,Louis Chartrand,Jackie Chi Kit Cheung,Mathieu Valette,Marie-Noëlle Bayle; Computer-Assisted Conceptual Analysis of Textual Data as Applied to Philosophical Corpuses. DH (2017)
Louis Chartrand,Jackie Chi Kit Cheung,Mohamed Bouguessa; Detecting Large Concept Extensions for Conceptual Analysis. MLDM (2017)
Nick Parlante,Julie Zelenski,Dave Feinberg,Kunal Mishra,Josh Hug,Kevin Wayne,Michael Guerzhoy,Jackie Chi Kit Cheung,François Pitt; Nifty Assignments. SIGCSE (2017)
Lu Wang,Jackie Chi Kit Cheung,Giuseppe Carenini,Fei Liu; Proceedings of the Workshop on New Frontiers in Summarization, NFiS@EMNLP 2017, Copenhagen, Denmark, September 7, 2017.
Prakash Panangaden; Editorial comments on the short note by Barry Jay. J. Log. Algebraic Methods Program. (2017)
Prakash Panangaden; The 2017 Alonzo Church award. ACM SIGLOG News (2017)
Prakash Panangaden; 2017 LICS test-of-time award. ACM SIGLOG News (2017)
Prakash Panangaden; 2017 Kleene award. ACM SIGLOG News (2017)
Florence Clerc,Harrison Humphrey,Prakash Panangaden; Bicategories of Markov Processes. Models, Algorithms, Logics and Tools (2017)
Borja Balle,Pascale Gourdeau,Prakash Panangaden; Bisimulation Metrics for Weighted Automata. ICALP (2017)
Nathanaël Fijalkow,Bartek Klin,Prakash Panangaden; Expressiveness of Probabilistic Modal Logics, Revisited. ICALP (2017)
Robert Furber,Dexter Kozen,Kim G. Larsen,Radu Mardare,Prakash Panangaden; Unrestricted stone duality for Markov processes. LICS (2017)
Radu Mardare,Prakash Panangaden,Gordon D. Plotkin; On the axiomatizability of quantitative algebras. LICS (2017)
Nicolas Gagné,Prakash Panangaden; A categorical characterization of relative entropy on Polish spaces. CoRR (2017)

2016

Beomjoon Kim,Joelle Pineau; Socially Adaptive Path Planning in Human Environments Using Inverse Reinforcement Learning. Int. J. Soc. Robotics (2016)
André da Motta Salles Barreto,Doina Precup,Joelle Pineau; Practical Kernel-Based Reinforcement Learning. J. Mach. Learn. Res. (2016)
Boyu Wang,Joelle Pineau; Online Bagging and Boosting for Imbalanced Data Streams. IEEE Trans. Knowl. Data Eng. (2016)
André da Motta Salles Barreto,Rafael L. Beirigo,Joelle Pineau,Doina Precup; Incremental Stochastic Factorization for Online Reinforcement Learning. AAAI (2016)
Boyu Wang,Joelle Pineau,Borja Balle; Multitask Generalized Eigenvalue Program. AAAI (2016)
Iulian Vlad Serban,Alessandro Sordoni,Yoshua Bengio,Aaron C. Courville,Joelle Pineau; Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models. AAAI (2016)
Martin Gerdzhev,Joelle Pineau,Ian M. Mitchell,Pooja Viswanathan,Geneviève Foley; On the Use of Modular Software and Hardware for Designing Wheelchair Robots. AAAI Spring Symposia (2016)
Chia-Wei Liu,Ryan Lowe,Iulian Serban,Michael Noseworthy,Laurent Charlin,Joelle Pineau; How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation. EMNLP (2016)
Chenghui Zhou,Borja Balle,Joelle Pineau; Learning time series models for pedestrian motion prediction. ICRA (2016)
Boyu Wang,Joelle Pineau; Generalized Dictionary for Multitask Learning with Boosting. IJCAI (2016)
Pierre Thodoroff,Joelle Pineau,Andrew Lim; Learning Robust Features using Deep Learning for Automatic Seizure Detection. MLHC (2016)
Ryan Lowe,Iulian Vlad Serban,Michael Noseworthy,Laurent Charlin,Joelle Pineau; On the Evaluation of Dialogue Systems with Next Utterance Classification. SIGDIAL Conference (2016)
Mohammad Ghavamzadeh,Shie Mannor,Joelle Pineau,Aviv Tamar; Bayesian Reinforcement Learning: A Survey. CoRR (2016)
Iulian Vlad Serban,Ryan Lowe,Laurent Charlin,Joelle Pineau; Generative Deep Neural Networks for Dialogue: A Short Review. CoRR (2016)
Iulian Vlad Serban,Alexander G. Ororbia II,Joelle Pineau,Aaron C. Courville; Multi-modal Variational Encoder-Decoders. CoRR (2016)
Tal Arbel,Manuel Jorge Cardoso,William M. Wells III,Albert C. S. Chung,Doina Precup; Editorial on Special Issue on Probabilistic Models for Biomedical Image Analysis. Comput. Vis. Image Underst. (2016)
Meltem Demirkus,Doina Precup,James J. Clark,Tal Arbel; Hierarchical Spatio-Temporal Probabilistic Graphical Model with Multiple Feature Fusion for Binary Facial Attribute Classification in Real-World Face Videos. IEEE Trans. Pattern Anal. Mach. Intell. (2016)
Teng Long,Ryan Lowe,Jackie Chi Kit Cheung,Doina Precup; Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data. ACL (2) (2016)
Faizy Ahsan,Doina Precup,Mathieu Blanchette; Prediction of Cell Type Specific Transcription Factor Binding Site Occupancy. BCB (2016)
Lara J. Kanbar,Wissam Shalish,Doina Precup,Karen A. Brown,Guilherme M. Sant'Anna,Robert E. Kearney; Automated ongoing data validation and quality control of multi-institutional studies. EMBC (2016)
Kian Kenyon-Dean,Jackie Chi Kit Cheung,Doina Precup; Verb Phrase Ellipsis Resolution Using Discriminative and Margin-Infused Algorithms. EMNLP (2016)
Borja Balle,Maziar Gomrokchi,Doina Precup; Differentially Private Policy Evaluation. ICML (2016)
Lucas Langer,Borja Balle,Doina Precup; Learning Multi-Step Predictive State Representations. IJCAI (2016)
Pierre-Luc Bacon,Doina Precup; A Matrix Splitting Perspective on Planning with Options. CoRR (2016)
Meng Xuan Xia,Jackie Chi Kit Cheung; Accurate Pinyin-English Codeswitched Language Identification. CodeSwitch@EMNLP (2016)
Victor Chenal,Jackie Chi Kit Cheung; Predicting sentential semantic compatibility for aggregation in text-to-text generation. COLING (2016)
Jad Kabbara,Yulan Feng,Jackie Chi Kit Cheung; Capturing Pragmatic Knowledge in Article Usage Prediction using LSTMs. COLING (2016)
Matty Hoban,Bart Jacobs,Prakash Panangaden; Preface. Inf. Comput. (2016)
Ichiro Hasuo,Prakash Panangaden; Special Issue on Quantum Physics and Logic. New Gener. Comput. (2016)
Michael W. Mislove,Prakash Panangaden; Semantics column. ACM SIGLOG News (2016)
Prakash Panangaden; Fond (and Frank) Memories of Frank. Theory and Practice of Formal Methods (2016)
Radu Mardare,Prakash Panangaden,Gordon D. Plotkin; Quantitative Algebraic Reasoning. LICS (2016)

2015

Boyu Wang,Joelle Pineau; Online Boosting Algorithms for Anytime Transfer and Multitask Learning. AAAI (2015)
Hang Ma,Joelle Pineau; Information Gathering and Reward Exploitation of Subgoals for POMDPs. AAAI (2015)
Audrey Durand,Joelle Pineau; Adaptive Treatment Allocation Using Sub-Sampled Gaussian Processes. AAAI Fall Symposia (2015)
Andrew Sutcliffe,Neil Tenenholtz,Joelle Pineau; Missteps in Robot Social Navigation. AAAI Fall Symposia (2015)
Joelle Pineau; Improving the Design and Discovery of Dynamic Treatment Strategies Using Recent Results in Sequential Decision-Making. ICAPS (2015)
Joelle Pineau,Pierre-Luc Bacon; Analyzing Open Data from the City of Montreal. MUD@ICML (2015)
Angus Leigh,Joelle Pineau,Nicolas A. Olmedo,Hong Zhang; Person tracking and following with 2D laser scanners. ICRA (2015)
André da Motta Salles Barreto,Rafael L. Beirigo,Joelle Pineau,Doina Precup; An Expectation-Maximization Algorithm to Compute a Stochastic Factorization From Data. IJCAI (2015)
HiuKim Yuen,Joelle Pineau,Philippe S. Archambault; Automatically characterizing driving activities onboard smart wheelchairs from accelerometer data. IROS (2015)
Ryan Lowe,Nissan Pow,Iulian Serban,Joelle Pineau; The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems. SIGDIAL Conference (2015)
Iulian Vlad Serban,Alessandro Sordoni,Yoshua Bengio,Aaron C. Courville,Joelle Pineau; Hierarchical Neural Network Generative Models for Movie Dialogues. CoRR (2015)
Emmanuel Bengio,Pierre-Luc Bacon,Joelle Pineau,Doina Precup; Conditional Computation in Neural Networks for faster models. CoRR (2015)
Iulian Vlad Serban,Ryan Lowe,Peter Henderson,Laurent Charlin,Joelle Pineau; A Survey of Available Corpora for Building Data-Driven Dialogue Systems. CoRR (2015)
Meltem Demirkus,Doina Precup,James J. Clark,Tal Arbel; Hierarchical temporal graphical model for head pose estimation and subsequent attribute classification in real-world videos. Comput. Vis. Image Underst. (2015)
Timothy A. Mann,Shie Mannor,Doina Precup; Approximate Value Iteration with Temporally Extended Actions. J. Artif. Intell. Res. (2015)
Nastaran Jafarpour,Masoumeh T. Izadi,Doina Precup,David L. Buckeridge; Quantifying the determinants of outbreak detection performance through simulation and machine learning. J. Biomed. Informatics (2015)
Amir-massoud Farahmand,Doina Precup,André da Motta Salles Barreto,Mohammad Ghavamzadeh; Classification-Based Approximate Policy Iteration. IEEE Trans. Autom. Control. (2015)
Bjoern H. Menze,András Jakab,Stefan Bauer,Jayashree Kalpathy-Cramer,Keyvan Farahani,Justin S. Kirby,Yuliya Burren,Nicole Porz,Johannes Slotboom,Roland Wiest,Levente Lanczi,Elizabeth R. Gerstner,Marc-André Weber,Tal Arbel,Brian B. Avants,Nicholas Ayache,Patricia Buendia,D. Louis Collins,Nicolas Cordier,Jason J. Corso,Antonio Criminisi,Tilak Das,Herve Delingette,Çagatay Demiralp,Christopher R. Durst,Michel Dojat,Senan Doyle,Joana Festa,Florence Forbes,Ezequiel Geremia,Ben Glocker,Polina Golland,Xiaotao Guo,Andac Hamamci,Khan M. Iftekharuddin,Raj Jena,Nigel M. John,Ender Konukoglu,Danial Lashkari,José Antonio Mariz,Raphael Meier,Sérgio Pereira,Doina Precup,Stephen J. Price,Tammy Riklin Raviv,Syed M. S. Reza,Michael T. Ryan,Duygu Sarikaya,Lawrence H. Schwartz,Hoo-Chang Shin,Jamie Shotton,Carlos A. Silva,Nuno J. Sousa,Nagesh K. Subbanna,Gábor Székely,Thomas J. Taylor,Owen M. Thomas,Nicholas J. Tustison,Gözde B. Ünal,Flor Vasseur,Max Wintermark,Dong Hye Ye,Liang Zhao,Binsheng Zhao,Darko Zikic,Marcel Prastawa,Mauricio Reyes,Koen Van Leemput; The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS). IEEE Trans. Medical Imaging (2015)
Sherry Shanshan Ruan,Gheorghe Comanici,Prakash Panangaden,Doina Precup; Representation Discovery for MDPs Using Bisimulation Metrics. AAAI (2015)
Lara J. Kanbar,Wissam Shalish,Carlos A. Robles-Rubio,Doina Precup,Karen A. Brown,Guilherme M. Sant'Anna,Robert E. Kearney; Organizational principles of cloud storage to support collaborative biomedical research. EMBC (2015)
Pascale Gourdeau,Lara J. Kanbar,Wissam Shalish,Guilherme M. Sant'Anna,Robert E. Kearney,Doina Precup; Feature selection and oversampling in analysis of clinical data for extubation readiness in extreme preterm infants. EMBC (2015)
Lara J. Kanbar,Wissam Shalish,Carlos A. Robles-Rubio,Doina Precup,Karen A. Brown,Guilherme M. Sant'Anna,Robert E. Kearney; Correlation of clinical parameters with cardiorespiratory behavior in successfully extubated extremely preterm infants. EMBC (2015)
Nagesh K. Subbanna,Doina Precup,Douglas L. Arnold,Tal Arbel; IMaGe: Iterative Multilevel Probabilistic Graphical Model for Detection and Segmentation of Multiple Sclerosis Lesions in Brain MRI. IPMI (2015)
Borja Balle,Prakash Panangaden,Doina Precup; A Canonical Form for Weighted Automata and Applications to Approximate Minimization. LICS (2015)
Gheorghe Comanici,Doina Precup,Prakash Panangaden; Basis refinement strategies for linear value function approximation in MDPs. NIPS (2015)
Philip Bachman,Doina Precup; Data Generation as Sequential Decision Making. NIPS (2015)
Pierre-Luc Bacon,Borja Balle,Doina Precup; Learning and Planning with Timing Information in Markov Decision Processes. UAI (2015)
Philip Bachman,David Krueger,Doina Precup; Testing Visual Attention in Dynamic Environments. CoRR (2015)
Lucas Lehnert,Doina Precup; Policy Gradient Methods for Off-policy Control. CoRR (2015)
Priya Sidhaye,Jackie Chi Kit Cheung; Indicative Tweet Generation: An Extractive Summarization Problem? EMNLP (2015)
Prakash Panangaden; Probabilistic bisimulation. ACM SIGLOG News (2015)
Mohamed Yousri Mahmoud,Prakash Panangaden,Sofiène Tahar; On the Formal Verification of Optical Quantum Gates in HOL. FMICS (2015)
Costin Badescu,Prakash Panangaden; Quantum Alternation: Prospects and Problems. QPL (2015)

2014

André da Motta Salles Barreto,Joelle Pineau,Doina Precup; Policy Iteration Based on Stochastic Factorization. J. Artif. Intell. Res. (2014)
William L. Hamilton,Mahdi Milani Fard,Joelle Pineau; Efficient learning and planning with compressed predictive states. J. Mach. Learn. Res. (2014)
Andrew Sutcliffe,Daniel H. Grollman,Joelle Pineau; Estimating People's Subjective Experiences of Robot Behavior. AAAI Fall Symposia (2014)
Borja Balle,William L. Hamilton,Joelle Pineau; Methods of Moments for Learning Stochastic Languages: Unified Presentation and Empirical Comparison. ICML (2014)
Ouais Alsharif,Joelle Pineau; End-to-End Text Recognition with Hybrid HMM Maxout Models. ICLR (Workshop) (2014)
Stéphane Ross,Joelle Pineau,Sébastien Paquet,Brahim Chaib-draa; Online Planning Algorithms for POMDPs. CoRR (2014)
Mahdi Milani Fard,Joelle Pineau; Non-Deterministic Policies in Markovian Decision Processes. CoRR (2014)
Ouais Alsharif,Philip Bachman,Joelle Pineau; Lifelong Learning of Discriminative Representations. CoRR (2014)
Sara M. McCarthy,Doina Precup; Theoretical results on the effect of 'shortcut' actions in MDPs. Connect. Sci. (2014)
Negar Ghourchian,Doina Precup; Analyzing User Trajectories from Mobile Device Data with Hierarchical Dirichlet Processes. Canadian Conference on AI (2014)
Norm Ferns,Doina Precup,Sophia Knight; Bisimulation for Markov Decision Processes through Families of Functional Expressions. Horizons of the Mind (2014)
Nagesh K. Subbanna,Doina Precup,Tal Arbel; Iterative Multilevel MRF Leveraging Context and Voxel Information for Brain Tumour Segmentation in MRI. CVPR (2014)
Meltem Demirkus,Doina Precup,James J. Clark,Tal Arbel; Probabilistic Temporal Head Pose Estimation Using a Hierarchical Graphical Model. ECCV (1) (2014)
Meltem Demirkus,Doina Precup,James J. Clark,Tal Arbel; Multi-layer temporal graphical model for head pose estimation in real-world videos. ICIP (2014)
Richard S. Sutton,Ashique Rupam Mahmood,Doina Precup,Hado van Hasselt; A new Q(lambda) with interim forward view and Monte Carlo equivalence. ICML (2014)
Philip Bachman,Amir-massoud Farahmand,Doina Precup; Sample-based approximate regularization. ICML (2014)
Yuri Grinberg,Doina Precup,Michel Gendreau; Optimizing Energy Production Using Policy Search and Predictive State Representations. NIPS (2014)
Philip Bachman,Ouais Alsharif,Doina Precup; Learning with Pseudo-Ensembles. NIPS (2014)
Norman Ferns,Doina Precup; Bisimulation Metrics are Optimal Value Functions. UAI (2014)
Manuel Jorge Cardoso,Ivor J. A. Simpson,Tal Arbel,Doina Precup,Annemie Ribbens; Bayesian and grAphical Models for Biomedical Imaging - First International Workshop, BAMBI 2014, Cambridge, MA, USA, September 18, 2014, Revised Selected Papers. Lecture Notes in Computer Science (2014)
Volodymyr Kuleshov,Doina Precup; Algorithms for multi-armed bandit problems. CoRR (2014)
Amir-massoud Farahmand,Doina Precup,André da Motta Salles Barreto,Mohammad Ghavamzadeh; Classification-based Approximate Policy Iteration: Experiments and Extended Discussions. CoRR (2014)
Philippe Chaput,Vincent Danos,Prakash Panangaden,Gordon D. Plotkin; Approximating Markov Processes by Averaging. J. ACM (2014)
Prakash Panangaden; Causality in physics and computation. Theor. Comput. Sci. (2014)
Filippo Bonchi,Marcello M. Bonsangue,Helle Hvid Hansen,Prakash Panangaden,Jan J. M. M. Rutten,Alexandra Silva; Algebra-coalgebra duality in Brzozowski's minimization algorithm. ACM Trans. Comput. Log. (2014)
Richard Blute,Alessio Guglielmi,Ivan T. Ivanov,Prakash Panangaden,Lutz Straßburger; A Logical Basis for Quantum Evolution and Entanglement. Categories and Types in Logic, Language, and Physics (2014)
Andrew Cave,Francisco Ferreira,Prakash Panangaden,Brigitte Pientka; Fair reactive programming. POPL (2014)
Dexter Kozen,Radu Mardare,Prakash Panangaden; A Metrized Duality Theorem for Markov Processes. MFPS (2014)
Ross Duncan,Prakash Panangaden; Proceedings 9th Workshop on Quantum Physics and Logic, QPL 2012, Brussels, Belgium, 10-12 October 2012. EPTCS (2014)
Bob Coecke,Ichiro Hasuo,Prakash Panangaden; Proceedings of the 11th workshop on Quantum Physics and Logic, QPL 2014, Kyoto, Japan, 4-6th June 2014. EPTCS (2014)

2013

Guy Shani,Joelle Pineau,Robert Kaplow; A survey of point-based POMDP solvers. Auton. Agents Multi Agent Syst. (2013)
Jordan Frank,Shie Mannor,Joelle Pineau,Doina Precup; Time Series Analysis Using Geometric Template Matching. IEEE Trans. Pattern Anal. Mach. Intell. (2013)
Sylvie C. W. Ong,Yuri Grinberg,Joelle Pineau; Mixed Observability Predictive State Representations. AAAI (2013)
Joelle Pineau; Designing Intelligent Wheelchairs: Reintegrating AI. AAAI Spring Symposium: Designing Intelligent Robots (2013)
William L. Hamilton,Mahdi Milani Fard,Joelle Pineau; Modelling Sparse Dynamical Systems with Compressed Predictive State Representations. ICML (1) (2013)
Beomjoon Kim,Amir-massoud Farahmand,Joelle Pineau,Doina Precup; Learning from Limited Demonstrations. NIPS (2013)
Mahdi Milani Fard,Yuri Grinberg,Amir-massoud Farahmand,Joelle Pineau,Doina Precup; Bellman Error Based Feature Generation using Random Projections on Sparse Spaces. NIPS (2013)
Beomjoon Kim,Joelle Pineau; Maximum Mean Discrepancy Imitation Learning. Robotics: Science and Systems (2013)
Boyu Wang,Joelle Pineau; Online Ensemble Learning for Imbalanced Data Streams. CoRR (2013)
William L. Hamilton,Mahdi Milani Fard,Joelle Pineau; Efficient Learning and Planning with Compressed Predictive States. CoRR (2013)
Jordan Frank,Shie Mannor,Doina Precup; Generating storylines from sensor data. Pervasive Mob. Comput. (2013)
Nastaran Jafarpour,Doina Precup,Masoumeh T. Izadi,David L. Buckeridge; Using Hierarchical Mixture of Experts Model for Fusion of Outbreak Detection Methods. AMIA (2013)
Clement Gehring,Doina Precup; Smart exploration in reinforcement learning using absolute temporal difference errors. AAMAS (2013)
Arian Hosseinzadeh,Masoumeh T. Izadi,Aman Verma,Doina Precup,David L. Buckeridge; Assessing the Predictability of Hospital Readmission Using Machine Learning. IAAI (2013)
Yuri Grinberg,Doina Precup; Average Reward Optimization Objective In Partially Observable Domains. ICML (1) (2013)
Negar Ghourchian,Doina Precup; Smart Classifier Selection for Activity Recognition on Wearable Devices. ICPRAM (2013)
Nagesh K. Subbanna,Doina Precup,D. Louis Collins,Tal Arbel; Hierarchical Probabilistic Gabor and MRF Segmentation of Brain Tumours in MRI Volumes. MICCAI (1) (2013)
Philip Bachman,Doina Precup; Greedy Confidence Pursuit: A Pragmatic Approach to Multi-bandit Optimization. ECML/PKDD (1) (2013)
S. Barry Cooper,Elham Kashefi,Prakash Panangaden; Preface to special issue: Developments In Computational Models 2010. Math. Struct. Comput. Sci. (2013)
Prakash Panangaden; Quantum Field Theory for Legspinners. Computation, Logic, Games, and Quantum Foundations (2013)
Prakash Panangaden; Duality in Logic and Computation. LICS (2013)
Dexter Kozen,Kim G. Larsen,Radu Mardare,Prakash Panangaden; Stone Duality for Markov Processes. LICS (2013)
Dexter Kozen,Radu Mardare,Prakash Panangaden; Strong Completeness for Markovian Logics. MFCS (2013)
Bob Coecke,Luke Ong,Prakash Panangaden; Computation, Logic, Games, and Quantum Foundations. The Many Facets of Samson Abramsky - Essays Dedicated to Samson Abramsky on the Occasion of His 60th Birthday. Lecture Notes in Computer Science (2013)

2012

Finale Doshi-Velez,Joelle Pineau,Nicholas Roy; Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs. Artif. Intell. (2012)
ShaoWei Png,Joelle Pineau,Brahim Chaib-draa; Building Adaptive Dialogue Systems Via Bayes-Adaptive POMDPs. IEEE J. Sel. Top. Signal Process. (2012)
Mahdi Milani Fard,Yuri Grinberg,Joelle Pineau,Doina Precup; Compressed Least-Squares Regression on Sparse Spaces. AAAI (2012)
Emily Tsang,Sylvie C. W. Ong,Joelle Pineau; Design and Evaluation of a Flexible Interface for Spatial Navigation. CRV (2012)
Cosmin Paduraru,Doina Precup,Joelle Pineau,Gheorghe Comanici; An Empirical Analysis of Off-policy Learning in Discrete MDPs. EWRL (2012)
André da Motta Salles Barreto,Doina Precup,Joelle Pineau; On-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization. NIPS (2012)
Kun Deng,Joelle Pineau,Susan A. Murphy; Active Learning for Developing Personalized Treatment. CoRR (2012)
Mahdi Milani Fard,Joelle Pineau,Csaba Szepesvári; PAC-Bayesian Policy Evaluation for Reinforcement Learning. CoRR (2012)
Stéphane Ross,Joelle Pineau; Model-Based Bayesian Reinforcement Learning in Large Structured Domains. CoRR (2012)
John Langford,Joelle Pineau; Proceedings of the 29th International Conference on Machine Learning (ICML-12). CoRR (2012)
Joelle Pineau,Geoffrey J. Gordon,Sebastian Thrun; Policy-contingent abstraction for robust robot control. CoRR (2012)
Noa Agmon,Vikas Agrawal,David W. Aha,Yiannis Aloimonos,Donagh Buckley,Prashant Doshi,Christopher W. Geib,Floriana Grasso,Nancy L. Green,Benjamin Johnston,Burt Kaliski,Christopher Kiekintveld,Edith Law,Henry Lieberman,Ole J. Mengshoel,Ted Metzler,Joseph Modayil,Douglas W. Oard,Nilufer Onder,Barry O'Sullivan,Katerina Pastra,Doina Precup,Sowmya Ramachandran,Chris Reed,Sanem Sariel Talay,Ted Selker,Lokendra Shastri,Stephen F. Smith,Satinder Singh,Siddharth Srivastava,Gita Sukthankar,David C. Uthus,Mary-Anne Williams; Reports of the AAAI 2011 Conference Workshops. AI Mag. (2012)
Philip A. Warrick,Emily F. Hamilton,Robert E. Kearney,Doina Precup; A Machine Learning Approach to the Detection of Fetal Hypoxia during Labor and Delivery. AI Mag. (2012)
Susanne Still,Doina Precup; An information-theoretic approach to curiosity-driven reinforcement learning. Theory Biosci. (2012)
Arian Hosseinzadeh,Masoumeh T. Izadi,Doina Precup,David L. Buckeridge; Mining Administrative Data to Predict Falls in the Elderly Population. Canadian Conference on AI (2012)
Meltem Demirkus,Doina Precup,James J. Clark,Tal Arbel; Soft biometric trait classification from real-world face videos conditioned on head pose estimation. CVPR Workshops (2012)
Doina Precup,Carlos A. Robles-Rubio,Karen A. Brown,Lara J. Kanbar,Jennifer Kaczmarek,Sanjay Chawla,Guilherme M. Sant'Anna,Robert E. Kearney; Prediction of extubation readiness in extreme preterm infants based on measures of cardiorespiratory variability. EMBC (2012)
Doina Precup,Philip Bachman; Improved Estimation in Time Varying Models. ICML (2012)
Amir Massoud Farahmand,Doina Precup; Value Pursuit Iteration. NIPS (2012)
Gheorghe Comanici,Prakash Panangaden,Doina Precup; On-the-Fly Algorithms for Bisimulation Metrics. QEST (2012)
Yuri Grinberg,Doina Precup; On Average Reward Policy Evaluation in Infinite-State Partially Observable Systems. AISTATS (2012)
Norman Ferns,Pablo Samuel Castro,Doina Precup,Prakash Panangaden; Methods for computing state similarity in Markov Decision Processes. CoRR (2012)
Norman Ferns,Prakash Panangaden,Doina Precup; Metrics for Markov Decision Processes with Infinite State Spaces. CoRR (2012)
Norman Ferns,Prakash Panangaden,Doina Precup; Metrics for Finite Markov Decision Processes. CoRR (2012)
Richard Blute,Prakash Panangaden,Sergey Slavnov; Deep Inference and Probabilistic Coherence Spaces. Appl. Categorical Struct. (2012)
Konstantinos Chatzikokolakis,Sophia Knight,Catuscia Palamidessi,Prakash Panangaden; Epistemic Strategies and Games on Concurrent Processes. ACM Trans. Comput. Log. (2012)
Sophia Knight,Radu Mardare,Prakash Panangaden; Combining Epistemic Logic and Hennessy-Milner Logic. Logic and Program Semantics (2012)
Prakash Panangaden; Dexter Kozen's Influence on the Theory of Labelled Markov Processes. Logic and Program Semantics (2012)
Sophia Knight,Catuscia Palamidessi,Prakash Panangaden,Frank D. Valencia; Spatial and Epistemic Modalities in Constraint-Based Process Calculi. CONCUR (2012)
Kim Guldstrand Larsen,Radu Mardare,Prakash Panangaden; Taking It to the Limit: Approximate Reasoning for Markov Processes. MFCS (2012)
Nick Bezhanishvili,Clemens Kupke,Prakash Panangaden; Minimization via Duality. WoLLIC (2012)
Stephen D. Brookes,Achim Jung,Catherine A. Meadows,Michael W. Mislove,Prakash Panangaden; Dedication. MFPS (2012)

2011

Stéphane Ross,Joelle Pineau,Brahim Chaib-draa,Pierre Kreitmann; A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes. J. Mach. Learn. Res. (2011)
Susan M. Shortreed,Eric B. Laber,Daniel J. Lizotte,T. Scott Stroup,Joelle Pineau,Susan A. Murphy; Informing sequential clinical decision-making through reinforcement learning: an empirical study. Mach. Learn. (2011)
Robert D. Vincent,Aaron C. Courville,Joelle Pineau; A bistable computational model of recurring epileptiform activity as observed in rodent slice preparations. Neural Networks (2011)
Guillaume Saulnier,Joelle Pineau; Automatic Seizure Detection in an In-Vivo Model of Epilepsy. AAAI Spring Symposium: Computational Physiology (2011)
Kun Deng,Joelle Pineau,Susan A. Murphy; Active learning for personalizing treatment. ADPRL (2011)
Athena K. Moghaddam,Joelle Pineau,Jordan Frank,Philippe S. Archambault,François Routhier,Therese Audet,Jan Polgar,François Michaud,Patrick Boissy; Mobility profile and wheelchair driving skills of powered wheelchair users: Sensor-based event recognition using a support vector machine classifier. EMBC (2011)
Sylvie C. W. Ong,Yuri Grinberg,Joelle Pineau; Goal-Directed Online Learning of Predictive Models. EWRL (2011)
Cosmin Paduraru,Doina Precup,Joelle Pineau; A Framework for Computing Bounds for the Return of a Policy. EWRL (2011)
ShaoWei Png,Joelle Pineau; Bayesian reinforcement learning for POMDP-based dialogue systems. ICASSP (2011)
André da Motta Salles Barreto,Doina Precup,Joelle Pineau; Reinforcement Learning using Kernel-Based Stochastic Factorization. NIPS (2011)
Monica Dinculescu,Christopher Hundt,Prakash Panangaden,Joelle Pineau,Doina Precup; The Duality of State and Observation in Probabilistic Transition Systems. TbiLLC (2011)
Joelle Pineau,Geoffrey J. Gordon,Sebastian Thrun; Anytime Point-Based Approximations for Large POMDPs. CoRR (2011)
Norm Ferns,Prakash Panangaden,Doina Precup; Bisimulation Metrics for Continuous Markov Decision Processes. SIAM J. Comput. (2011)
Philip Bachman,Doina Precup; Learning Compact Representations of Time-Varying Processes. AAAI (2011)
Gheorghe Comanici,Doina Precup; Basis Function Discovery Using Spectral Clustering and Bisimulation Metrics. AAAI (2011)
Jordan Frank,Shie Mannor,Doina Precup; Activity Recognition with Time-Delay Emobeddings. AAAI Spring Symposium: Computational Physiology (2011)
Richard S. Sutton,Joseph Modayil,Michael Delp,Thomas Degris,Patrick M. Pilarski,Adam White,Doina Precup; Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. AAMAS (2011)
Gheorghe Comanici,Doina Precup; Basis function discovery using spectral clustering and bisimulation metrics. AAMAS (2011)
Pablo Samuel Castro,Doina Precup; Automatic Construction of Temporally Extended Actions for MDPs Using Bisimulation Metrics. EWRL (2011)
Nagesh K. Subbanna,Simon J. Francis,Doina Precup,D. Louis Collins,Douglas L. Arnold,Tal Arbel; Adapted MRF Segmentation of Multiple Sclerosis Lesions Using Local Contextual Information. MIUA (2011)
Jordan Frank,Shie Mannor,Doina Precup; Activity Recognition with Mobile Phones. ECML/PKDD (3) (2011)
Prakash Panangaden; Quantum Information Channels in Curved Spacetime. CiE (2011)
Prakash Panangaden; The Search for Structure in Quantum Computation. FoSSaCS (2011)
Prakash Panangaden; The Meaning of Semantics. LICS (2011)

2010

Keith Bush,Joelle Pineau; Treating Epilepsy by Reinforcement Learning Via Manifold-Based Simulation. AAAI Fall Symposium: Manifold Learning and Its Applications (2010)
Robert West,Doina Precup,Joelle Pineau; Automatically suggesting topics for augmenting text documents. CIKM (2010)
Robert Kaplow,Amin Atrash,Joelle Pineau; Variable resolution decomposition for robotic navigation under a POMDP framework. ICRA (2010)
Arthur Guez,Joelle Pineau; Multi-tasking SLAM. ICRA (2010)
Mahdi Milani Fard,Joelle Pineau; PAC-Bayesian Model Selection for Reinforcement Learning. NIPS (2010)
Joelle Pineau,Robert West,Amin Atrash,Julien Villemure,François Routhier; Towards a standardized test for intelligent wheelchairs. PerMIS (2010)
Philip A. Warrick,Emily F. Hamilton,Doina Precup,Robert E. Kearney; Classification of Normal and Hypoxic Fetuses From Systems Modeling of Intrapartum Cardiotocography. IEEE Trans. Biomed. Eng. (2010)
Pablo Samuel Castro,Doina Precup; Using Bisimulation for Policy Transfer in MDPs. AAAI (2010)
Jordan Frank,Shie Mannor,Doina Precup; Activity and Gait Recognition with Time-Delay Embeddings. AAAI (2010)
Gheorghe Comanici,Doina Precup; Optimal policy switching algorithms for reinforcement learning. AAMAS (2010)
Pablo Samuel Castro,Doina Precup; Using bisimulation for policy transfer in MDPs. AAMAS (2010)
Prakash Panangaden,Caitlin Phillips,Doina Precup,Mehrnoosh Sadrzadeh; An Algebraic Approach to Dynamic Epistemic Logic. Description Logics (2010)
Jordan Frank,Shie Mannor,Doina Precup; A novel similarity measure for time series data with applications to gait and activity recognition. UbiComp (Adjunct Papers) (2010)
Monica Dinculescu,Doina Precup; Approximate Predictive Representations of Partially Observable Systems. ICML (2010)
Pablo Samuel Castro,Doina Precup; Smarter Sampling in Model-Based Bayesian Reinforcement Learning. ECML/PKDD (1) (2010)
Fabian Kaelin,Doina Precup; A Study of Approximate Inference in Probabilistic Relational Models. ACML (2010)
Josée Desharnais,Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Weak bisimulation is sound and complete for pCTL*. Inf. Comput. (2010)
Susanna Donatelli,Prakash Panangaden,Gerardo Rubino; Special Issue on "Quantitative Evaluation of Systems". Perform. Evaluation (2010)
Prakash Panangaden,Mehrnoosh Sadrzadeh; Learning in a Changing World, an Algebraic Modal Logical Approach. AMAST (2010)
Prakash Panangaden,Mehrnoosh Sadrzadeh; Towards a Logic for Reasoning About Learning in a Changing World. LAM@LICS (2010)
S. Barry Cooper,Prakash Panangaden,Elham Kashefi; Proceedings Sixth Workshop on Developments in Computational Models: Causality, Computation, and Physics, DCM 2010, Edinburgh, Scotland, 9-10th July 2010. EPTCS (2010)

2009

2008

Mahdi Milani Fard,Joelle Pineau,Peng Sun; A Variance Analysis for POMDP Policy Evaluation. AAAI (2008)
Arthur Guez,Robert D. Vincent,Massimo Avoli,Joelle Pineau; Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning. AAAI (2008)
Finale Doshi,Joelle Pineau,Nicholas Roy; Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs. ICML (2008)
Stéphane Ross,Brahim Chaib-draa,Joelle Pineau; Bayesian reinforcement learning in continuous POMDPs with application to robot navigation. ICRA (2008)
Finale Doshi,Joelle Pineau,Nicholas Roy; Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs. ISAIM (2008)
Joelle Pineau,Stéphane Ross,Brahim Chaib-draa; Bayes-Adaptive POMDPs: A New Perspective on the Explore-Exploit Tradeoff in Partially Observable Domains. ISAIM (2008)
Mahdi Milani Fard,Joelle Pineau; MDPs with Non-Deterministic Policies. NIPS (2008)
Rupert Brooks,Tal Arbel,Doina Precup; Anytime similarity measures for faster alignment. Comput. Vis. Image Underst. (2008)
Masoumeh T. Izadi,Doina Precup; Point-Based Planning for Predictive State Representations. Canadian Conference on AI (2008)
Jordan Frank,Shie Mannor,Doina Precup; Reinforcement learning in the presence of rare events. ICML (2008)
Jonathan Taylor,Doina Precup,Prakash Panangaden; Bounding Performance Loss in Approximate MDP Homomorphisms. NIPS (2008)
Konstantinos Chatzikokolakis,Catuscia Palamidessi,Prakash Panangaden; Anonymity protocols as noisy channels. Inf. Comput. (2008)
Konstantinos Chatzikokolakis,Catuscia Palamidessi,Prakash Panangaden; On the Bayes risk in information-hiding protocols. J. Comput. Secur. (2008)
Ralph Kopperman,Prakash Panangaden,Michael B. Smyth,Dieter Spreen; Foreword. Theor. Comput. Sci. (2008)
Keye Martin,Prakash Panangaden; Domain Theory and the Causal Structure of Space-Time. CiE (2008)
Prakash Panangaden; Knowledge and Information in Probabilistic Systems. CONCUR (2008)
Yannick Delbecque,Prakash Panangaden; Game Semantics for Quantum Stores. MFPS (2008)
Keye Martin,Prakash Panangaden; A Technique for Verifying Measurements. MFPS (2008)

2007

Robin Jaulmes,Joelle Pineau,Doina Precup; Apprentissage actif dans les processus décisionnels de Markov partiellement observables L'algorithme MEDUSA. Rev. d'Intelligence Artif. (2007)
Christopher Hundt,Prakash Panangaden,Joelle Pineau,Doina Precup; Representing Systems with Hidden State. AAAI Fall Symposium: Computational Approaches to Representation Change during Learning and Development (2007)
Joelle Pineau,Amin Atrash; SmartWheeler: A Robotic Wheelchair Test-Bed for Investigating New Models of Human-Robot Interaction. AAAI Spring Symposium: Multidisciplinary Collaboration for Socially Assistive Robotics (2007)
Robert D. Vincent,Joelle Pineau,Philip de Guzman,Massimo Avoli; Recurrent Boosting for Classification of Natural and Synthetic Time-Series Data. Canadian Conference on AI (2007)
Robin Jaulmes,Joelle Pineau,Doina Precup; A formal framework for robot learning and control under model uncertainty. ICRA (2007)
Stéphane Ross,Brahim Chaib-draa,Joelle Pineau; Bayes-Adaptive POMDPs. NIPS (2007)
Stéphane Ross,Joelle Pineau,Brahim Chaib-draa; Theoretical Analysis of Heuristic Search Methods for Online POMDPs. NIPS (2007)
Marc G. Bellemare,Doina Precup; Context-Driven Predictions. IJCAI (2007)
Rupert Brooks,Tal Arbel,Doina Precup; Fast Image Alignment Using Anytime Algorithms. IJCAI (2007)
Pablo Samuel Castro,Doina Precup; Using Linear Programming for Bayesian Exploration in Markov Decision Processes. IJCAI (2007)
Vincent Danos,Ellie D'Hondt,Elham Kashefi,Prakash Panangaden; Distributed Measurement-based Quantum Computation. Electron. Notes Theor. Comput. Sci. (2007)
Richard Blute,Prakash Panangaden,Dorette Pronk; Conformal Field Theory as a Nuclear Functor. Electron. Notes Theor. Comput. Sci. (2007)
Vincent Danos,Elham Kashefi,Prakash Panangaden; The measurement calculus. J. ACM (2007)
Konstantinos Chatzikokolakis,Catuscia Palamidessi,Prakash Panangaden; Probability of Error in Information-Hiding Protocols. CSF (2007)
Romain Beauxis,Konstantinos Chatzikokolakis,Catuscia Palamidessi,Prakash Panangaden; Formal Approaches to Information-Hiding (Tutorial). TGC (2007)
Ralph Kopperman,Prakash Panangaden,Michael B. Smyth,Dieter Spreen; Computational Structures for Modelling Space, Time and Causality, 20.08. - 25.08.2006. Dagstuhl Seminar Proceedings (2007)

2006

Nikos Vlassis,Geoffrey J. Gordon,Joelle Pineau; Planning under uncertainty in robotics. Robotics Auton. Syst. (2006)
Daniel Burfoot,Joelle Pineau,Gregory Dudek; RRT-Plan: A Randomized Algorithm for STRIPS Planning. ICAPS (2006)
Ricard Gavaldà,Philipp W. Keller,Joelle Pineau,Doina Precup; PAC-Learning of Markov Models with Hidden State. ECML (2006)
Masoumeh T. Izadi,Doina Precup,Danielle Azar; Belief Selection in Point-Based Planning Algorithms for POMDPs. Canadian Conference on AI (2006)
Philip A. Warrick,Robert E. Kearney,Doina Precup,Emily F. Hamilton; Linear models of intrapartum uterine pressure-fetal heart rate interaction for the normal and hypoxic fetus. EMBC (2006)
Philipp W. Keller,Shie Mannor,Doina Precup; Automatic basis function construction for approximate dynamic programming and reinforcement learning. ICML (2006)
Beibei Zou,Xuesong Ma,Bettina Kemme,Glen Newton,Doina Precup; Data Mining Using Relational Database Management Systems. PAKDD (2006)
Norm Ferns,Pablo Samuel Castro,Doina Precup,Prakash Panangaden; Methods for Computing State Similarity in Markov Decision Processes. UAI (2006)
Vincent Danos,Josée Desharnais,François Laviolette,Prakash Panangaden; Bisimulation and cocongruence for probabilistic systems. Inf. Comput. (2006)
Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Approximate reasoning for real-time probabilistic processes. Log. Methods Comput. Sci. (2006)
Ellie D'Hondt,Prakash Panangaden; Quantum weakest preconditions. Math. Struct. Comput. Sci. (2006)
Ellie D'Hondt,Prakash Panangaden; The computational power of the W And GHZ States. Quantum Inf. Comput. (2006)
Vincent Danos,Elham Kashefi,Prakash Panangaden; The One Way to Quantum Computation. ICALP (2) (2006)
Konstantinos Chatzikokolakis,Catuscia Palamidessi,Prakash Panangaden; Anonymity Protocols as Noisy Channels. TGC (2006)
Ralph Kopperman,Prakash Panangaden,Michael B. Smyth,Dieter Spreen; 06341 Abstracts Collection -- Computational Structures for Modelling Space, Time and Causality. Computational Structures for Modelling Space, Time and Causality (2006)

2005

Robin Jaulmes,Joelle Pineau,Doina Precup; Active Learning in Partially Observable Markov Decision Processes. ECML (2005)
Joelle Pineau,Geoffrey J. Gordon; POMDP Planning for Robust Robot Control. ISRR (2005)
Ion Muslea,Virginia Dignum,Daniel D. Corkill,Catholijn M. Jonker,Frank Dignum,Silvia Coradeschi,Alessandro Saffiotti,Dan Fu,Jeff Orkin,William Cheetham,Kai Goebel,Piero P. Bonissone,Leen-Kiat Soh,Randolph M. Jones,Robert E. Wray III,Matthias Scheutz,Daniela Pucci de Farias,Shie Mannor,Georgios Theocharous,Doina Precup,Bamshad Mobasher,Sarabjot S. Anand,Bettina Berendt,Andreas Hotho,Hans W. Guesgen,Michael T. Rosenstein,Mohammad Ghavamzadeh; The Workshop Program at the Nineteenth National Conference on Artificial Intelligence. AI Mag. (2005)
Masoumeh T. Izadi,Doina Precup; Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes. ECML (2005)
Masoumeh T. Izadi,Doina Precup; Model minimization by linear PSR. IJCAI (2005)
Masoumeh T. Izadi,Ajit V. Rajwade,Doina Precup; Using core beliefs for point-based value iteration. IJCAI (2005)
Doina Precup,Richard S. Sutton,Cosmin Paduraru,Anna Koop,Satinder Singh; Off-policy Learning with Options and Recognizers. NIPS (2005)
Alexandre Bouchard-Côté,Norm Ferns,Prakash Panangaden,Doina Precup; An approximation algorithm for labelled Markov processes: towards realistic approximation. QEST (2005)
Ellie D'Hondt,Prakash Panangaden; Reasoning About Quantum Knowledge. FSTTCS (2005)
Ralph Kopperman,Prakash Panangaden,Michael B. Smyth,Dieter Spreen,Julian Webster; 04351 Summary - Spatial Representation: Discrete vs. Continuous Computational Models. Spatial Representation (2005)
Ralph Kopperman,Prakash Panangaden,Michael B. Smyth,Dieter Spreen,Julian Webster; 04351 Abstracts Collection - Spatial Representation: Discrete vs. Continuous Computational Models. Spatial Representation (2005)
Keye Martin,Prakash Panangaden; A domain of spacetime intervals in general relativity. Spatial Representation (2005)

2004

Doina Precup,Paul E. Utgoff; Classification Using Phi-Machines and Constructive Function Approximation. Mach. Learn. (2004)
Philipp W. Keller,Felix-Olivier Duguay,Doina Precup; Redagent: winner of TAC SCM 2003. SIGecom Exch. (2004)
Philipp W. Keller,Felix-Olivier Duguay,Doina Precup; RedAgent-2003: An Autonomous Market-Based Supply-Chain Management Agent. AAMAS (2004)
Bohdana Ratitch,Doina Precup; Sparse Distributed Memories for On-Line Value-Based Reinforcement Learning. ECML (2004)
Jan J. M. M. Rutten,Marta Z. Kwiatkowska,Gethin Norman,David Parker,Prakash Panangaden; Mathematical techniques for analyzing concurrent and probabilistic systems. CRM monograph series (2004)
Vincent Danos,Josée Desharnais,Prakash Panangaden; Labelled Markov Processes: Stronger and Faster Approximations. Electron. Notes Theor. Comput. Sci. (2004)
Thomas T. Hildebrandt,Prakash Panangaden,Glynn Winskel; A relational model of non-deterministic dataflow. Math. Struct. Comput. Sci. (2004)
Josée Desharnais,Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Metrics for labelled Markov processes. Theor. Comput. Sci. (2004)
Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Approximate Reasoning for Real-Time Probabilistic Processes. QEST (2004)
Riccardo Pucella,Prakash Panangaden; On the Expressive Power of First-Order Boolean Functions in PCF. CoRR (2004)

2003

Bohdana Ratitch,Doina Precup; Using MDP Characteristics to Guide Exploration in Reinforcement Learning. ECML (2003)
François Rivest,Doina Precup; Combining TD-learning with Cascade-correlation Networks. ICML (2003)
Masoumeh T. Izadi,Doina Precup; A Planning Algorithm for Predictive State Representations. IJCAI (2003)
Josée Desharnais,Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Approximating labelled Markov processes. Inf. Comput. (2003)
Josée Desharnais,Prakash Panangaden; Continuous stochastic logic characterizes bisimulation of continuous-time Markov processes. J. Log. Algebraic Methods Program. (2003)
Uwe Nestmann,Prakash Panangaden; Guest Editors' Foreword. Nord. J. Comput. (2003)
Vincent Danos,Josée Desharnais,Prakash Panangaden; Conditional Expectation and the Approximation of Labelled Markov Processes. CONCUR (2003)
Stephen D. Brookes,Prakash Panangaden; Proceedings of 19th Conference on the Mathematical Foundations of Programming Semantics, MFPS 2003, Université de Montréal, QC, Canada, March 19-22, 2003. Electronic Notes in Theoretical Computer Science (2003)

2002

Ioan Alfred Letia,Doina Precup; Developing Collaborative Golog Agents by Reinforcement Learning. Int. J. Artif. Intell. Tools (2002)
Bohdana Ratitch,Doina Precup; Characterizing Markov Decision Processes. ECML (2002)
Danielle Azar,Doina Precup,Salah Bouktif,Balázs Kégl,Houari A. Sahraoui; Combining and Adapting Software Quality Predictive Models by Genetic Algorithms. ASE (2002)
Theodore J. Perkins,Doina Precup; A Convergent Form of Approximate Policy Iteration. NIPS (2002)
Martin Stolle,Doina Precup; Learning Options in Reinforcement Learning. SARA (2002)
Josée Desharnais,Abbas Edalat,Prakash Panangaden; Bisimulation for Labelled Markov Processes. Inf. Comput. (2002)
Josée Desharnais,Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Weak Bisimulation is Sound and Complete for PCTL*. CONCUR (2002)
Josée Desharnais,Radha Jagadeesan,Vineet Gupta,Prakash Panangaden; The Metric Analogue of Weak Bisimulation for Probabilistic Processes. LICS (2002)
Luca Aceto,Prakash Panangaden; 8th International Workshop on Expressiveness in Concurrency, EXPRESS 2001, Satellite Workshop from CONCUR 2001, Aalborg, Denmark, August 20, 2001. Electronic Notes in Theoretical Computer Science (2002)
Uwe Nestmann,Prakash Panangaden; 9th International Workshop on Expressiveness in Concurrency, EXPRESS 2002, Satellite Workshop from CONCUR 2002, Brno, Czech Republic, August 19, 2002. Electronic Notes in Theoretical Computer Science (2002)

2001

Doina Precup,Richard S. Sutton,Sanjoy Dasgupta; Off-Policy Temporal Difference Learning with Function Approximation. ICML (2001)
Prakash Panangaden; Does Combining Nondeterminism and Probability Make Sense? Bull. EATCS (2001)
Prakash Panangaden; Measure and probability for concurrency theorists. Theor. Comput. Sci. (2001)
Riccardo Pucella,Prakash Panangaden; On the expressive power of first-order boolean functions in PCF. Theor. Comput. Sci. (2001)
Prakash Panangaden; Does Concurrency Theory Have Anything to Say About Parallel Programming? Current Trends in Theoretical Computer Science (2001)

2000

Prakash Panangaden,Clark Verbrugge; Generating irregular partitionable data structures. Theor. Comput. Sci. (2000)
Josée Desharnais,Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Approximating Labeled Markov Processes. LICS (2000)
Prakash Panangaden; From logic to stochastic processes (abstract only). PPDP (2000)

1999

Prakash Panangaden; The Category of Markov Kernels. Electron. Notes Theor. Comput. Sci. (1999)
Josée Desharnais,Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Metrics for Labeled Markov Systems. CONCUR (1999)
Vineet Gupta,Radha Jagadeesan,Prakash Panangaden; Stochastic Processes as Concurrent Constraint Programs. POPL (1999)

1998

Thomas T. Hildebrandt,Prakash Panangaden,Glynn Winskel; A Relational Model of Non-deterministic Dataflow. CONCUR (1998)
Josée Desharnais,Abbas Edalat,Prakash Panangaden; A Logical Characterization of Bisimulation for Labeled Markov Processes. LICS (1998)