Skip to main content

George Dimitri Konidaris

Adjunct Assistant Professor of Computer Science
Computer Science
421 Chapel Dr, Durham, NC 27710

Selected Publications


Skill Transfer for Temporal Task Specification

Conference Proceedings - IEEE International Conference on Robotics and Automation · January 1, 2024 Deploying robots in real-world environments, such as households and manufacturing lines, requires generalization across novel task specifications without violating safety constraints. Linear temporal logic (LTL) is a widely used task specification language ... Full text Cite

Composable Interaction Primitives: A Structured Policy Class for Efficiently Learning Sustained-Contact Manipulation Skills

Conference Proceedings - IEEE International Conference on Robotics and Automation · January 1, 2024 We propose a new policy class, Composable Interaction Primitives (CIPs), specialized for learning sustained-contact manipulation skills like opening a drawer, pulling a lever, turning a wheel, or shifting gears. CIPs have two primary design goals: to minim ... Full text Cite

Robot Task Planning under Local Observability

Conference Proceedings - IEEE International Conference on Robotics and Automation · January 1, 2024 Real-world robot task planning is intractable in part due to partial observability. A common approach to reducing complexity is introducing additional structure into the decision process, such as mixed-observability, factored states, or temporally-extended ... Full text Cite

Language-guided Skill Learning with Temporal Variational Inference

Conference Proceedings of Machine Learning Research · January 1, 2024 We present an algorithm for skill discovery from expert demonstrations. The algorithm first utilizes Large Language Models (LLMs) to propose an initial segmentation of the trajectories. Following that, a hierarchical variational inference framework incorpo ... Cite

Model-based Reinforcement Learning for Parameterized Action Spaces

Conference Proceedings of Machine Learning Research · January 1, 2024 We propose a novel model-based reinforcement learning algorithm-Dynamics Learning and predictive control with Parameterized Actions (DLPA)-for Parameterized Action Markov Decision Processes (PAMDPs).The agent learns a parameterized-action-conditioned dynam ... Cite

Lang2LTL-2: Grounding Spatiotemporal Navigation Commands Using Large Language and Vision-Language Models

Conference IEEE International Conference on Intelligent Robots and Systems · January 1, 2024 Grounding spatiotemporal navigation commands to structured task specifications enables autonomous robots to understand a broad range of natural language and solve long-horizon tasks with safety guarantees. Prior works mostly focus on grounding spatial or t ... Full text Cite

EPO: Hierarchical LLM Agents with Environment Preference Optimization

Conference EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference · January 1, 2024 Long-horizon decision-making tasks present significant challenges for LLM-based agents due to the need for extensive planning over multiple steps. In this paper, we propose a hierarchical framework that decomposes complex tasks into manageable subgoals, ut ... Cite

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

Conference Advances in Neural Information Processing Systems · January 1, 2024 Reinforcement learning algorithms typically rely on the assumption that the environment dynamics and value function can be expressed in terms of a Markovian state representation. However, when state information is only partially observable, how can an agen ... Cite

Q-functionals for Value-Based Continuous Control

Conference Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 · June 27, 2023 We present Q-functionals, an alternative architecture for continuous control deep reinforcement learning. Instead of returning a single value for a state-action pair, our network transforms a state into a function that can be rapidly evaluated in parallel ... Full text Cite

Automatic encoding and repair of reactive high-level tasks with learned abstract representations

Journal Article International Journal of Robotics Research · April 1, 2023 We present a framework for the automatic encoding and repair of high-level tasks. Given a set of skills a robot can perform, our approach first abstracts sensor data into symbols and then automatically encodes the robot’s capabilities in Linear Temporal Lo ... Full text Cite

A domain-agnostic approach for characterization of lifelong learning systems.

Journal Article Neural networks : the official journal of the International Neural Network Society · March 2023 Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original trainin ... Full text Cite

Coarse-Grained Smoothness for Reinforcement Learning in Metric Spaces

Conference Proceedings of Machine Learning Research · January 1, 2023 Principled decision-making in continuous state-action spaces is impossible without some assumptions. A common approach is to assume Lipschitz continuity of the Q-function. We show that, unfortunately, this property fails to hold in many typical domains. We ... Cite

Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning

Conference Proceedings of Machine Learning Research · January 1, 2023 We propose a new method for count-based exploration in high-dimensional state spaces. Unlike previous work which relies on density models, we show that counts can be derived by averaging samples from the Rademacher distribution (or coin flips). This insigh ... Cite

Meta-Learning Parameterized Skills

Conference Proceedings of Machine Learning Research · January 1, 2023 We propose a novel parameterized skill-learning algorithm that aims to learn transferable parameterized skills and synthesize them into a new action space that supports efficient learning in long-horizon tasks. We propose to leverage off-policy Meta-RL com ... Cite

RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents

Conference Proceedings of Machine Learning Research · January 1, 2023 We introduce RLang, a domain-specific language (DSL) for communicating domain knowledge to an RL agent. Unlike existing RL DSLs that ground to single elements of a decision-making formalism (e.g., the reward function or policy), RLang can specify informati ... Cite

Constrained Dynamic Movement Primitives for Collision Avoidance in Novel Environments

Conference IEEE International Conference on Intelligent Robots and Systems · January 1, 2023 Dynamic movement primitives are widely used for learning skills that can be demonstrated to a robot by a skilled human or controller. While their generalization capabilities and simple formulation make them very appealing to use, they possess no strong gua ... Full text Cite

Skill Generalization with Verbs

Conference IEEE International Conference on Intelligent Robots and Systems · January 1, 2023 It is imperative that robots can understand natural language commands issued by humans. Such commands typically contain verbs that signify what action should be performed on a given object and that are applicable to many objects. We propose a method for ge ... Full text Cite

Improved Inference of Human Intent by Combining Plan Recognition and Language Feedback

Conference IEEE International Conference on Intelligent Robots and Systems · January 1, 2023 Conversational assistive robots can aid people, especially those with cognitive impairments, to accomplish various tasks such as cooking meals, performing exercises, or operating machines. However, to interact with people effectively, robots must recognize ... Full text Cite

Synthesizing Navigation Abstractions for Planning with Portable Manipulation Skills

Conference Proceedings of Machine Learning Research · January 1, 2023 We address the problem of efficiently learning high-level abstractions for task-level robot planning. Existing approaches require large amounts of data and fail to generalize learned abstractions to new environments. To address this, we propose to exploit ... Cite

Effectively Learning Initiation Sets in Hierarchical Reinforcement Learning

Conference Advances in Neural Information Processing Systems · January 1, 2023 An agent learning an option in hierarchical reinforcement learning must solve three problems: identify the option's subgoal (termination condition), learn a policy, and learn where that policy will succeed (initiation set). The termination condition is typ ... Cite