Harvard University | SEAS

The Boundary of instability as a powerful experimental paradigm for understanding complex dynamical sensorimotor behavior: dexterous manipulation as an …

Numerous studies of limbs and fingers propose that force-velocity properties of muscle limit maximal voluntary force production during anisometric tasks, i.e. when muscles are shortening or lengthening.

Bookmark
Download
- by Veronica J. Santos and +1
  Madhusudhan Venkadesan
- •
- 5
  Neuroscience, Skeletal muscle biology, Time Factors, Biomechanical Phenomena

Boosting is a general method for training an ensemble of classifiers with a view to improving performance relative to that of a single classifier. While the original AdaBoost algorithm has been defined for classification tasks, the... more

Bookmark
Download
- by Christos Dimitrakakis
- •
- Ensemble Methods

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schemes without value functions, which focus on policy... more

Bookmark
Download
- by Christos Dimitrakakis and +1
  Michail G Lagoudakis
- •
- Reinforcement Learning

Ensemble algorithms can improve the performance of a given learning algorithm through the combination of multiple base classifiers into an ensemble. In this paper we attempt to train and combine the base classifiers using an adaptive... more

Supervised learning deals with the inference of a distribution over an output or label space Y conditioned on points in an observation space X , given a training dataset D of pairs in X × Y. However, in a lot of applications of interest,... more

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds for the multi-armed bandit problem, one for the infinite... more

We apply boosting techniques to the problem of word error rate minimisation in speech recognition. This is achieved through a new definition of sample error for boosting and a training procedure for hidden Markov models. For this purpose... more

Bookmark
Download
- by Christos Dimitrakakis
- •
- Ensemble Methods

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The first employs a Bayesian framework, where optimality improves with increased computational time.... more

Ensemble algorithms can improve the performance of a given learning algorithm through the combination of multiple base classifiers into an ensemble. In this paper, the idea of using an adaptive policy for training and combining the base... more

Bookmark
Download
- by Christos Dimitrakakis
- •
- Statistics

A number of authentication protocols have been proposed recently, where at least some part of the authentication is performed during a phase, lasting n rounds, with no error correction. This requires assigning an acceptable threshold for... more

Bookmark
Download
- by Christos Dimitrakakis
- •
- 2
  Information Theory, Cryptography

Intrusion Detection is an invaluable part of computer networks defense. An important consideration is the fact that raising false alarms carries a significantly lower cost than not detecting attacks. For this reason, we examine how... more

We present a Bayesian variable order Markov model that shares many similarities with predictive state representations. The resulting models are compact and much easier to specify and learn than classical predictive state representations.... more

Bookmark
Download
- by Christos Dimitrakakis and +1
  Bas Terwijn
- •
- Reinforcement Learning

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most problems of interest, the optimal solution involves... more

We present a general method for maintaining estimates of the distribution of parameters in arbitrary models. This is then applied to the estimation of probability distributions over actions in value-based reinforcement learning. While... more

The vast majority of RFID authentication protocols assume the proximity between readers and tags due to the limited range of the radio channel. However, in real scenarios an intruder can be located between the prover (tag) and the... more

Bookmark
Download
- by Christos Dimitrakakis and +1
  Julio Hernández-Castro
- •
- 3
  Information Security, Cryptography, RFID Security and Privacy

Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervised learning problem, have been proposed recently. Finding good policies... more

Bookmark
Download
- by Christos Dimitrakakis and +1
  Michail G Lagoudakis
- •
- 2
  Reinforcement Learning, Complexity Theory

SEAS

Log In