Shie Mannor
Professor of Electrical Engineering @ Technion & Researcher @ Nvidia
Verified email at technion.ac.il - Homepage
Title
Cited by
Year
A Tutorial on the Cross-Entropy Method
PT de Boer, DP Kroese, S Mannor, RY Rubinstein
Annals of Operations Research 134 (1), 19-67, 2005
Cited by: 3316*; Year: 2005
The kernel recursive least-squares algorithm
Y Engel, S Mannor, R Meir
IEEE Transactions on signal processing 52 (8), 2275-2285, 2004
Cited by: 1243; Year: 2004
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
E Even-Dar, S Mannor, Y Mansour, S Mahadevan
Journal of machine learning research 7 (6), 2006
Cited by: 749; Year: 2006
Robustness and Regularization of Support Vector Machines
H Xu, C Caramanis, S Mannor
Journal of machine learning research 10 (7), 2009
Cited by: 588; Year: 2009
Bayesian reinforcement learning: A survey
M Ghavamzadeh, S Mannor, J Pineau, A Tamar
Foundations and Trends® in Machine Learning 8 (5-6), 359-483, 2015
Cited by: 547; Year: 2015
Reward constrained policy optimization
C Tessler, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1805.11074, 2018
Cited by: 543; Year: 2018
PAC bounds for multi-armed bandit and Markov decision processes
E Even-Dar, S Mannor, Y Mansour
Computational Learning Theory: 15th Annual Conference on Computational …, 2002
Cited by: 527; Year: 2002
Robustness and generalization
H Xu, S Mannor
Machine learning 86, 391-423, 2012
Cited by: 500; Year: 2012
Reinforcement learning with Gaussian processes
Y Engel, S Mannor, R Meir
ICML, 201-208, 2005
Cited by: 497; Year: 2005
The sample complexity of exploration in the multi-armed bandit problem
S Mannor, JN Tsitsiklis
Journal of Machine Learning Research 5 (Jun), 623-648, 2004
Cited by: 481; Year: 2004
A deep hierarchical approach to lifelong learning in Minecraft
C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
Cited by: 447; Year: 2017
Robust regression and Lasso
H Xu, C Caramanis, S Mannor
Advances in neural information processing systems 21, 2008
Cited by: 372; Year: 2008
Risk-sensitive and robust decision-making: a CVaR optimization approach
Y Chow, A Tamar, S Mannor, M Pavone
Advances in neural information processing systems 28, 2015
Cited by: 365; Year: 2015
Q-cut—dynamic discovery of sub-goals in reinforcement learning
I Menache, S Mannor, N Shimkin
Machine Learning: ECML 2002: 13th European Conference on Machine Learning …, 2002
Cited by: 364; Year: 2002
The cross entropy method for classification
S Mannor, D Peleg, R Rubinstein
Proceedings of the 22nd international conference on Machine learning, 561-568, 2005
Cited by: 360; Year: 2005
Policy gradients with variance related risk criteria
A Tamar, D Di Castro, S Mannor
Proceedings of the twenty-ninth international conference on machine learning …, 2012
Cited by: 335; Year: 2012
Graying the black box: Understanding DQNs
T Zahavy, N Ben-Zrihem, S Mannor
International conference on machine learning, 1899-1908, 2016
Cited by: 332; Year: 2016
Percentile optimization for Markov decision processes with parameter uncertainty
E Delage, S Mannor
Operations research 58 (1), 203-213, 2010
Cited by: 332*; Year: 2010
Dynamic abstraction in reinforcement learning via clustering
S Mannor, I Menache, A Hoze, U Klein
Proceedings of the twenty-first international conference on Machine learning, 71, 2004
Cited by: 329; Year: 2004
Bayes meets Bellman: The Gaussian process approach to temporal difference learning
Y Engel, S Mannor, R Meir
Proceedings of the 20th International Conference on Machine Learning (ICML …, 2003
Cited by: 306; Year: 2003