Seguir
Shie Mannor
Shie Mannor
Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research
Dirección de correo verificada de technion.ac.il - Página principal
Título
Citado por
Citado por
Año
A Tutorial on the Cross-Entropy Method
B DE, P KROESE, S MANNOR
Annals of Operations Research 134 (1), 19-67, 2005
3183*2005
The kernel recursive least-squares algorithm
Y Engel, S Mannor, R Meir
IEEE Transactions on signal processing 52 (8), 2275-2285, 2004
12132004
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems.
E Even-Dar, S Mannor, Y Mansour, S Mahadevan
Journal of machine learning research 7 (6), 2006
7212006
Robustness and Regularization of Support Vector Machines.
H Xu, C Caramanis, S Mannor
Journal of machine learning research 10 (7), 2009
5692009
Bayesian reinforcement learning: A survey
M Ghavamzadeh, S Mannor, J Pineau, A Tamar
Foundations and Trends® in Machine Learning 8 (5-6), 359-483, 2015
5242015
PAC bounds for multi-armed bandit and Markov decision processes
E Even-Dar, S Mannor, Y Mansour
Computational Learning Theory: 15th Annual Conference on Computational …, 2002
5082002
Reinforcement learning with Gaussian processes
Y Engel, S Mannor, R Meir
ICML, 201-208, 2005
4962005
Robustness and generalization
H Xu, S Mannor
Machine learning 86, 391-423, 2012
4862012
Reward constrained policy optimization
C Tessler, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1805.11074, 2018
4842018
The sample complexity of exploration in the multi-armed bandit problem
S Mannor, JN Tsitsiklis
Journal of Machine Learning Research 5 (Jun), 623-648, 2004
4692004
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
4272017
Robust regression and lasso
H Xu, C Caramanis, S Mannor
Advances in neural information processing systems 21, 2008
3662008
Q-cut—dynamic discovery of sub-goals in reinforcement learning
I Menache, S Mannor, N Shimkin
Machine Learning: ECML 2002: 13th European Conference on Machine Learning …, 2002
3652002
Policy gradients with variance related risk criteria
A Tamar, D Di Castro, S Mannor
Proceedings of the twenty-ninth international conference on machine learning …, 2012
3582012
Risk-sensitive and robust decision-making: a cvar optimization approach
Y Chow, A Tamar, S Mannor, M Pavone
Advances in neural information processing systems 28, 2015
3462015
The cross entropy method for classification
S Mannor, D Peleg, R Rubinstein
Proceedings of the 22nd international conference on Machine learning, 561-568, 2005
3412005
Dynamic abstraction in reinforcement learning via clustering
S Mannor, I Menache, A Hoze, U Klein
Proceedings of the twenty-first international conference on Machine learning, 71, 2004
3252004
Percentile optimization for Markov decision processes with parameter uncertainty
E Delage, S Mannor
Operations research 58 (1), 203-213, 2010
324*2010
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem, S Mannor
International conference on machine learning, 1899-1908, 2016
3192016
Bayes meets Bellman: The Gaussian process approach to temporal difference learning
Y Engel, S Mannor, R Meir
Proceedings of the 20th International Conference on Machine Learning (ICML …, 2003
3022003
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20