-
1Academic Journal
المصدر: Fuzzy Information and Engineering, Vol 15, Iss 3, Pp 274-290 (2023)
مصطلحات موضوعية: deterministic markov decision process, discounted criterion, fuzzy reward, fuzzy cost, trapezoidal fuzzy number, Engineering (General). Civil engineering (General), TA1-2040, Mathematics, QA1-939
وصف الملف: electronic resource
-
2Academic Journal
المؤلفون: Di Zhao, Zhenyu Ding, Wenjie Li, Sen Zhao, Yuhong Du
المصدر: Applied Sciences, Vol 14, Iss 2, p 851 (2024)
مصطلحات موضوعية: fuzzy reward, end-to-end network, trajectory planning, forward kinematics, deep reinforcement learning, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Relation: https://www.mdpi.com/2076-3417/14/2/851; https://doaj.org/toc/2076-3417; https://doaj.org/article/c1617326640041e6a1deb1ca87584faa
-
3Academic Journal
المؤلفون: Ziyue Wang, Fengming Li, Yu Men, Tianyu Fu, Xuting Yang, Rui Song
المصدر: Applied Sciences; Volume 12; Issue 6; Pages: 3181
مصطلحات موضوعية: robot assembly, deep reinforcement learning, fuzzy reward, compliant control
جغرافية الموضوع: agris
وصف الملف: application/pdf
Relation: https://dx.doi.org/10.3390/app12063181
الاتاحة: https://doi.org/10.3390/app12063181
-
4Academic Journal
المؤلفون: Chen, Ming, Lam, Hak Keung, Shi, Qian, Xiao, Bo
المصدر: Chen , M , Lam , H K , Shi , Q & Xiao , B 2020 , ' Reinforcement Learning-Based Control of Nonlinear Systems Using Lyapunov Stability Concept and Fuzzy Reward Scheme ' , IEEE Transactions on Circuits and Systems II: Express Briefs , vol. 67 , no. 10 , 8871158 , pp. 2059-2063 . https://doi.org/10.1109/TCSII.2019.2947682
مصطلحات موضوعية: adjustable policy learning rate (APLR), cart-pole inverted pendulum, fuzzy reward system, Lyapunov reward system, Proximal policy optimization (PPO)
وصف الملف: application/pdf
-
5Academic Journal
المؤلفون: Carrero-Vera, Karla, Cruz-Suárez, Hugo, Montes-de-Oca, Raúl
مصطلحات موضوعية: keyword:Markov decision process, keyword:total reward, keyword:fuzzy reward, keyword:trapezoidal fuzzy number, keyword:optimal stopping problem, keyword:gambling model, msc:90C40, msc:93C40
وصف الملف: application/pdf
Relation: mr:MR4467492; zbl:Zbl 07584152; reference:[1] Abbasbandy, S., Hajjari, T.: A new approach for ranking of trapezoidal fuzzy numbers.Comput. Math. Appl. 57 (2009), 413-419. MR 2488614; reference:[2] Ban, A. I.: Triangular and parametric approximations of fuzzy numbers inadvertences and corrections.Fuzzy Sets and Systems 160 (2009), 3048-3058. MR 2567092; reference:[3] Bartle, R. G.: The Elements of Integration.Wiley, New York 1995. MR 0200398; reference:[4] Bellman, R. E., Zadeh, L. A.: Decision-making in a fuzzy enviroment.Management Sci. 17 (1970), 141-164. MR 0301613; reference:[5] Cavazos-Cadena, R., Montes-de-Oca, R.: Existence of optimal stationary policies in finite dynamic programs with nonnegative rewards.Probab. Engrg. Inform. Sci. 15 (2001), 557-564. MR 1852975; reference:[6] Chen, S. H.: Operations of fuzzy numbers with step form membership function using function principle.Information Sci. 108 (1998), 149-155. Zbl 0922.04007, MR 1632503; reference:[7] Diamond, P., Kloeden, P.: Metric Spaces of Fuzzy Sets: Theory and Applications.World Scientific, Singapore 1994. MR 1337027; reference:[8] Driankov, D., Hellendoorn, H., Reinfrank, M.: An Introduction to Fuzzy Control.Springer Science and Business Media, New York 2013. MR 3010569; reference:[9] Efendi, R., Arbaiy, N., Deris, M. M.: A new procedure in stock market forecasting based on fuzzy random auto-regression time series model.Information Sci. 441 (2018), 113-132. MR 3771167; reference:[10] Fakoor, M., Kosari, A., Jafarzadeh, M.: Humanoid robot path planning with fuzzy Markov decision processes.J. Appl. Res. Tech. 14 (2016), 300-310.; reference:[11] Furukawa, N.: Parametric orders on fuzzy numbers and their roles in fuzzy optimization problems.Optimization 40 (1997), 171-192. MR 1620380; reference:[12] Kurano, M., Yasuda, M., Nakagami, J., Yoshida, Y.: Markov decision processes with fuzzy rewards.In: Proc. Int. Conf. on Nonlinear Analysis, Hirosaki 2002, pp. 221-232. MR 1986973; reference:[13] López-Díaz, M., Ralescu, D. A.: Tools for fuzzy random variables: embeddings and measurabilities.Comput. Statist. Data Anal. 51 (2006), 109-114. MR 2297590; reference:[14] Pedrycz, W.: Why triangular membership functions?.Fuzzy Sets and Systems 64 (1994), 21-30. MR 1281283; reference:[15] Puri, M. L., Ralescu, D. A.: Fuzzy random variable.J. Math. Anal. Appl. 114 (1986), 402-422. MR 0833596; reference:[16] Puterman, M. L.: Markov Decision Processes: Discrete Stochastic Dynamic. First edition.Wiley-Interscience, California 2005. MR 1270015; reference:[17] Rezvani, S., Molani, M.: Representation of trapezoidal fuzzy numbers with shape function.Ann. Fuzzy Math. Inform. 8 (2014), 89-112. MR 3214770; reference:[18] Ross, S.: Dynamic programming and gambling models.Adv. Appl. Probab. 6 (1974), 593-606. MR 0347381; reference:[19] Ross, S.: Introduction to Stochastic Dynamic Programming.Academic Press, New York 1983. MR 0749232; reference:[20] Semmouri, A., Jourhmane, M., Belhallaj, Z.: Discounted Markov decision processes with fuzzy costs.Ann. Oper. Res. 295 (2020), 769-786. MR 4181708; reference:[21] Syropoulos, A., Grammenos, T.: A Modern Introduction to Fuzzy Mathematics.Wiley, New Jersey 2020.; reference:[22] Zadeh, L.: Fuzzy sets.Inform. Control 8 (1965), 338-353. Zbl 0942.00007, MR 0219427; reference:[23] Zeng, W., Li, H.: Weighted triangular approximation of fuzzy numbers.Int. J. Approx. Reason. 46 (2007), 137-150. MR 2362230
-
6Academic Journal
المؤلفون: Y. Yoshida, Yuji Yoshida
المساهمون: The Pennsylvania State University CiteSeerX Archives
مصطلحات موضوعية: Optimal stopping, fuzzy reward, dynai~lic fuzzy system, fuzzy expected
وصف الملف: application/pdf
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.666.6875; http://catalog.lib.kyushu-u.ac.jp/handle/2324/3192/rifis-tr-102.pdf
-
7
المؤلفون: Yuji Yoshida
المصدر: Computers & Mathematics with Applications. 32:17-28
مصطلحات موضوعية: Fuzzy expected value, Computer Science::Computer Science and Game Theory, Mathematical optimization, Fuzzy classification, Fuzzy reward, Mathematics::General Mathematics, Dynamic fuzzy system, Type-2 fuzzy sets and systems, Defuzzification, Fuzzy logic, Computational Mathematics, ComputingMethodologies_PATTERNRECOGNITION, Computational Theory and Mathematics, Fuzzy transportation, Control theory, Modelling and Simulation, Modeling and Simulation, Optimal stopping, Fuzzy number, Fuzzy set operations, Fuzzy associative matrix, ComputingMethodologies_GENERAL, Fuzzy dynamic programming, Mathematics
-
8
المؤلفون: Kurano, Masami, Yasuda, Masami, Nakagami, Jun-ichi, Yoshida, Yuji
المصدر: 数理解析研究所講究録. 899:53-62
مصطلحات موضوعية: time average fuzzy reward, dynamic fuzzy system, contractive properties, fuzzy relational equation
وصف الملف: application/pdf
-
9Academic Journal
المؤلفون: M. Kurano
المساهمون: The Pennsylvania State University CiteSeerX Archives
مصطلحات موضوعية: dynamic fuzzy system, time average fuzzy reward, contractive properties, fuzzy relational equation
وصف الملف: application/postscript
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.139.7704; http://www.math.s.chiba-u.ac.jp/~yasuda/accept/avg911.ps
-
10
المؤلفون: Yoshida, Yuji
المصدر: RIFIS Technical Report. 102
مصطلحات موضوعية: Computer Science::Computer Science and Game Theory, ComputingMethodologies_PATTERNRECOGNITION, Mathematics::General Mathematics, Optimal stopping, ComputingMethodologies_GENERAL, dynamic fuzzy system, fuzzy expected value, fuzzy reward
وصف الملف: application/pdf
-
11
المؤلفون: Yuji Yoshida, Masami Kurano, Jun-ichi Nakagami, Masami Yasuda
المصدر: Computers & Mathematics with Applications. (11-12):77-86
مصطلحات موضوعية: Mathematical optimization, Fuzzy classification, Mathematics::General Mathematics, Dynamic fuzzy system, Fuzzy subalgebra, Type-2 fuzzy sets and systems, Fuzzy logic, Defuzzification, Contractive properties, Computational Mathematics, Computational Theory and Mathematics, Modeling and Simulation, Modelling and Simulation, Fuzzy relational equation, Fuzzy mathematics, Fuzzy set operations, Fuzzy number, Time average fuzzy reward, ComputingMethodologies_GENERAL, Mathematics
-
12
المؤلفون: Carrero-Vera, Karla, Cruz-Suárez, Hugo, Montes-de-Oca, Raúl
مصطلحات موضوعية: Markov decision process, total reward, fuzzy reward, trapezoidal fuzzy number, optimal stopping problem, gambling model
جغرافية الموضوع: 180-199
وصف الملف: média; svazek