-
1Academic Journal
المؤلفون: Wittwer, Felix, Sauter, Nicholas K, Mendez, Derek, Poon, Billy K, Brewster, Aaron S, Holton, James M, Wall, Michael E, Hart, William E, Bard, Deborah J, Blaschke, Johannes P
المصدر: Concurrency and Computation Practice and Experience. 36(5)
مصطلحات موضوعية: Information and Computing Sciences, Human-Centred Computing, AMD GPU, cross compilation, code optimization, Kokkos, Nvidia GPU, Artificial Intelligence and Image Processing, Computer Software, Distributed Computing, Information and computing sciences
وصف الملف: application/pdf
-
2Academic Journal
المؤلفون: Wang, Bei, Ethier, Stephane, Tang, William, Ibrahim, Khaled Z, Madduri, Kamesh, Williams, Samuel, Oliker, Leonid
المصدر: The International Journal of High Performance Computing Applications. 33(1)
مصطلحات موضوعية: Information and Computing Sciences, Applied Computing, Affordable and Clean Energy, Particle-in-cell methods, Vlasov-Poisson equations, NVIDIA GPU, Intel Xeon Phi, heterogeneous systems, fusion plasma simulations, extreme scale, cs.DC, physics.comp-ph, physics.plasm-ph, Distributed Computing, Applied computing, Distributed computing and systems software
وصف الملف: application/pdf
URL الوصول: https://escholarship.org/uc/item/7m80x9nm
-
3Academic Journal
المؤلفون: Lopez, Florent, Mary, Théo
المساهمون: Innovative Computing Laboratory Knoxville (ICL), The University of Tennessee Knoxville, Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)
المصدر: ISSN: 1094-3420 ; International Journal of High Performance Computing Applications ; https://hal.science/hal-02937325 ; International Journal of High Performance Computing Applications, In press.
مصطلحات موضوعية: numerical linear algebra, mixed precision algorithms, high performance computing, LU factorization, tensor cores, NVIDIA GPU, rounding error analysis, [INFO]Computer Science [cs], [MATH]Mathematics [math]
Relation: hal-02937325; https://hal.science/hal-02937325; https://hal.science/hal-02937325v2/document; https://hal.science/hal-02937325v2/file/paper.pdf
-
4Academic Journal
المساهمون: Lyakh, Dmitry [Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)]
المصدر: Computer Physics Communications; 189
وصف الملف: Medium: ED; Size: p. 84-91
-
5Academic Journal
المصدر: IEEE Access, Vol 8, Pp 7861-7876 (2020)
مصطلحات موضوعية: Heterogeneous platforms, multicore, Nvidia GPU, Intel Xeon Phi, workload partitioning, performance, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
وصف الملف: electronic resource
-
6Academic Journal
المؤلفون: Dos Santos F. F., Rech P.
المساهمون: Dos Santos, F. F., Rech, P.
مصطلحات موضوعية: deep neural network, fault tolerance, NVIDIA GPU, radiation-induced fault, reliability
Relation: volume:2024; firstpage:1; lastpage:1; numberofpages:1; journal:IEEE DESIGN & TEST; https://hdl.handle.net/11572/440314
-
7Academic Journal
المصدر: Applied Medical Informatics, Vol 43, Iss Suppl. S1, Pp 28-28 (2021)
مصطلحات موضوعية: machine learning, densenet, real time processing, nvidia gpu, Computer applications to medicine. Medical informatics, R858-859.7
-
8
المؤلفون: Krishnasamy, Ezhilmathi, Sourouri, Mohammed, Cai, Xing
المصدر: INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE Procedia Computer Science. :1494-1503
مصطلحات موضوعية: NVIDIA GPU, CUDA programming, OpenMP, 3D sweeping, anisotropic front propagation
وصف الملف: electronic
-
9Academic Journal
المساهمون: Department of Mathematics Manchester (School of Mathematics), University of Manchester Manchester, Innovative Computing Laboratory Knoxville (ICL), The University of Tennessee Knoxville, Centre National de la Recherche Scientifique (CNRS), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)
المصدر: ISSN: 1064-8275 ; SIAM Journal on Scientific Computing ; https://hal.science/hal-02491076 ; SIAM Journal on Scientific Computing, 2020, 42 (3), pp.C124-C141. ⟨10.1137/19M1289546⟩.
مصطلحات موضوعية: NVIDIA GPU, matrix multiplication, rounding error analysis, floating-point arithmetic, fused multiply-add, tensor cores, LU factorization, [INFO]Computer Science [cs], [INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], [MATH]Mathematics [math], [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]
Relation: hal-02491076; https://hal.science/hal-02491076; https://hal.science/hal-02491076v2/document; https://hal.science/hal-02491076v2/file/BlockFMA.pdf
-
10Academic Journal
المساهمون: Russian Foundation for Basic Research (grant No. 20-07-00140), Ministry of Science and Higher Education of the Russian Federation (government order FENU-2020-0022), Российский фонд фундаментальных исследований (грант № 20-07-00140), Министерство образования и науки РФ (государственное задание FENU-2020-0022)
المصدر: Computational Mathematics and Software Engineering; Том 9, № 3 (2020); 17-34 ; Вычислительная математика и информатика; Том 9, № 3 (2020); 17-34 ; 2410-7034 ; 2305-9052 ; 10.14529/cmse2003
مصطلحات موضوعية: time series, motif discovery, parallel algorithm, NVIDIA GPU, OpenACC, временной ряд, поиск лейтмотивов, параллельный алгоритм
وصف الملف: application/pdf
-
11Academic Journal
المؤلفون: Afanasyev, Ilya
المساهمون: RFBR
المصدر: Lobachevskii Journal of Mathematics; Том 41, № 8 (2020): Special issue “Supercomputing Applications, Algorithms and Software Tools”. ; 1818-9962 ; 1995-0802
مصطلحات موضوعية: Graph Algorithms, NEC SX-Aurora TSUBASA, Connected Compo- nents, NVIDIA GPU, HPC, Large-Scale Graph Processing
Time: 12345, 54321
-
12Report
المؤلفون: Lopez, Florent, Mary, Théo
المساهمون: Innovative Computing Laboratory Knoxville (ICL), The University of Tennessee Knoxville, Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)
المصدر: https://hal.archives-ouvertes.fr/hal-02937325 ; 2020.
مصطلحات موضوعية: numerical linear algebra, mixed precision algorithms, high performance computing, LU factorization, tensor cores, NVIDIA GPU, rounding error analysis, [INFO]Computer Science [cs], [MATH]Mathematics [math]
Relation: hal-02937325; https://hal.archives-ouvertes.fr/hal-02937325; https://hal.archives-ouvertes.fr/hal-02937325/document; https://hal.archives-ouvertes.fr/hal-02937325/file/paper.pdf
-
13
المؤلفون: Chien, Wei Der, Nylund, Jonas, Bengtsson, Gabriel, Peng, I. B., Podobas, Artur, Markidis, Stefano
المصدر: Proceedings - Symposium on Computer Architecture and High Performance Computing. :149-156
مصطلحات موضوعية: CUDA, Implicit Particle-in-Cell, Multi-GPU, Nvidia GPU, Application programming interfaces (API), Domain decomposition methods, Graphics processing unit, Plasma simulation, Program processors, Supercomputers, High power efficiencies, Improve performance, Large scale simulations, Multi-GPU Systems, Overlapping communication and computations, Particle in cell codes, Particle-in-cell code, Performance analysis, Computer hardware
وصف الملف: print
-
14Dissertation/ Thesis
المؤلفون: Alexandr Krastenov
المساهمون: Oberhuber Tomáš, Klinkovský Jakub
مصطلحات موضوعية: CUDA C++, husté matice, doba provádění, transpozice na místě, násobení, NVIDIA GPU knihovny, optimalizační techniky, transpozice mimo místo, paralelní algoritmy, paralelizační strategie, vědecké výpočty, knihovna TNL, dense matrices, execution time, in-place transposition, multiplication, NVIDIA GPU libraries, optimization techniques, out-of-place transposition, parallel algorithms, parallelization strategies, scientific computations, TNL library
وصف الملف: application/pdf
Relation: KOS-1241045124805; http://hdl.handle.net/10467/116917
الاتاحة: http://hdl.handle.net/10467/116917
-
15Academic Journal
المؤلفون: Long, Rogelio
المصدر: Open Access Theses & Dissertations
مصطلحات موضوعية: Hardware Evaluation, NVIDIA GPU, Sparse Computation, Computer Sciences
وصف الملف: application/pdf
Relation: https://scholarworks.utep.edu/open_etd/3289; https://scholarworks.utep.edu/cgi/viewcontent.cgi?article=4288&context=open_etd
-
16
المؤلفون: Nicholas J. Higham, Srikara Pranesh, Florent Lopez, Pierre Blanchard, Théo Mary
المساهمون: Department of Mathematics [Manchester] (School of Mathematics), University of Manchester [Manchester], Innovative Computing Laboratory [Knoxville] (ICL), The University of Tennessee [Knoxville], Centre National de la Recherche Scientifique (CNRS), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)
المصدر: SIAM Journal on Scientific Computing
SIAM Journal on Scientific Computing, Society for Industrial and Applied Mathematics, 2020, 42 (3), pp.C124-C141. ⟨10.1137/19M1289546⟩مصطلحات موضوعية: fused multiply-add, Floating point, floating-point arithmetic, Carry (arithmetic), matrix multiplication, 010103 numerical & computational mathematics, 01 natural sciences, law.invention, Computational science, Matrix (mathematics), law, Tensor (intrinsic definition), [INFO]Computer Science [cs], 0101 mathematics, [MATH]Mathematics [math], NVIDIA GPU, Mathematics, Block (data storage), Multiply–accumulate operation, Applied Mathematics, [INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA], LU decomposition, Matrix multiplication, rounding error analysis, Computational Mathematics, tensor cores, LU factorization, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]
-
17Conference
المؤلفون: Dos Santos F. F., Navaux P., Carro L., Rech P.
المساهمون: Dos Santos, F. F., Navaux, P., Carro, L., Rech, P.
مصطلحات موضوعية: DNN, NVIDIA GPU, Reliability, Soft errors
Relation: info:eu-repo/semantics/altIdentifier/isbn/978-1-7281-1173-5; ispartofbook:Proceedings of the European Test Workshop; 2019 IEEE European Test Symposium, ETS 2019; volume:2019-; firstpage:1; lastpage:6; numberofpages:6; http://hdl.handle.net/11572/346653; info:eu-repo/semantics/altIdentifier/scopus/2-s2.0-85071154346
-
18
المؤلفون: Zymbler, M.L., Kraeva, Ya.A.
مصطلحات موضوعية: УДК 004.272.25, motif discovery, OpenACC, parallel algorithm, параллельный алгоритм, УДК 004.032.24, поиск лейтмотивов, УДК 004.421, time series, NVIDIA GPU, временной ряд
وصف الملف: application/pdf
-
19
مصطلحات موضوعية: Series (mathematics), Computer science, параллельный алгоритм, Parallel algorithm, OpenMP, Data structure, OpenAcc, parallel algorithm, Intel Xeon Phi, Precomputation, Subsequence, Vectorization (mathematics), Scalability, vectorization, векторизация вычислений, поиск диссонансов, discord discovery, time series, NVIDIA GPU, Algorithm, Xeon Phi, временной ряд
-
20
المؤلفون: Diptarup Saha, Darshak Thakore, Karan Darji, Narendra M. Patel
المصدر: Procedia Computer Science. 79:516-524
مصطلحات موضوعية: Computer science, Low-pass filter, Graphics processing unit, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Image processing, 02 engineering and technology, Parallel computing, Grayscale, Edge detection, Convolution, Parallel Programming, CUDA, Digital image processing, 0202 electrical engineering, electronic engineering, information engineering, NVIDIA GPU, General Environmental Science, ComputingMethodologies_COMPUTERGRAPHICS, Pixel, Image Processing, 020207 software engineering, Sobel operator, Computer Science::Graphics, Computer Science::Computer Vision and Pattern Recognition, Computer Science::Mathematical Software, General Earth and Planetary Sciences, RGB color model, 020201 artificial intelligence & image processing, General-purpose computing on graphics processing units, High-pass filter, Algorithm