-
1Conference
Authors: Koska, Oceane, Baboulin, Marc, Gazda, Arnaud
Source: 2025 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) IPDPSW Parallel and Distributed Processing Symposium Workshops (IPDPSW), 2025 IEEE International. :501-508 Jun, 2025
Relation: 2025 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
-
2Academic Journal
Source: IEEE Transactions on Parallel and Distributed Systems IEEE Trans. Parallel Distrib. Syst. Parallel and Distributed Systems, IEEE Transactions on. 36(3):422-436 Mar, 2025
Linked Full Text -
3Academic Journal
Authors: Buttari, Alfredo, Mary, Theo, Pacteau, André
Contributors: Buttari, Alfredo
Source: SIAM Journal on Scientific Computing. 47:B382-B401
Subject Terms: mixed precision, [INFO.INFO-MS] Computer Science [cs]/Mathematical Software [cs.MS], Roundoff error, [INFO.INFO-NA] Computer Science [cs]/Numerical Analysis [cs.NA], Mixed-precision algorithms, low-rank approximations, QR factorization, Numerical methods for low-rank matrix approximation, matrix compression, [INFO.INFO-AO] Computer Science [cs]/Computer Arithmetic
File Description: application/xml; application/pdf
-
4Conference
Authors: Tsai, Yaohung M., Luszczek, Piotr, Dongarra, Jack
Source: 2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH) SCALAH Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH), 2022 IEEE/ACM Workshop on. :43-50 Nov, 2022
Relation: 2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH)
-
5Academic Journal
Authors: Baboulin, Marc, Kaya, Oguz, Mary, Theo, Robeyns, Matthieu
Contributors: Robeyns, Matthieu
Subject Terms: Matrix and tensor computations, Iterative refinement, Mixed precision algorithms, Tensor decompositions, [MATH] Mathematics [math], Low-rank approximations, Randomized SVD, [INFO] Computer Science [cs]
File Description: application/pdf
Access URL: https://hal.science/hal-04115337v2
-
6eBook
Authors: Baboulin, MarcAff13, Donfack, SimpliceAff14, Aff17, Kaya, OguzAff15, Mary, TheoAff16, Robeyns, MatthieuAff15
Contributors: Goos, Gerhard, Series EditorAff1, Hartmanis, Juris, Founding EditorAff2, Bertino, Elisa, Editorial Board MemberAff3, Gao, Wen, Editorial Board MemberAff4, Steffen, Bernhard, Editorial Board MemberAff5, Yung, Moti, Editorial Board MemberAff6, Carretero, Jesus, editorAff7, Shende, Sameer, editorAff8, Garcia-Blas, Javier, editorAff9, Brandic, Ivona, editorAff10, Olcoz, Katzalin, editorAff11, Schreiber, Martin, editorAff12
Source: Euro-Par 2024: Parallel Processing : 30th European Conference on Parallel and Distributed Processing, Madrid, Spain, August 26–30, 2024, Proceedings, Part III. 14803:31-44
-
7Conference
Authors: Doucet, Nicolas, Ltaief, Hatem, Gratadour, Damien, Keyes, David
Source: 2019 IEEE/ACM 9th Workshop on Irregular Applications: Architectures and Algorithms (IA3) Irregular Applications: Architectures and Algorithms (IA3), 2019 IEEE/ACM 9th Workshop on. :31-38 Nov, 2019
Relation: 2019 IEEE/ACM 9th Workshop on Irregular Applications: Architectures and Algorithms (IA3)
-
8Academic Journal
Authors: Lopez, Florent, Mary, Theo
Contributors: Mary, Theo
Source: The International Journal of High Performance Computing Applications. 37:165-179
Subject Terms: high performance computing, tensor cores, LU factorization, numerical linear algebra, mixed precision algorithms, [MATH] Mathematics [math], [INFO] Computer Science [cs], 0101 mathematics, NVIDIA GPU, 01 natural sciences, rounding error analysis
File Description: application/pdf
Linked Full TextAccess URL: https://hal.science/hal-02937325v2
https://hal.science/hal-02937325v2/document
https://doi.org/10.1177/10943420221136848 -
9Academic Journal
Authors: Amestoy, Patrick, Boiteau, Olivier, Buttari, Alfredo, Gerest, Matthieu, Jézéquel, Fabienne, L'Excellent, Jean-Yves, Mary, Théo
Contributors: Gerest, Matthieu, Mumps Technologies Lyon, Algorithmes Parallèles et Optimisation (IRIT-APO), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), EDF R&D (EDF R&D), EDF (EDF), Centre National de la Recherche Scientifique (CNRS), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Université Paris-Panthéon-Assas, CIFRE PhDthesis of Matthieu Gerest funded by EDF
Source: IMA Journal of Numerical Analysis. 43:2198-2227
Subject Terms: floating-point arithmetic, multiprecision algorithms, [INFO.INFO-AO]Computer Science [cs]/Computer Arithmetic, block low-rank matrices, linear systems, singular value decomposition, [INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA], [MATH.MATH-NA] Mathematics [math]/Numerical Analysis [math.NA], 01 natural sciences, rounding error analysis, [INFO.INFO-NA] Computer Science [cs]/Numerical Analysis [cs.NA], low-rank approximations, LU factorization, numerical linear algebra, data sparse matrices, mixed precision algorithms, [INFO.INFO-AO] Computer Science [cs]/Computer Arithmetic, 0101 mathematics, [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]
File Description: application/pdf
Linked Full TextAccess URL: https://hal.science/hal-03251738v3
https://doi.org/10.1093/imanum/drac037
https://hal.science/hal-03251738v3/document -
10Conference
Authors: Masliah, Ian, Baboulin, Marc, Falcou, Joel
Source: 2015 IEEE Trustcom/BigDataSE/ISPA Trustcom/BigDataSE/ISPA, 2015 IEEE. 3:69-76 Aug, 2015
Relation: 2015 IEEE Trustcom/BigDataSE/ISPA
-
11Conference
Authors: Dongarra, Jack, Ltaief, Hatem, Luszczek, Piotr, Weaver, Vincent M.
Source: 2012 Second International Conference on Cloud and Green Computing Cloud and Green Computing (CGC), 2012 Second International Conference on. :274-281 Nov, 2012
Relation: 2012 International Conference on Cloud and Green Computing (CGC)
-
12Report
Authors: Buttari, Alfredo, Mary, Théo, Pacteau, André
Contributors: Algorithmes Parallèles et Optimisation (IRIT-APO), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), ANR-22-EXNU-0003,Exa-Soft,High Performance Computing software and tools(2022), ANR-23-CE46-0005,MixHPC,Algorithmes en précision mixte pour le calcul haute performance(2023)
Source: https://hal.science/hal-04490215 ; 2024.
Subject Terms: Mixed-precision algorithms, QR factorization, low-rank approximations, [INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA], [INFO.INFO-AO]Computer Science [cs]/Computer Arithmetic, [INFO.INFO-MS]Computer Science [cs]/Mathematical Software [cs.MS]
-
13Report
Authors: Baboulin, Marc, Kaya, Oguz, Mary, Theo, Robeyns, Matthieu
Contributors: Systèmes Parallèles - LISN (ParSys), Algorithmes, Apprentissage et Calcul (AAC), Laboratoire Interdisciplinaire des Sciences du Numérique (LISN), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Interdisciplinaire des Sciences du Numérique (LISN), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), DIM RFSI RC-TENSOR No 2021-05
Source: https://inria.hal.science/hal-04115337 ; 2023.
Subject Terms: Matrix and tensor computations, Low-rank approximations, Mixed precision algorithms, Iterative refinement, Randomized SVD, Tensor decompositions, 65F55, 65G50, 65Y20, 15A69, [INFO]Computer Science [cs], [MATH]Mathematics [math]
-
14Academic Journal
Authors: Lopez, Florent, Mary, Théo
Contributors: Innovative Computing Laboratory Knoxville (ICL), The University of Tennessee Knoxville, Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)
Source: ISSN: 1094-3420 ; International Journal of High Performance Computing Applications ; https://hal.science/hal-02937325 ; International Journal of High Performance Computing Applications, In press.
Subject Terms: numerical linear algebra, mixed precision algorithms, high performance computing, LU factorization, tensor cores, NVIDIA GPU, rounding error analysis, [INFO]Computer Science [cs], [MATH]Mathematics [math]
Relation: hal-02937325; https://hal.science/hal-02937325; https://hal.science/hal-02937325v2/document; https://hal.science/hal-02937325v2/file/paper.pdf
-
15Report
Authors: Amestoy, Patrick, Boiteau, Olivier, Buttari, Alfredo, Gerest, Matthieu, Jézéquel, Fabienne, L'Excellent, Jean-Yves, Mary, Théo
Contributors: Mumps Technologies, Algorithmes Parallèles et Optimisation (IRIT-APO), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, EDF R&D (EDF R&D), EDF (EDF), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Laboratoire de l'Informatique du Parallélisme (LIP), Centre National de la Recherche Scientifique (CNRS)-Université de Lyon-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-École normale supérieure - Lyon (ENS Lyon), CIFRE PhDthesis of Matthieu Gerest funded by EDF
Source: https://hal.archives-ouvertes.fr/hal-03251738 ; 2021.
Subject Terms: numerical linear algebra, rounding error analysis, floating-point arithmetic, mixed precision algorithms, multiprecision algorithms, block low-rank matrices, data sparse matrices, LU factorization, linear systems, low-rank approximations, singular value decomposition, [INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA], [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA], [INFO.INFO-AO]Computer Science [cs]/Computer Arithmetic
Relation: hal-03251738; https://hal.archives-ouvertes.fr/hal-03251738; https://hal.archives-ouvertes.fr/hal-03251738v2/document; https://hal.archives-ouvertes.fr/hal-03251738v2/file/mixedBLR.pdf
-
16eBook
Authors: Zhang, XianyiAff1, Aff2, Zhang, YunquanAff1, Aff3, Wang, LeiAff1, Aff2
Contributors: Yuen, David A., editorAffID1, Wang, Long, editorAffID2, Chi, Xuebin, editorAffID3, Johnsson, Lennart, editorAffID4, Ge, Wei, editorAffID5, Shi, Yaolin, editorAffID6
Source: GPU Solutions to Multi-scale Problems in Science and Engineering. :555-560
-
17Report
Authors: Lopez, Florent, Mary, Théo
Contributors: Innovative Computing Laboratory Knoxville (ICL), The University of Tennessee Knoxville, Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)
Source: https://hal.archives-ouvertes.fr/hal-02937325 ; 2020.
Subject Terms: numerical linear algebra, mixed precision algorithms, high performance computing, LU factorization, tensor cores, NVIDIA GPU, rounding error analysis, [INFO]Computer Science [cs], [MATH]Mathematics [math]
Relation: hal-02937325; https://hal.archives-ouvertes.fr/hal-02937325; https://hal.archives-ouvertes.fr/hal-02937325/document; https://hal.archives-ouvertes.fr/hal-02937325/file/paper.pdf
-
18Academic Journal
Authors: Jack Dongarra, Jakub Kurzak
Source: Kurzak, J & Dongarra, J 2007, 'Implementation of mixed precision in solving systems of linear equations on the Cell processor', Concurrency and Computation: Practice & Experience, vol. 19, no. 10, pp. 1371-1385. https://doi.org/10.1002/cpe.1164
Subject Terms: Iterative refinement, Mixed-precision algorithms, 0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology, LINPACK, Cell broadband engine, HPL
Access URL: https://dblp.uni-trier.de/db/journals/concurrency/concurrency19.html#KurzakD07
http://onlinelibrary.wiley.com/doi/10.1002/cpe.1164/full
https://www.research.manchester.ac.uk/portal/en/publications/implementation-of-mixed -precision -in-solving-systems-of-linear-equations-on-the-cell-processor(775e79dd-e669-4f37-b953-6660732a7dd3).html
http://www.escholar.manchester.ac.uk/uk-ac-man-scw:1a10882
http://cscads.rice.edu/presentations/fulltext-kurzak.pdf
https://core.ac.uk/display/74115495Linked Full Text -
19Report
Authors: Sukkari, Dalal E., Ltaief, Hatem, Keyes, David E.
Contributors: Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Extreme Computing Research Center
Subject Terms: Singular Value Decomposition, Polar Decomposition, Symmetric Eigensolver, Mixed Precision Algorithms, GPU-based Scientific Computing
File Description: application/pdf
Relation: Sukkari, D., Ltaief, H., & Keyes, D. (2016). A High Performance QDWH-SVD Solver Using Hardware Accelerators. ACM Transactions on Mathematical Software, 43(1), 1–25. doi:10.1145/2894747; http://hdl.handle.net/10754/348632
-
20Academic Journal
Authors: Jakub Kurzak, Alfredo Buttari, Jack Dongarra
Contributors: The Pennsylvania State University CiteSeerX Archives
Subject Terms: CELL BE, iterative refinement, mixed-precision algorithms, Cholesky factorization
File Description: application/pdf