Search Results - "mixed precision algorithms"

1

Conference

A mixed-precision quantum-classical algorithm for solving linear systems

Authors: Koska, Oceane, Baboulin, Marc, Gazda, Arnaud

Source: 2025 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) IPDPSW Parallel and Distributed Processing Symposium Workshops (IPDPSW), 2025 IEEE International. :501-508 Jun, 2025

Relation: 2025 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

2

Academic Journal

High Performance Householder QR Factorization on Emerging GPU Architectures Using Tensor Cores

Authors: Leng, Y., Zou, G., Wang, H., Wu, P., Zhang, S.

Source: IEEE Transactions on Parallel and Distributed Systems IEEE Trans. Parallel Distrib. Syst. Parallel and Distributed Systems, IEEE Transactions on. 36(3):422-436 Mar, 2025

Linked Full Text

3

Academic Journal

Truncated QR Factorization with Pivoting in Mixed Precision: Truncated QR factorization with pivoting in mixed precision

Authors: Buttari, Alfredo, Mary, Theo, Pacteau, André

Contributors: Buttari, Alfredo

Source: SIAM Journal on Scientific Computing. 47:B382-B401

Subject Terms: mixed precision, [INFO.INFO-MS] Computer Science [cs]/Mathematical Software [cs.MS], Roundoff error, [INFO.INFO-NA] Computer Science [cs]/Numerical Analysis [cs.NA], Mixed-precision algorithms, low-rank approximations, QR factorization, Numerical methods for low-rank matrix approximation, matrix compression, [INFO.INFO-AO] Computer Science [cs]/Computer Arithmetic

File Description: application/xml; application/pdf

4

Conference

Mixed-Precision Algorithm for Finding Selected Eigenvalues and Eigenvectors of Symmetric and Hermitian Matrices1

Authors: Tsai, Yaohung M., Luszczek, Piotr, Dongarra, Jack

Source: 2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH) SCALAH Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH), 2022 IEEE/ACM Workshop on. :43-50 Nov, 2022

Relation: 2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH)

5

Academic Journal

Mixed precision iterative refinement for low-rank matrix and tensor approximations

Authors: Baboulin, Marc, Kaya, Oguz, Mary, Theo, Robeyns, Matthieu

Contributors: Robeyns, Matthieu

Subject Terms: Matrix and tensor computations, Iterative refinement, Mixed precision algorithms, Tensor decompositions, [MATH] Mathematics [math], Low-rank approximations, Randomized SVD, [INFO] Computer Science [cs]

File Description: application/pdf

Access URL: https://hal.science/hal-04115337v2

View record at OpenAIRE

6

eBook

Mixed Precision Randomized Low-Rank Approximation with GPU Tensor Cores

Authors: Baboulin, Marc^Aff13, Donfack, Simplice^{Aff14, Aff17}, Kaya, Oguz^Aff15, Mary, Theo^Aff16, Robeyns, Matthieu^Aff15

Contributors: Goos, Gerhard, Series Editor^Aff1, Hartmanis, Juris, Founding Editor^Aff2, Bertino, Elisa, Editorial Board Member^Aff3, Gao, Wen, Editorial Board Member^Aff4, Steffen, Bernhard, Editorial Board Member^Aff5, Yung, Moti, Editorial Board Member^Aff6, Carretero, Jesus, editor^Aff7, Shende, Sameer, editor^Aff8, Garcia-Blas, Javier, editor^Aff9, Brandic, Ivona, editor^Aff10, Olcoz, Katzalin, editor^Aff11, Schreiber, Martin, editor^Aff12

Source: Euro-Par 2024: Parallel Processing : 30th European Conference on Parallel and Distributed Processing, Madrid, Spain, August 26–30, 2024, Proceedings, Part III. 14803:31-44

7

Conference

Mixed-Precision Tomographic Reconstructor Computations on Hardware Accelerators

Authors: Doucet, Nicolas, Ltaief, Hatem, Gratadour, Damien, Keyes, David

Source: 2019 IEEE/ACM 9th Workshop on Irregular Applications: Architectures and Algorithms (IA3) Irregular Applications: Architectures and Algorithms (IA3), 2019 IEEE/ACM 9th Workshop on. :31-38 Nov, 2019

Relation: 2019 IEEE/ACM 9th Workshop on Irregular Applications: Architectures and Algorithms (IA3)

8

Academic Journal

Mixed precision LU factorization on GPU tensor cores: reducing data movement and memory footprint

Authors: Lopez, Florent, Mary, Theo

Contributors: Mary, Theo

Source: The International Journal of High Performance Computing Applications. 37:165-179

Subject Terms: high performance computing, tensor cores, LU factorization, numerical linear algebra, mixed precision algorithms, [MATH] Mathematics [math], [INFO] Computer Science [cs], 0101 mathematics, NVIDIA GPU, 01 natural sciences, rounding error analysis

File Description: application/pdf

Access URL: https://hal.science/hal-02937325v2
https://hal.science/hal-02937325v2/document
https://doi.org/10.1177/10943420221136848

View record at OpenAIRE

Linked Full Text

9

Academic Journal

Mixed precision low-rank approximations and their application to block low-rank LU factorization

Authors: Amestoy, Patrick, Boiteau, Olivier, Buttari, Alfredo, Gerest, Matthieu, Jézéquel, Fabienne, L'Excellent, Jean-Yves, Mary, Théo

Contributors: Gerest, Matthieu, Mumps Technologies Lyon, Algorithmes Parallèles et Optimisation (IRIT-APO), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), EDF R&D (EDF R&D), EDF (EDF), Centre National de la Recherche Scientifique (CNRS), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Université Paris-Panthéon-Assas, CIFRE PhDthesis of Matthieu Gerest funded by EDF

Source: IMA Journal of Numerical Analysis. 43:2198-2227

Subject Terms: floating-point arithmetic, multiprecision algorithms, [INFO.INFO-AO]Computer Science [cs]/Computer Arithmetic, block low-rank matrices, linear systems, singular value decomposition, [INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA], [MATH.MATH-NA] Mathematics [math]/Numerical Analysis [math.NA], 01 natural sciences, rounding error analysis, [INFO.INFO-NA] Computer Science [cs]/Numerical Analysis [cs.NA], low-rank approximations, LU factorization, numerical linear algebra, data sparse matrices, mixed precision algorithms, [INFO.INFO-AO] Computer Science [cs]/Computer Arithmetic, 0101 mathematics, [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]

File Description: application/pdf

Access URL: https://hal.science/hal-03251738v3
https://doi.org/10.1093/imanum/drac037
https://hal.science/hal-03251738v3/document

View record at OpenAIRE

Linked Full Text

10

Conference

Metaprogramming Dense Linear Algebra Solvers Applications to Multi and Many-Core Architectures

Authors: Masliah, Ian, Baboulin, Marc, Falcou, Joel

Source: 2015 IEEE Trustcom/BigDataSE/ISPA Trustcom/BigDataSE/ISPA, 2015 IEEE. 3:69-76 Aug, 2015

Relation: 2015 IEEE Trustcom/BigDataSE/ISPA

11

Conference

Energy Footprint of Advanced Dense Numerical Linear Algebra Using Tile Algorithms on Multicore Architectures

Authors: Dongarra, Jack, Ltaief, Hatem, Luszczek, Piotr, Weaver, Vincent M.

Source: 2012 Second International Conference on Cloud and Green Computing Cloud and Green Computing (CGC), 2012 Second International Conference on. :274-281 Nov, 2012

Relation: 2012 International Conference on Cloud and Green Computing (CGC)

12

Report

Truncated QR factorization with pivoting in mixed precision ; Factorisation QR avec pivotage et troncature en précision mixte

Authors: Buttari, Alfredo, Mary, Théo, Pacteau, André

Contributors: Algorithmes Parallèles et Optimisation (IRIT-APO), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), ANR-22-EXNU-0003,Exa-Soft,High Performance Computing software and tools(2022), ANR-23-CE46-0005,MixHPC,Algorithmes en précision mixte pour le calcul haute performance(2023)

Source: https://hal.science/hal-04490215 ; 2024.

Subject Terms: Mixed-precision algorithms, QR factorization, low-rank approximations, [INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA], [INFO.INFO-AO]Computer Science [cs]/Computer Arithmetic, [INFO.INFO-MS]Computer Science [cs]/Mathematical Software [cs.MS]

Availability: https://hal.science/hal-04490215
https://hal.science/hal-04490215v1/document
https://hal.science/hal-04490215v1/file/paper.pdf

View record from BASE

13

Report

Mixed precision iterative refinement for low-rank matrix and tensor approximations

Authors: Baboulin, Marc, Kaya, Oguz, Mary, Theo, Robeyns, Matthieu

Contributors: Systèmes Parallèles - LISN (ParSys), Algorithmes, Apprentissage et Calcul (AAC), Laboratoire Interdisciplinaire des Sciences du Numérique (LISN), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Interdisciplinaire des Sciences du Numérique (LISN), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), DIM RFSI RC-TENSOR No 2021-05

Source: https://inria.hal.science/hal-04115337 ; 2023.

Subject Terms: Matrix and tensor computations, Low-rank approximations, Mixed precision algorithms, Iterative refinement, Randomized SVD, Tensor decompositions, 65F55, 65G50, 65Y20, 15A69, [INFO]Computer Science [cs], [MATH]Mathematics [math]

Availability: https://inria.hal.science/hal-04115337
https://inria.hal.science/hal-04115337v1/document
https://inria.hal.science/hal-04115337v1/file/Paper.pdf

View record from BASE

14

Academic Journal

Mixed Precision LU Factorization on GPU Tensor Cores: Reducing Data Movement and Memory Footprint

Authors: Lopez, Florent, Mary, Théo

Contributors: Innovative Computing Laboratory Knoxville (ICL), The University of Tennessee Knoxville, Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)

Source: ISSN: 1094-3420 ; International Journal of High Performance Computing Applications ; https://hal.science/hal-02937325 ; International Journal of High Performance Computing Applications, In press.

Subject Terms: numerical linear algebra, mixed precision algorithms, high performance computing, LU factorization, tensor cores, NVIDIA GPU, rounding error analysis, [INFO]Computer Science [cs], [MATH]Mathematics [math]

Relation: hal-02937325; https://hal.science/hal-02937325; https://hal.science/hal-02937325v2/document; https://hal.science/hal-02937325v2/file/paper.pdf

Availability: https://hal.science/hal-02937325
https://hal.science/hal-02937325v2/document
https://hal.science/hal-02937325v2/file/paper.pdf

View record from BASE

15

Report

Mixed Precision Low Rank Approximations and their Application to Block Low Rank LU Factorization

Authors: Amestoy, Patrick, Boiteau, Olivier, Buttari, Alfredo, Gerest, Matthieu, Jézéquel, Fabienne, L'Excellent, Jean-Yves, Mary, Théo

Contributors: Mumps Technologies, Algorithmes Parallèles et Optimisation (IRIT-APO), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, EDF R&D (EDF R&D), EDF (EDF), Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Laboratoire de l'Informatique du Parallélisme (LIP), Centre National de la Recherche Scientifique (CNRS)-Université de Lyon-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-École normale supérieure - Lyon (ENS Lyon), CIFRE PhDthesis of Matthieu Gerest funded by EDF

Source: https://hal.archives-ouvertes.fr/hal-03251738 ; 2021.

Subject Terms: numerical linear algebra, rounding error analysis, floating-point arithmetic, mixed precision algorithms, multiprecision algorithms, block low-rank matrices, data sparse matrices, LU factorization, linear systems, low-rank approximations, singular value decomposition, [INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA], [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA], [INFO.INFO-AO]Computer Science [cs]/Computer Arithmetic

Relation: hal-03251738; https://hal.archives-ouvertes.fr/hal-03251738; https://hal.archives-ouvertes.fr/hal-03251738v2/document; https://hal.archives-ouvertes.fr/hal-03251738v2/file/mixedBLR.pdf

Availability: https://hal.archives-ouvertes.fr/hal-03251738
https://hal.archives-ouvertes.fr/hal-03251738v2/document
https://hal.archives-ouvertes.fr/hal-03251738v2/file/mixedBLR.pdf

View record from BASE

16

eBook

Using Mixed Precision Algorithm for LINPACK Benchmark on AMD GPU

Authors: Zhang, Xianyi^{Aff1, Aff2}, Zhang, Yunquan^{Aff1, Aff3}, Wang, Lei^{Aff1, Aff2}

Contributors: Yuen, David A., editor^AffID1, Wang, Long, editor^AffID2, Chi, Xuebin, editor^AffID3, Johnsson, Lennart, editor^AffID4, Ge, Wei, editor^AffID5, Shi, Yaolin, editor^AffID6

Source: GPU Solutions to Multi-scale Problems in Science and Engineering. :555-560

17

Report

Mixed Precision LU Factorization on GPU Tensor Cores: Reducing Data Movement and Memory Footprint

Authors: Lopez, Florent, Mary, Théo

Contributors: Innovative Computing Laboratory Knoxville (ICL), The University of Tennessee Knoxville, Performance et Qualité des Algorithmes Numériques (PEQUAN), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)

Source: https://hal.archives-ouvertes.fr/hal-02937325 ; 2020.

Subject Terms: numerical linear algebra, mixed precision algorithms, high performance computing, LU factorization, tensor cores, NVIDIA GPU, rounding error analysis, [INFO]Computer Science [cs], [MATH]Mathematics [math]

Relation: hal-02937325; https://hal.archives-ouvertes.fr/hal-02937325; https://hal.archives-ouvertes.fr/hal-02937325/document; https://hal.archives-ouvertes.fr/hal-02937325/file/paper.pdf

Availability: https://hal.archives-ouvertes.fr/hal-02937325
https://hal.archives-ouvertes.fr/hal-02937325/document
https://hal.archives-ouvertes.fr/hal-02937325/file/paper.pdf

View record from BASE

18

Academic Journal

Implementation of mixed precision in solving systems of linear equations on the Cell processor

Authors: Jack Dongarra, Jakub Kurzak

Source: Kurzak, J & Dongarra, J 2007, 'Implementation of mixed precision in solving systems of linear equations on the Cell processor', Concurrency and Computation: Practice & Experience, vol. 19, no. 10, pp. 1371-1385. https://doi.org/10.1002/cpe.1164

Subject Terms: Iterative refinement, Mixed-precision algorithms, 0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology, LINPACK, Cell broadband engine, HPL

Access URL: https://dblp.uni-trier.de/db/journals/concurrency/concurrency19.html#KurzakD07
http://onlinelibrary.wiley.com/doi/10.1002/cpe.1164/full
https://www.research.manchester.ac.uk/portal/en/publications/implementation-of-mixed-precision-in-solving-systems-of-linear-equations-on-the-cell-processor(775e79dd-e669-4f37-b953-6660732a7dd3).html
http://www.escholar.manchester.ac.uk/uk-ac-man-scw:1a10882
http://cscads.rice.edu/presentations/fulltext-kurzak.pdf
https://core.ac.uk/display/74115495

Linked Full Text

19

Report

A High Performance QDWH-SVD Solver using Hardware Accelerators

Authors: Sukkari, Dalal E., Ltaief, Hatem, Keyes, David E.

Contributors: Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Extreme Computing Research Center

Subject Terms: Singular Value Decomposition, Polar Decomposition, Symmetric Eigensolver, Mixed Precision Algorithms, GPU-based Scientific Computing

File Description: application/pdf

Relation: Sukkari, D., Ltaief, H., & Keyes, D. (2016). A High Performance QDWH-SVD Solver Using Hardware Accelerators. ACM Transactions on Mathematical Software, 43(1), 1–25. doi:10.1145/2894747; http://hdl.handle.net/10754/348632

Availability: http://hdl.handle.net/10754/348632
https://doi.org/10.1145/2894747

View record from BASE

20

Academic Journal