Search CORE

Correctly rounded multiplication by arbitrary precision constants

Author: Brisebarre Nicolas
Muller Jean-Michel
Publication venue: HAL CCSD
Publication date: 01/01/2004
Field of study

We introduce an algorithm for multiplying a floating-point number

x

by a constant

C

that is not exactly representable in floating-point arithmetic. Our algorithm uses a multiplication and a fused multiply accumulate instruction. We give methods for checking whether, for a given value of

C

and a given floating-point format, our algorithm returns a correctly rounded result for any

x

. When it does not, our methods give the values

x

for which the multiplication is not correctly rounded.Nous proposons un algorithme permettant de multiplier un nombre virgule ﬂottante x par une constante C qui n’est pas exactement représentable en virgule ﬂottante.Notre algorithme nécessite la disponibilité d’une instruction “multiplication-accumulation”. Nous donnons des méthodes pour tester si,pour une constante C et un format virgule ﬂottante donnés, notre algorithme donnera un arrondi correct pour toutes les valeurs de x.Quand ce n’est pas le cas,nos méthodes permettent de connaître toutes les valeurs de x pour lesquelles la multiplication par C n’est pas arrondie correctement

CiteSeerX

HAL-UJM

HAL-Rennes 1

Chebyshev Interpolation Polynomial-based Tools for Rigorous Computing

Author: Brisebarre Nicolas
Joldes Mioara Maria
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

17 pagesInternational audiencePerforming numerical computations, yet being able to provide rigorous mathematical statements about the obtained result, is required in many domains like global optimization, ODE solving or integration. Taylor models, which associate to a function a pair made of a Taylor approximation polynomial and a rigorous remainder bound, are a widely used rigorous computation tool. This approach benefits from the advantages of numerical methods, but also gives the ability to make reliable statements about the approximated function. Despite the fact that approximation polynomials based on interpolation at Chebyshev nodes offer a quasi-optimal approximation to a function, together with several other useful features, an analogous to Taylor models, based on such polynomials, has not been yet well-established in the field of validated numerics. This paper presents a preliminary work for obtaining such interpolation polynomials together with validated interval bounds for approximating univariate functions. We propose two methods that make practical the use of this: one is based on a representation in Newton basis and the other uses Chebyshev polynomial basis. We compare the quality of the obtained remainders and the performance of the approaches to the ones provided by Taylor models

Integer and Floating-Point Constant Multipliers for FPGAs

Author: Brisebarre Nicolas
de Dinechin Florent
Muller Jean-Michel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008
Field of study

International audienceReconfigurable circuits now have a capacity that allows them to be used as floating-point accelerators. They offer massive parallelism, but also the opportunity to design optimised floating-point hardware operators not available in microprocessors. Multiplication by a constant is an important example of such an operator. This article presents an architecture generator for the correctly rounded multiplication of a floating-point number by a constant. This constant can be a floating-point value, but also an arbitrary irrational number. The multiplication of the significands is an instance of the well-studied problem of constant integer multiplication, for which improvement to existing algorithms are also proposed and evaluated

CiteSeerX

(M,p,k)-friendly points: a table-based method for trigonometric function evaluation

Author: Brisebarre Nicolas
Ercegovac Milos
Muller Jean-Michel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2012
Field of study

International audienceWe present a new way of approximating the sine and cosine functions by a few table look-ups and additions. It consists in first reducing the input range to a very small interval by using rotations with "(M, p, k) friendly angles", proposed in this work, and then by using a bipartite table method in a small interval. An implementation of the method for 24- bit case is described and compared with CORDIC. Roughly, the proposed scheme offers a speedup of 2 compared with an unfolded double-rotation radix-2 CORDIC

Numérisation de Documents Anciens Mathématiques

Sur les fonctions entières à double pas récurrent

Author: Brisebarre Nicolas
Habsieger Laurent
Publication venue: 'Cellule MathDoc/CEDRAM'
Publication date: 01/01/1999
Field of study

Annales de l’institut Fourier (AIF)

A path-norm toolkit for modern networks: consequences, promises and challenges

Author: Brisebarre Nicolas
Gonon Antoine
Gribonval Rémi
Riccietti Elisa
Publication venue
Publication date: 13/03/2024
Field of study

This work introduces the first toolkit around path-norms that fully encompasses general DAG ReLU networks with biases, skip connections and any operation based on the extraction of order statistics: max pooling, GroupSort etc. This toolkit notably allows us to establish generalization bounds for modern neural networks that are not only the most widely applicable path-norm based ones, but also recover or beat the sharpest known bounds of this type. These extended path-norms further enjoy the usual benefits of path-norms: ease of computation, invariance under the symmetries of the network, and improved sharpness on layered fully-connected networks compared to the product of operator norms, another complexity measure most commonly used. The versatility of the toolkit and its ease of implementation allow us to challenge the concrete promises of path-norm-based generalization bounds, by numerically evaluating the sharpest known bounds for ResNets on ImageNet

arXiv.org e-Print Archive

Sur les fonctions entières à double pas récurrent

Author: Brisebarre Nicolas
Habsieger Laurent
Publication venue: 'Cellule MathDoc/CEDRAM'
Publication date: 01/01/1999
Field of study

Numérisation de Documents Anciens Mathématiques

Annales de l’institut Fourier (AIF)

Comparison between binary and decimal floating-point numbers

Author: Brisebarre Nicolas
Lauter Christoph
Mezzarobba Marc
Muller Jean-Michel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

International audienceWe introduce an algorithm to compare a binary floating-point (FP) number and a decimal FP number, assuming the "binary encoding" of the decimal formats is used, and with a special emphasis on the basic interchange formats specified by the IEEE 754-2008 standard for FP arithmetic. It is a two-step algorithm: a first pass, based on the exponents only, quickly eliminates most cases, then, when the first pass does not suffice, a more accurate second pass is performed. We provide an implementation of several variants of our algorithm, and compare them