FLAC v1.3.3 - Page 4

Re: FLAC v1.3.3

Reply #75 – 2020-04-12 13:51:54

FLAC 7a35c52 MSYS2/mingw-w64-x86_64 2020-04-12 (64bit, GCC 9.3)

Aggressively optimized builds tuned for modern processors. Included is a generic x86_64 build and an Intel Haswell optimized build that will only run on Haswell or newer Intel CPUs and Zen or newer AMD CPUs. All FLAC tests passed for both builds.

Changes:

Rebuilt against a newer Mingw-w64 git snapshot
Haswell binaries are now built without FMA support which should allow reproducible compression sizes with other builds

The new Haswell build is 8.1% faster than the previous Haswell build and 5% faster than the generic build on a 2700X (--verify --best) with this test file: BIS1447-002-flac_16.flac

Build information:

Environment variables for generic:

Code: [Select]

CPPFLAGS="-D_FORTIFY_SOURCE=0"
CFLAGS="-march=x86-64 -O3 -fno-common -fno-plt -fno-semantic-interposition -fno-stack-protector -fno-math-errno -fno-trapping-math -falign-functions=32 -fdevirtualize-at-ltrans -fgraphite-identity -floop-nest-optimize -fipa-pta -flto -ffat-lto-objects -funroll-loops -m64 -mtune=skylake -pipe"
CXXFLAGS="$CFLAGS"
LDFLAGS="-Wl,-O1,--sort-common,--as-needed ${CFLAGS}"

Environment variables for Haswell:

Code: [Select]

CPPFLAGS="-D_FORTIFY_SOURCE=0"
CFLAGS="-march=haswell -O3 -fno-common -fno-plt -fno-semantic-interposition -fno-stack-protector -fno-math-errno -fno-trapping-math -falign-functions=32 -fdevirtualize-at-ltrans -fgraphite-identity -floop-nest-optimize -fipa-pta -flto -ffat-lto-objects -funroll-loops -m64 -mno-fma -pipe"
CXXFLAGS="$CFLAGS"
LDFLAGS="-Wl,-O1,--sort-common,--as-needed ${CFLAGS}"

FLAC configuration:

Code: [Select]

./configure \
--disable-shared \
--enable-static \
--disable-xmms-plugin \
--disable-rpath \
--prefix=${MINGW_PREFIX} \
--build=${MINGW_CHOST} \
--host=${MINGW_CHOST} \
--target=${MINGW_CHOST} \
--enable-64-bit-words \
--disable-stack-smash-protection

I'd appreciate it if anyone else can confirm the compression difference is now gone and how much of a performance hit or increase Intel CPUs receive.

Re: FLAC v1.3.3

Reply #76 – 2020-04-13 08:42:00

@GrieverV:
- The compression difference is gone; same FLACs as with NetRanger's binaries here
- On my i7-6700 I cannot confirm a speed increase for the Haswell build; a quick test showed roughly the same performance as Case's build, but your FMA-enabled build is still a bit faster (~1%).

Pardon my ignorance: What does this "FMA support" do? I was under the impression that compiler options can (and should) have an impact on the binary's size and speed, but can they change the output files of a lossless encoder (even though both files are valid FLACs with identical MD5)? Isn't the algorithm fixed by the source code?

Re: FLAC v1.3.3

Reply #77 – 2020-04-13 09:49:10

Thanks for confirming and testing on Intel!

I'm not a math person and floating-point (approximate) stuff is like black magic to me, but as far as I understand, fused multiply-add computes a*b+c with a single rounding at the end, whereas the usual separate multiply and add rounds twice, so the results can differ slightly. Most modern CPUs have hardware support for fused multiply-add through the FMA instruction set, and GCC can use those instructions when optimizing if the -mfma flag is specified, which happens implicitly when -march=haswell is set. Compilers usually have further options that affect floating-point math.

libFLAC uses floating-point math for encoding that can be optimized by GCC and it seems enabling FMA will change the result in some cases.
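To make the rounding difference concrete, here is a minimal Python sketch. It is not libFLAC code: the function names and input values are made up for illustration, and the fused operation is only emulated by evaluating a*b+c exactly with rationals and rounding once, which is the behaviour an FMA instruction is specified to have.

```python
from fractions import Fraction


def unfused_multiply_add(a: float, b: float, c: float) -> float:
    """Multiply, round to double, then add and round again (two roundings)."""
    product = a * b
    return product + c


def fused_multiply_add(a: float, b: float, c: float) -> float:
    """Emulate FMA: evaluate a*b+c exactly, then round to double once."""
    exact = Fraction(a) * Fraction(b) + Fraction(c)
    return float(exact)


# Hypothetical inputs chosen so that the exact product (1 - 2**-60)
# is not representable as a double and rounds to 1.0.
a = 1.0 + 2.0 ** -30
b = 1.0 - 2.0 ** -30
c = -1.0

print(unfused_multiply_add(a, b, c))  # product rounded to 1.0 first, so 0.0
print(fused_multiply_add(a, b, c))    # single rounding keeps -2**-60
```

Both answers are "correct" floating-point results; they just come from different rounding sequences, which is enough to nudge an encoder towards slightly different parameters.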

Re: FLAC v1.3.3

Reply #78 – 2020-04-13 09:56:18

What must I do to make a compile that is compatible with Sandy Bridge CPUs, but not with earlier generations?
Is there a step-by-step guide I can follow?
I tried yesterday by installing MinGW and setting things up, but it just wouldn't work.

Re: FLAC v1.3.3

Reply #79 – 2020-04-13 10:03:54

@itisljar
Assuming you're using MSYS2, all you have to do is change -march=haswell to -march=sandybridge in the makepkg_mingw64.conf included with the Haswell build, copy it to msys64/etc/, start the MSYS2 MinGW 64-bit shell, and run 'MINGW_INSTALLS=mingw64 makepkg-mingw -sic' in each of the folders containing the PKGBUILDs. It's easiest to copy the libogg/ and flac-git/ folders to msys64\home\<user> so all you have to do is 'cd <folder>/'.

If you're using plain Mingw-w64, you'll need to set the environment variables manually and refer to the PKGBUILDs to see how ogg and FLAC are built.

Somewhat step-by-step for MSYS2:

Code: [Select]

1) Install MSYS2
2) Start MSYS2 MinGW 64-bit shell
3) Run 'pacman -Syyu' and then close the shell
4) Reopen the shell and run 'pacman -Syu'
5) Install mingw toolchain, cmake and nasm 'pacman -S --needed mingw-w64-x86_64-toolchain mingw-w64-x86_64-cmake mingw-w64-x86_64-nasm'
6) Modify makepkg_mingw64.conf and change march=haswell to march=sandybridge
7) Copy makepkg_mingw64.conf to msys64/etc/
8) Copy libogg/ and flac-git/ to msys64\home\<user>
9) Using the MSYS2 MinGW 64-bit shell, enter the libogg/ directory by 'cd libogg/'
10) Build and install libogg with 'MINGW_INSTALLS=mingw64 makepkg-mingw -sic'
11) Repeat steps 9 and 10 for flac-git (cd ../flac-git/)
12) You should now have a *.pkg.tar.xz archive containing the built libogg/FLAC inside each folder containing a PKGBUILD (libogg/ flac-git/)

edit: Fair warning, anti-viruses hate MSYS2 and may completely break your MSYS2 install when you try to build any package (curl.exe, bash.exe, etc.). You might want to add an exception for the MSYS2 folder while building packages and then remove it from exceptions when done to avoid any security issues in the future while not using MSYS2.

Re: FLAC v1.3.3

Reply #80 – 2020-04-13 11:11:26

@GrieverV:
Thank you very much for shedding some light on FMA optimisation. Very interesting stuff.

Btw, here are the results of my usual test set:

Code: [Select]

Intel Core i7-6700 CPU @ 3.40GHz, 24 GB RAM, Samsung SSD 850 EVO 500GB
Windows 10 Pro 64-bit, Version 1909, Build 18363.720

flac133_case.exe:
39.303 sec (avg. runtime/5 runs, fastest run: 39.228 sec)
total size of FLAC files: 1.167.279.038 bytes -> different build!

flac133_GrieverV_7a35c528_haswell_FMA.exe (07.04.2020):
38.964 sec (avg. runtime/5 runs, fastest run: 38.814 sec)
total size of FLAC files: 1.167.276.028 bytes

flac133_GrieverV_7a35c528_haswell_noFMA.exe (12.04.2020):
39.169 sec (avg. runtime/5 runs, fastest run: 39.055 sec)
total size of FLAC files: 1.167.278.241 bytes

flac133_GrieverV_7a35c528.exe (12.04.2020):
40.192 sec (avg. runtime/5 runs, fastest run: 40.068 sec)
total size of FLAC files: 1.167.278.241 bytes
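For anyone who wants the relative differences rather than raw seconds, here is a small Python sketch that works them out from the averages quoted above. The dictionary keys and helper names are my own shorthand, not anything from the builds themselves.

```python
# (average runtime in seconds, total FLAC size in bytes), copied from the post above
results = {
    "case": (39.303, 1_167_279_038),
    "haswell_fma": (38.964, 1_167_276_028),
    "haswell_nofma": (39.169, 1_167_278_241),
    "generic": (40.192, 1_167_278_241),
}


def speedup_percent(baseline: str, candidate: str) -> float:
    """How much faster the candidate is than the baseline, in percent."""
    baseline_sec = results[baseline][0]
    candidate_sec = results[candidate][0]
    return (baseline_sec / candidate_sec - 1.0) * 100.0


def size_delta_bytes(baseline: str, candidate: str) -> int:
    """Candidate output size minus baseline output size, in bytes."""
    return results[candidate][1] - results[baseline][1]


for name in ("haswell_nofma", "haswell_fma", "case"):
    print(
        f"{name} vs generic: "
        f"{speedup_percent('generic', name):+.2f}% speed, "
        f"{size_delta_bytes('generic', name):+d} bytes"
    )
```

On these numbers the non-FMA Haswell build comes out roughly 2.6% faster than the generic one with byte-identical total size, while the FMA build is about another 0.5% faster and 2213 bytes smaller in total.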

Re: FLAC v1.3.3

Reply #81 – 2020-04-13 11:40:57

Quote from: GrieverV on 2020-04-13 10:03:54

Somewhat step-by-step for MSYS2:

Oh, thank you very much! I will try these in the next few days!

Re: FLAC v1.3.3

Reply #82 – 2020-04-13 13:06:50

Can't edit the post, but... do I edit these:

Code: [Select]

#-- Compiler and Linker Flags
# -march (or -mcpu) builds exclusively for an architecture
# -mtune optimizes for an architecture, but builds for whole processor family
CPPFLAGS="-D__USE_MINGW_ANSI_STDIO=1"
CFLAGS="-march=x86-64 -mtune=generic -O2 -pipe"
CXXFLAGS="-march=x86-64 -mtune=generic -O2 -pipe"
LDFLAGS="-pipe"

Like this?

Code: [Select]

#-- Compiler and Linker Flags
# -march (or -mcpu) builds exclusively for an architecture
# -mtune optimizes for an architecture, but builds for whole processor family
CPPFLAGS="-D__USE_MINGW_ANSI_STDIO=1"
CFLAGS="-march=x86-64 -mtune=sandybridge -O2 -pipe"
CXXFLAGS="-march=x86-64 -mtune=sandybridge -O2 -pipe"
LDFLAGS="-pipe"

Or like this?

Code: [Select]

#-- Compiler and Linker Flags
# -march (or -mcpu) builds exclusively for an architecture
# -mtune optimizes for an architecture, but builds for whole processor family
CPPFLAGS="-D__USE_MINGW_ANSI_STDIO=1"
CFLAGS="-march=sandybridge -mtune=generic -O2 -pipe"
CXXFLAGS="-march=sandybridge -mtune=generic -O2 -pipe"
LDFLAGS="-pipe"

Re: FLAC v1.3.3

Reply #83 – 2020-04-13 13:09:03

sundance: From what I understand, FLAC uses floating-point math in Linear Predictive Coding. Different floating-point instructions have different precision, plus there are rounding errors. Most lossless encoders try to predict the next sample(s) and store the prediction error in the resulting file. With slightly different math the predictions differ, so the stored correction information differs and the file size is different, but the result is still lossless. Hopefully...
In the case of lossy encoders, a different compile may produce a different file too, but there the decoded audio will also be different, though inaudibly so. Hopefully...

Re: FLAC v1.3.3

Reply #84 – 2020-04-13 19:31:44

Thanks a lot for your explanation. That makes perfect sense.
Somehow I tended to think that lossless compression and (imprecise) floating point calculations can't go together...

Re: FLAC v1.3.3

Reply #85 – 2020-04-16 07:56:13

FLAC uses integer math, not floating point. From the Wikipedia article:

The FLAC format supports only integer samples, not floating-point. It can handle any PCM bit resolution from 4 to 32 bits per sample, any sampling rate from 1 Hz to 65,535 Hz in 1 Hz increments or from 10 Hz to 655,350 Hz in 10 Hz increments, and any number of channels from 1 to 8.[9] To Date (Vers. 1.3.3 of the reference encoder), FLAC encoding is limited to 24 bits per sample since no encoder for 32 bits per sample exists.[10]
Channels can be grouped in some cases, for example stereo and 5.1 channel surround, to take advantage of interchannel correlations to increase compression.
CRC checksums are used for identifying corrupted frames when used in a streaming protocol. The file also includes a complete MD5 hash of the raw PCM audio in its STREAMINFO metadata header. FLAC allows for a Rice parameter between 0 and 16.
FLAC uses linear prediction to convert the audio samples. There are two steps, the predictor and the error coding. The predictor can be one of four types (Zero, Verbatim, Fixed Linear and Finite Impulse Response (FIR) Linear). The difference between the predictor and the actual sample data is calculated and is known as the residual. The residual is stored efficiently using Golomb-Rice coding. It also uses run-length encoding for blocks of identical samples, such as silent passages.
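As a rough illustration of the residual coding step mentioned in the quote, here is a minimal Rice coder in Python. It is only a sketch: the bit strings, the zig-zag mapping of signed values, and the "zeros then a one" unary convention are assumptions made for readability, not a claim about the exact FLAC bitstream layout.

```python
def zigzag(value: int) -> int:
    """Map signed residuals to non-negative integers: 0, -1, 1, -2, ... -> 0, 1, 2, 3, ..."""
    return 2 * value if value >= 0 else -2 * value - 1


def unzigzag(mapped: int) -> int:
    """Inverse of zigzag()."""
    return mapped >> 1 if mapped % 2 == 0 else -((mapped + 1) >> 1)


def rice_encode(value: int, k: int) -> str:
    """Encode one residual with Rice parameter k as a string of '0'/'1'."""
    mapped = zigzag(value)
    quotient = mapped >> k
    remainder = mapped & ((1 << k) - 1)
    bits = "0" * quotient + "1"          # unary quotient, terminated by a 1
    if k > 0:
        bits += format(remainder, f"0{k}b")  # k low-order bits, fixed width
    return bits


def rice_decode(bits: str, k: int) -> int:
    """Decode a single codeword produced by rice_encode()."""
    quotient = bits.index("1")
    remainder = int(bits[quotient + 1:quotient + 1 + k], 2) if k > 0 else 0
    return unzigzag((quotient << k) | remainder)


print(rice_encode(-1, 2))
print(rice_encode(5, 2))
```

Small residuals get short codewords, which is why a better predictor (smaller residuals) gives a smaller file.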

Thanks

Re: FLAC v1.3.3

Reply #86 – 2020-04-16 09:40:58

The FLAC format uses only integer math, but FLAC encoders can use floating-point math to select the (integer) parameters used for the linear prediction.
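A toy Python sketch of that point: the float coefficients below are invented stand-ins for what two differently compiled encoders might compute, and the quantization scheme is simplified compared to the real encoder. Once the coefficients are quantized to integers, prediction, residual and reconstruction use integer math only, so either set decodes bit-exactly even though the residuals differ.

```python
def quantize(coeffs, shift):
    """Turn float predictor coefficients into fixed-point integers."""
    return [round(c * (1 << shift)) for c in coeffs]


def predict(history, qcoeffs, shift):
    """Integer-only prediction; history holds the most recent sample last."""
    acc = sum(q * s for q, s in zip(qcoeffs, reversed(history)))
    return acc >> shift


def encode(samples, qcoeffs, shift):
    """Return (warm-up samples, residuals) for the given integer predictor."""
    order = len(qcoeffs)
    residual = [
        samples[i] - predict(samples[i - order:i], qcoeffs, shift)
        for i in range(order, len(samples))
    ]
    return samples[:order], residual


def decode(warmup, residual, qcoeffs, shift):
    """Rebuild the samples by running the same integer predictor."""
    order = len(qcoeffs)
    out = list(warmup)
    for r in residual:
        out.append(r + predict(out[-order:], qcoeffs, shift))
    return out


samples = [0, 100, 198, 293, 384, 470]
shift = 8
# Hypothetical coefficients differing only in the last digits,
# as might result from different floating-point rounding.
q_a = quantize([1.99804, -0.998], shift)
q_b = quantize([1.99805, -0.998], shift)

warm_a, res_a = encode(samples, q_a, shift)
warm_b, res_b = encode(samples, q_b, shift)
print(q_a, res_a)
print(q_b, res_b)
print(decode(warm_a, res_a, q_a, shift) == samples)
print(decode(warm_b, res_b, q_b, shift) == samples)
```

The tiny float difference lands on opposite sides of a rounding boundary, so the two "encoders" store different integer coefficients and different residuals, yet both round-trip to the original samples.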

Re: FLAC v1.3.3

Reply #87 – 2020-05-03 16:31:07

FLAC v1.3.3-Git-2020-05-03-37e675b
Built on May 03, 2020, GCC 9.3.0

Latest commit included : 37e675b

https://xiph.org/flac/

https://github.com/xiph/flac/commits/master
https://gitlab.xiph.org/xiph/flac

Re: FLAC v1.3.3

Reply #88 – 2020-05-14 17:36:23

FLAC v1.3.3-Git-2020-05-14-ce6dd6b
Built on May 14, 2020, GCC 10.1.0

Latest commit included : ce6dd6b

https://xiph.org/flac/

https://github.com/xiph/flac/commits/master
https://gitlab.xiph.org/xiph/flac

Re: FLAC v1.3.3

Reply #89 – 2020-06-06 15:39:28

1.3.3.r3913.ce6dd6b5-1, gcc.exe (Rev3, Built by MSYS2 project) 10.1.0
-march=native on "Coffee Lake", but should be skylake compatible

Code: [Select]

$ gcc -march=native -Q --help=target | grep march
  -march=                               skylake
  Known valid arguments for -march= option:

Test -8ep

NetRanger's Built on May 14, 2020, GCC 10.1.0 - 10.91x
FLAC v.1.3.3 win64 ICL 19 compile - by John33, 10.08.2019 - 10.87x
flac-1.3.2-win.zip 2017-01-01 from site - 13.07x
This build - 14.53x

Re: FLAC v1.3.3

Reply #90 – 2020-06-06 17:53:29

Quote from: SigHunter on 2020-03-10 08:11:04

1.3.3 was released over 6 months ago and never even made the news on the homepage, that's a little sad IMHO
https://xiph.org/flac/news.html

Thanks, 1.3.3 is now on the news page.

Re: FLAC v1.3.3

Reply #91 – 2020-06-08 13:49:44

What about official win compiles?

Re: FLAC v1.3.3

Reply #92 – 2020-06-08 14:35:21

Quote from: rutra80 on 2020-06-08 13:49:44

What about official win compiles?

Quote from: kode54 on 2019-08-19 02:28:21

Erik has already made it quite clear that he couldn't care any less about proprietary ("dead") operating systems.

Re: FLAC v1.3.3

Reply #93 – 2020-06-08 18:57:20

I wonder what's smarter: claiming that Windows is dead, or not caring about most of your users.

Re: FLAC v1.3.3

Reply #94 – 2020-06-08 20:48:47

I can't say I'm particularly happy about his stance either, but John33's compiles at RareWares have been the de facto Windows compiles for quite some time, anyway; until a few years ago, they trounced the "official" build in performance, to boot.

Re: FLAC v1.3.3

Reply #95 – 2020-06-09 06:39:35

John33 uses ICL to compile the win32 and win64 builds. FLAC Frontend

Re: FLAC v1.3.3

Reply #96 – 2020-06-09 07:58:52

Quote from: kode54 on 2019-08-19 02:28:21

Erik has already made it quite clear that he couldn't care any less about proprietary ("dead") operating systems.

How old is he, 15?

Oh, never mind, we can get Windows compiles elsewhere.

Re: FLAC v1.3.3

Reply #97 – 2020-06-11 17:30:18

Does anyone have a Visual Studio compile? I'm trying to use CMake with Ninja, but some component is not found.

Re: FLAC v1.3.3

Reply #98 – 2020-06-11 18:24:39

I seem to be a little late to the party here!!

I can generate Intel compiles and/or Visual Studio compiles. What does everybody want?

Re: FLAC v1.3.3

Reply #99 – 2020-06-11 21:03:36

Quote from: john33 on 2020-06-11 18:24:39

I seem to be a little late to the party here!!

I can generate Intel compiles and/or Visual Studio compiles. What does everybody want?

I'd prefer Intel compiles (that's what you currently have on the RW site, correct?), but either would be great.
