<!DOCTYPE html>
<html>
<!-- Created by GNU Texinfo 7.0.1, https://www.gnu.org/software/texinfo/ -->
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<title>FFmpeg Resampler Documentation</title>

<meta name="description" content="FFmpeg Resampler Documentation">
<meta name="keywords" content="FFmpeg Resampler Documentation">
<meta name="resource-type" content="document">
<meta name="distribution" content="global">
<meta name="Generator" content="makeinfo">
<meta name="viewport" content="width=device-width,initial-scale=1">

<link href="#SEC_Contents" rel="contents" title="Table of Contents">
<style type="text/css">
<!--
ul.toc-numbered-mark {list-style: none}
-->
</style>

</head>

<body lang="en">
<div class="top-level-extent" id="SEC_Top">

<div class="element-contents" id="SEC_Contents">
<h2 class="contents-heading">Table of Contents</h2>

<div class="contents">

<ul class="toc-numbered-mark">
<li><a id="toc-Description" href="#Description">1 Description</a></li>
<li><a id="toc-Resampler-Options" href="#Resampler-Options">2 Resampler Options</a></li>
<li><a id="toc-See-Also" href="#See-Also">3 See Also</a></li>
<li><a id="toc-Authors" href="#Authors">4 Authors</a></li>
</ul>
</div>
</div>
<div class="chapter-level-extent" id="Description">
<h2 class="chapter">1 Description</h2>

<p>The FFmpeg resampler provides a high-level interface to the
libswresample library audio resampling utilities. In particular, it
can resample audio, rematrix audio channel layouts,
and convert the audio sample format and packing layout.
</p>

</div>
<div class="chapter-level-extent" id="Resampler-Options">
<h2 class="chapter">2 Resampler Options</h2>

<p>The audio resampler supports the following named options.
</p>
<p>Options may be set by specifying -<var class="var">option</var> <var class="var">value</var> in the
FFmpeg tools, <var class="var">option</var>=<var class="var">value</var> for the aresample filter,
or, for programmatic use, by setting the value explicitly in the
<code class="code">SwrContext</code> options or through the <samp class="file">libavutil/opt.h</samp> API.
</p>
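<p>As a rough illustration of the first two forms above, the following Python sketch assembles an <code class="code">aresample</code> filter description and the matching <code class="code">-option value</code> argument list from one dictionary. The helper names <code class="code">build_aresample</code> and <code class="code">build_cli_args</code> are hypothetical and not part of FFmpeg, and the sketch assumes that no value contains characters that would need escaping in a filter description.
</p>

```python
def build_aresample(options):
    """Join option/value pairs into an aresample filter description."""
    pairs = [f"{name}={value}" for name, value in options.items()]
    return "aresample=" + ":".join(pairs)


def build_cli_args(options):
    """Flatten option/value pairs into '-option value' style arguments."""
    args = []
    for name, value in options.items():
        args.extend([f"-{name}", str(value)])
    return args


# Option names and values below are taken from the option list in this chapter.
opts = {"resampler": "soxr", "precision": 28, "dither_method": "triangular"}
filter_desc = build_aresample(opts)
cli_args = build_cli_args(opts)
print(filter_desc)
print(cli_args)
```

<p>Keeping the options in one dictionary makes it easy to switch between the filter form and the command-line form without retyping names.
</p>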
<dl class="table">
<dt><samp class="option">uchl, used_chlayout</samp></dt>
<dd><p>Set the used input channel layout. Default is unset. This option is
only used for special remapping.
</p>
</dd>
<dt><samp class="option">isr, in_sample_rate</samp></dt>
<dd><p>Set the input sample rate. Default value is 0.
</p>
</dd>
<dt><samp class="option">osr, out_sample_rate</samp></dt>
<dd><p>Set the output sample rate. Default value is 0.
</p>
</dd>
<dt><samp class="option">isf, in_sample_fmt</samp></dt>
<dd><p>Specify the input sample format. It is set by default to <code class="code">none</code>.
</p>
</dd>
<dt><samp class="option">osf, out_sample_fmt</samp></dt>
<dd><p>Specify the output sample format. It is set by default to <code class="code">none</code>.
</p>
</dd>
<dt><samp class="option">tsf, internal_sample_fmt</samp></dt>
<dd><p>Set the internal sample format. Default value is <code class="code">none</code>.
This will automatically be chosen when it is not explicitly set.
</p>
</dd>
<dt><samp class="option">ichl, in_chlayout</samp></dt>
<dt><samp class="option">ochl, out_chlayout</samp></dt>
<dd><p>Set the input/output channel layout.
</p>
<p>See <a data-manual="ffmpeg-utils" href="ffmpeg-utils.html#channel-layout-syntax">(ffmpeg-utils)the Channel Layout section in the ffmpeg-utils(1) manual</a>
for the required syntax.
</p>
</dd>
<dt><samp class="option">clev, center_mix_level</samp></dt>
<dd><p>Set the center mix level. The value is expressed in decibels and must be
in the interval [-32,32].
</p>
</dd>
<dt><samp class="option">slev, surround_mix_level</samp></dt>
<dd><p>Set the surround mix level. The value is expressed in decibels and must
be in the interval [-32,32].
</p>
</dd>
<dt><samp class="option">lfe_mix_level</samp></dt>
<dd><p>Set the level at which LFE is mixed into non-LFE channels. It is used when
there is an LFE input but no LFE output. The value is expressed in decibels and must
be in the interval [-32,32].
</p>
</dd>
<dt><samp class="option">rmvol, rematrix_volume</samp></dt>
<dd><p>Set the rematrix volume. Default value is 1.0.
</p>
</dd>
<dt><samp class="option">rematrix_maxval</samp></dt>
<dd><p>Set the maximum output value for rematrixing.
This can be used to choose between preventing clipping and preventing volume reduction.
A value of 1.0 prevents clipping.
</p>
</dd>
<dt><samp class="option">flags, swr_flags</samp></dt>
<dd><p>Set flags used by the converter. Default value is 0.
</p>
<p>It supports the following individual flags:
</p><dl class="table">
<dt><samp class="option">res</samp></dt>
<dd><p>Force resampling. This flag forces resampling to be used even when the
input and output sample rates match.
</p></dd>
</dl>

</dd>
<dt><samp class="option">dither_scale</samp></dt>
<dd><p>Set the dither scale. Default value is 1.
</p>
</dd>
<dt><samp class="option">dither_method</samp></dt>
<dd><p>Set the dither method. Default value is 0.
</p>
<p>Supported values:
</p><dl class="table">
<dt>‘<samp class="samp">rectangular</samp>’</dt>
<dd><p>Select rectangular dither.
</p></dd>
<dt>‘<samp class="samp">triangular</samp>’</dt>
<dd><p>Select triangular dither.
</p></dd>
<dt>‘<samp class="samp">triangular_hp</samp>’</dt>
<dd><p>Select triangular dither with high pass.
</p></dd>
<dt>‘<samp class="samp">lipshitz</samp>’</dt>
<dd><p>Select Lipshitz noise shaping dither.
</p></dd>
<dt>‘<samp class="samp">shibata</samp>’</dt>
<dd><p>Select Shibata noise shaping dither.
</p></dd>
<dt>‘<samp class="samp">low_shibata</samp>’</dt>
<dd><p>Select low Shibata noise shaping dither.
</p></dd>
<dt>‘<samp class="samp">high_shibata</samp>’</dt>
<dd><p>Select high Shibata noise shaping dither.
</p></dd>
<dt>‘<samp class="samp">f_weighted</samp>’</dt>
<dd><p>Select f-weighted noise shaping dither.
</p></dd>
<dt>‘<samp class="samp">modified_e_weighted</samp>’</dt>
<dd><p>Select modified-e-weighted noise shaping dither.
</p></dd>
<dt>‘<samp class="samp">improved_e_weighted</samp>’</dt>
<dd><p>Select improved-e-weighted noise shaping dither.
</p>
</dd>
</dl>

</dd>
<dt><samp class="option">resampler</samp></dt>
<dd><p>Set the resampling engine. Default value is swr.
</p>
<p>Supported values:
</p><dl class="table">
<dt>‘<samp class="samp">swr</samp>’</dt>
<dd><p>Select the native SW Resampler; filter options precision and cheby are not
applicable in this case.
</p></dd>
<dt>‘<samp class="samp">soxr</samp>’</dt>
<dd><p>Select the SoX Resampler (where available); compensation, and filter options
filter_size, phase_shift, exact_rational, filter_type &amp; kaiser_beta, are not
applicable in this case.
</p></dd>
</dl>

</dd>
<dt><samp class="option">filter_size</samp></dt>
<dd><p>For swr only, set the resampling filter size. Default value is 32.
</p>
</dd>
<dt><samp class="option">phase_shift</samp></dt>
<dd><p>For swr only, set the resampling phase shift. Default value is 10; the value
must be in the interval [0,30].
</p>
</dd>
<dt><samp class="option">linear_interp</samp></dt>
<dd><p>Use linear interpolation when enabled (the default). Disable it if you want
to preserve speed instead of quality when exact_rational fails.
</p>
</dd>
<dt><samp class="option">exact_rational</samp></dt>
<dd><p>For swr only, when enabled, try to use the exact phase_count based on the input and
output sample rates. However, if it is larger than <code class="code">1 &lt;&lt; phase_shift</code>,
phase_count falls back to <code class="code">1 &lt;&lt; phase_shift</code>. Default is enabled.
</p>
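<p>The fallback rule can be sketched in a few lines of Python. This is only an illustration: it assumes that the exact phase count is the output rate divided by the greatest common divisor of the two rates, which the text above does not spell out, and the function name <code class="code">phase_count</code> is hypothetical.
</p>

```python
from math import gcd


def phase_count(in_rate, out_rate, phase_shift=10, exact_rational=True):
    """Sketch of the documented exact_rational fallback (assumptions in the lead-in)."""
    fallback = 2 ** phase_shift  # same value as 1 shifted left by phase_shift
    if not exact_rational:
        return fallback
    # Assumed definition of the exact count: the reduced output-side term of the ratio.
    exact = out_rate // gcd(in_rate, out_rate)
    # Fall back when the exact count exceeds the limit set by phase_shift.
    return fallback if exact > fallback else exact


print(phase_count(44100, 48000))
print(phase_count(44100, 44101))
```

<p>With the default phase_shift of 10 the limit is 1024, so common rate pairs such as 44100 and 48000 stay below it, while nearly coprime rates hit the fallback.
</p>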
</dd>
<dt><samp class="option">cutoff</samp></dt>
<dd><p>Set cutoff frequency (swr: 6dB point; soxr: 0dB point) ratio; must be a float
value between 0 and 1. Default value is 0.97 with swr, and 0.91 with soxr
(which, with a sample-rate of 44100, preserves the entire audio band to 20kHz).
</p>
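<p>The 20kHz remark can be checked with a short calculation. The sketch below assumes the ratio is taken relative to the Nyquist frequency (half the sample rate); the helper name <code class="code">cutoff_hz</code> is hypothetical.
</p>

```python
def cutoff_hz(ratio, sample_rate):
    """Convert a cutoff ratio to Hz, assuming it is relative to Nyquist."""
    if not (0.0 <= ratio <= 1.0):
        raise ValueError("cutoff ratio must be between 0 and 1")
    nyquist = sample_rate / 2.0
    return ratio * nyquist


soxr_default = cutoff_hz(0.91, 44100)  # documented soxr default ratio
swr_default = cutoff_hz(0.97, 44100)   # documented swr default ratio
print(soxr_default, swr_default)
```

<p>Under that assumption both defaults place the cutoff slightly above 20kHz at a 44100 sample rate, in line with the parenthetical above.
</p>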
</dd>
<dt><samp class="option">precision</samp></dt>
<dd><p>For soxr only, the precision in bits to which the resampled signal will be
calculated. The default value of 20 (which, with suitable dithering, is
appropriate for a destination bit-depth of 16) gives SoX’s ‘High Quality’; a
value of 28 gives SoX’s ‘Very High Quality’.
</p>
</dd>
<dt><samp class="option">cheby</samp></dt>
<dd><p>For soxr only, selects passband rolloff none (Chebyshev) &amp; higher-precision
approximation for ‘irrational’ ratios. Default value is 0.
</p>
</dd>
<dt><samp class="option">async</samp></dt>
<dd><p>For swr only, simple one-parameter audio sync to timestamps using stretching,
squeezing, filling and trimming. Setting this to 1 enables filling and
trimming; larger values represent the maximum amount in samples that the data
may be stretched or squeezed for each second.
Default value is 0, thus no compensation is applied to make the samples match
the audio timestamps.
</p>
</dd>
<dt><samp class="option">first_pts</samp></dt>
<dd><p>For swr only, assume the first pts should be this value. The time unit is 1 / sample rate.
This allows for padding/trimming at the start of stream. By default, no
assumption is made about the first frame’s expected pts, so no padding or
trimming is done. For example, this could be set to 0 to pad the beginning with
silence if an audio stream starts after the video stream or to trim any samples
with a negative pts due to encoder delay.
</p>
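<p>Because the time unit is 1 / sample rate, a start time in seconds has to be converted to a sample count before it is passed as first_pts. A minimal sketch follows; the helper name <code class="code">seconds_to_first_pts</code> is hypothetical, and which sample rate applies is left to the caller because the text above does not say.
</p>

```python
def seconds_to_first_pts(seconds, sample_rate):
    """Express a start time in units of 1 / sample_rate, rounded to a whole sample."""
    return round(seconds * sample_rate)


# A stream expected to start at 0 s and one expected to start half a second in.
start_at_zero = seconds_to_first_pts(0.0, 48000)
start_half_second = seconds_to_first_pts(0.5, 48000)
print(f"aresample=first_pts={start_at_zero}")
print(f"aresample=first_pts={start_half_second}")
```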
</dd>
<dt><samp class="option">min_comp</samp></dt>
<dd><p>For swr only, set the minimum difference between timestamps and audio data (in
seconds) to trigger stretching/squeezing/filling or trimming of the
data to make it match the timestamps. By default,
stretching/squeezing/filling and trimming are disabled
(<samp class="option">min_comp</samp> = <code class="code">FLT_MAX</code>).
</p>
</dd>
<dt><samp class="option">min_hard_comp</samp></dt>
<dd><p>For swr only, set the minimum difference between timestamps and audio data (in
seconds) to trigger adding/dropping samples to make it match the
timestamps. This option is effectively a threshold to select between
hard (trim/fill) and soft (squeeze/stretch) compensation. Note that
all compensation is by default disabled through <samp class="option">min_comp</samp>.
The default is 0.1.
</p>
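<p>The interplay between <samp class="option">min_comp</samp> and <samp class="option">min_hard_comp</samp> can be pictured as a two-threshold decision. The Python sketch below illustrates that reading only; the function name <code class="code">compensation_mode</code> is hypothetical, and the use of strict comparisons is an assumption.
</p>

```python
# Largest finite 32-bit float, standing in for the documented FLT_MAX default.
FLT_MAX = 3.4028234663852886e38


def compensation_mode(delta_seconds, min_comp=FLT_MAX, min_hard_comp=0.1):
    """Classify a timestamp/audio difference as 'none', 'soft' or 'hard'."""
    drift = abs(delta_seconds)
    if drift > min_comp:
        # Above min_hard_comp: add or drop samples (trim/fill).
        # Otherwise: squeeze or stretch the data.
        return "hard" if drift > min_hard_comp else "soft"
    # At or below min_comp nothing is triggered; with the default this is always the case.
    return "none"


print(compensation_mode(0.5))
print(compensation_mode(0.05, min_comp=0.001))
print(compensation_mode(0.5, min_comp=0.001))
```

<p>With the defaults every difference maps to no compensation, which matches the note that compensation is disabled until <samp class="option">min_comp</samp> is lowered.
</p>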
</dd>
<dt><samp class="option">comp_duration</samp></dt>
<dd><p>For swr only, set the duration (in seconds) over which data is stretched/squeezed
to make it match the timestamps. Must be a non-negative double float value.
Default value is 1.0.
</p>
</dd>
<dt><samp class="option">max_soft_comp</samp></dt>
<dd><p>For swr only, set the maximum factor by which data is stretched/squeezed to make it
match the timestamps. Must be a non-negative double float value. Default value
is 0.
</p>
</dd>
<dt><samp class="option">matrix_encoding</samp></dt>
<dd><p>Select matrixed stereo encoding.
</p>
<p>It accepts the following values:
</p><dl class="table">
<dt>‘<samp class="samp">none</samp>’</dt>
<dd><p>select none
</p></dd>
<dt>‘<samp class="samp">dolby</samp>’</dt>
<dd><p>select Dolby
</p></dd>
<dt>‘<samp class="samp">dplii</samp>’</dt>
<dd><p>select Dolby Pro Logic II
</p></dd>
</dl>

<p>Default value is <code class="code">none</code>.
</p>
</dd>
<dt><samp class="option">filter_type</samp></dt>
<dd><p>For swr only, select resampling filter type. This only affects resampling
operations.
</p>
<p>It accepts the following values:
</p><dl class="table">
<dt>‘<samp class="samp">cubic</samp>’</dt>
<dd><p>select cubic
</p></dd>
<dt>‘<samp class="samp">blackman_nuttall</samp>’</dt>
<dd><p>select Blackman Nuttall windowed sinc
</p></dd>
<dt>‘<samp class="samp">kaiser</samp>’</dt>
<dd><p>select Kaiser windowed sinc
</p></dd>
</dl>

</dd>
<dt><samp class="option">kaiser_beta</samp></dt>
<dd><p>For swr only, set the Kaiser window beta value. Must be a double float value in the
interval [2,16]. Default value is 9.
</p>
</dd>
<dt><samp class="option">output_sample_bits</samp></dt>
<dd><p>For swr only, set the number of used output sample bits for dithering. Must be an integer in the
interval [0,64]. Default value is 0, which means it’s not used.
</p>
</dd>
</dl>


</div>
<div class="chapter-level-extent" id="See-Also">
<h2 class="chapter">3 See Also</h2>

<p><a class="url" href="ffmpeg.html">ffmpeg</a>, <a class="url" href="ffplay.html">ffplay</a>, <a class="url" href="ffprobe.html">ffprobe</a>,
<a class="url" href="libswresample.html">libswresample</a>
</p>

</div>
<div class="chapter-level-extent" id="Authors">
<h2 class="chapter">4 Authors</h2>

<p>The FFmpeg developers.
</p>
<p>For details about the authorship, see the Git history of the project
(https://git.ffmpeg.org/ffmpeg), e.g. by typing the command
<code class="command">git log</code> in the FFmpeg source directory, or browsing the
online repository at <a class="url" href="https://git.ffmpeg.org/ffmpeg">https://git.ffmpeg.org/ffmpeg</a>.
</p>
<p>Maintainers for the specific components are listed in the file
<samp class="file">MAINTAINERS</samp> in the source code tree.
</p>

</div>
</div>



</body>
</html>