You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

16835 lines
453KB

  1. @chapter Filtering Introduction
  2. @c man begin FILTERING INTRODUCTION
  3. Filtering in FFmpeg is enabled through the libavfilter library.
  4. In libavfilter, a filter can have multiple inputs and multiple
  5. outputs.
  6. To illustrate the sorts of things that are possible, we consider the
  7. following filtergraph.
  8. @verbatim
  9. [main]
  10. input --> split ---------------------> overlay --> output
  11. | ^
  12. |[tmp] [flip]|
  13. +-----> crop --> vflip -------+
  14. @end verbatim
  15. This filtergraph splits the input stream in two streams, then sends one
  16. stream through the crop filter and the vflip filter, before merging it
  17. back with the other stream by overlaying it on top. You can use the
  18. following command to achieve this:
  19. @example
  20. ffmpeg -i INPUT -vf "split [main][tmp]; [tmp] crop=iw:ih/2:0:0, vflip [flip]; [main][flip] overlay=0:H/2" OUTPUT
  21. @end example
  22. The result will be that the top half of the video is mirrored
  23. onto the bottom half of the output video.
  24. Filters in the same linear chain are separated by commas, and distinct
  25. linear chains of filters are separated by semicolons. In our example,
  26. @var{crop,vflip} are in one linear chain, @var{split} and
  27. @var{overlay} are separately in another. The points where the linear
  28. chains join are labelled by names enclosed in square brackets. In the
  29. example, the split filter generates two outputs that are associated to
  30. the labels @var{[main]} and @var{[tmp]}.
  31. The stream sent to the second output of @var{split}, labelled as
  32. @var{[tmp]}, is processed through the @var{crop} filter, which crops
  33. away the lower half part of the video, and then vertically flipped. The
  34. @var{overlay} filter takes in input the first unchanged output of the
  35. split filter (which was labelled as @var{[main]}), and overlay on its
  36. lower half the output generated by the @var{crop,vflip} filterchain.
  37. Some filters take in input a list of parameters: they are specified
  38. after the filter name and an equal sign, and are separated from each other
  39. by a colon.
  40. There exist so-called @var{source filters} that do not have an
  41. audio/video input, and @var{sink filters} that will not have audio/video
  42. output.
  43. @c man end FILTERING INTRODUCTION
  44. @chapter graph2dot
  45. @c man begin GRAPH2DOT
  46. The @file{graph2dot} program included in the FFmpeg @file{tools}
  47. directory can be used to parse a filtergraph description and issue a
  48. corresponding textual representation in the dot language.
  49. Invoke the command:
  50. @example
  51. graph2dot -h
  52. @end example
  53. to see how to use @file{graph2dot}.
  54. You can then pass the dot description to the @file{dot} program (from
  55. the graphviz suite of programs) and obtain a graphical representation
  56. of the filtergraph.
  57. For example the sequence of commands:
  58. @example
  59. echo @var{GRAPH_DESCRIPTION} | \
  60. tools/graph2dot -o graph.tmp && \
  61. dot -Tpng graph.tmp -o graph.png && \
  62. display graph.png
  63. @end example
  64. can be used to create and display an image representing the graph
  65. described by the @var{GRAPH_DESCRIPTION} string. Note that this string must be
  66. a complete self-contained graph, with its inputs and outputs explicitly defined.
  67. For example if your command line is of the form:
  68. @example
  69. ffmpeg -i infile -vf scale=640:360 outfile
  70. @end example
  71. your @var{GRAPH_DESCRIPTION} string will need to be of the form:
  72. @example
  73. nullsrc,scale=640:360,nullsink
  74. @end example
  75. you may also need to set the @var{nullsrc} parameters and add a @var{format}
  76. filter in order to simulate a specific input file.
  77. @c man end GRAPH2DOT
  78. @chapter Filtergraph description
  79. @c man begin FILTERGRAPH DESCRIPTION
  80. A filtergraph is a directed graph of connected filters. It can contain
  81. cycles, and there can be multiple links between a pair of
  82. filters. Each link has one input pad on one side connecting it to one
  83. filter from which it takes its input, and one output pad on the other
  84. side connecting it to one filter accepting its output.
  85. Each filter in a filtergraph is an instance of a filter class
  86. registered in the application, which defines the features and the
  87. number of input and output pads of the filter.
  88. A filter with no input pads is called a "source", and a filter with no
  89. output pads is called a "sink".
  90. @anchor{Filtergraph syntax}
  91. @section Filtergraph syntax
  92. A filtergraph has a textual representation, which is recognized by the
  93. @option{-filter}/@option{-vf}/@option{-af} and
  94. @option{-filter_complex} options in @command{ffmpeg} and
  95. @option{-vf}/@option{-af} in @command{ffplay}, and by the
  96. @code{avfilter_graph_parse_ptr()} function defined in
  97. @file{libavfilter/avfilter.h}.
  98. A filterchain consists of a sequence of connected filters, each one
  99. connected to the previous one in the sequence. A filterchain is
  100. represented by a list of ","-separated filter descriptions.
  101. A filtergraph consists of a sequence of filterchains. A sequence of
  102. filterchains is represented by a list of ";"-separated filterchain
  103. descriptions.
  104. A filter is represented by a string of the form:
  105. [@var{in_link_1}]...[@var{in_link_N}]@var{filter_name}=@var{arguments}[@var{out_link_1}]...[@var{out_link_M}]
  106. @var{filter_name} is the name of the filter class of which the
  107. described filter is an instance of, and has to be the name of one of
  108. the filter classes registered in the program.
  109. The name of the filter class is optionally followed by a string
  110. "=@var{arguments}".
  111. @var{arguments} is a string which contains the parameters used to
  112. initialize the filter instance. It may have one of two forms:
  113. @itemize
  114. @item
  115. A ':'-separated list of @var{key=value} pairs.
  116. @item
  117. A ':'-separated list of @var{value}. In this case, the keys are assumed to be
  118. the option names in the order they are declared. E.g. the @code{fade} filter
  119. declares three options in this order -- @option{type}, @option{start_frame} and
  120. @option{nb_frames}. Then the parameter list @var{in:0:30} means that the value
  121. @var{in} is assigned to the option @option{type}, @var{0} to
  122. @option{start_frame} and @var{30} to @option{nb_frames}.
  123. @item
  124. A ':'-separated list of mixed direct @var{value} and long @var{key=value}
  125. pairs. The direct @var{value} must precede the @var{key=value} pairs, and
  126. follow the same constraints order of the previous point. The following
  127. @var{key=value} pairs can be set in any preferred order.
  128. @end itemize
  129. If the option value itself is a list of items (e.g. the @code{format} filter
  130. takes a list of pixel formats), the items in the list are usually separated by
  131. @samp{|}.
  132. The list of arguments can be quoted using the character @samp{'} as initial
  133. and ending mark, and the character @samp{\} for escaping the characters
  134. within the quoted text; otherwise the argument string is considered
  135. terminated when the next special character (belonging to the set
  136. @samp{[]=;,}) is encountered.
  137. The name and arguments of the filter are optionally preceded and
  138. followed by a list of link labels.
  139. A link label allows one to name a link and associate it to a filter output
  140. or input pad. The preceding labels @var{in_link_1}
  141. ... @var{in_link_N}, are associated to the filter input pads,
  142. the following labels @var{out_link_1} ... @var{out_link_M}, are
  143. associated to the output pads.
  144. When two link labels with the same name are found in the
  145. filtergraph, a link between the corresponding input and output pad is
  146. created.
  147. If an output pad is not labelled, it is linked by default to the first
  148. unlabelled input pad of the next filter in the filterchain.
  149. For example in the filterchain
  150. @example
  151. nullsrc, split[L1], [L2]overlay, nullsink
  152. @end example
  153. the split filter instance has two output pads, and the overlay filter
  154. instance two input pads. The first output pad of split is labelled
  155. "L1", the first input pad of overlay is labelled "L2", and the second
  156. output pad of split is linked to the second input pad of overlay,
  157. which are both unlabelled.
  158. In a filter description, if the input label of the first filter is not
  159. specified, "in" is assumed; if the output label of the last filter is not
  160. specified, "out" is assumed.
  161. In a complete filterchain all the unlabelled filter input and output
  162. pads must be connected. A filtergraph is considered valid if all the
  163. filter input and output pads of all the filterchains are connected.
  164. Libavfilter will automatically insert @ref{scale} filters where format
  165. conversion is required. It is possible to specify swscale flags
  166. for those automatically inserted scalers by prepending
  167. @code{sws_flags=@var{flags};}
  168. to the filtergraph description.
  169. Here is a BNF description of the filtergraph syntax:
  170. @example
  171. @var{NAME} ::= sequence of alphanumeric characters and '_'
  172. @var{LINKLABEL} ::= "[" @var{NAME} "]"
  173. @var{LINKLABELS} ::= @var{LINKLABEL} [@var{LINKLABELS}]
  174. @var{FILTER_ARGUMENTS} ::= sequence of chars (possibly quoted)
  175. @var{FILTER} ::= [@var{LINKLABELS}] @var{NAME} ["=" @var{FILTER_ARGUMENTS}] [@var{LINKLABELS}]
  176. @var{FILTERCHAIN} ::= @var{FILTER} [,@var{FILTERCHAIN}]
  177. @var{FILTERGRAPH} ::= [sws_flags=@var{flags};] @var{FILTERCHAIN} [;@var{FILTERGRAPH}]
  178. @end example
  179. @section Notes on filtergraph escaping
  180. Filtergraph description composition entails several levels of
  181. escaping. See @ref{quoting_and_escaping,,the "Quoting and escaping"
  182. section in the ffmpeg-utils(1) manual,ffmpeg-utils} for more
  183. information about the employed escaping procedure.
  184. A first level escaping affects the content of each filter option
  185. value, which may contain the special character @code{:} used to
  186. separate values, or one of the escaping characters @code{\'}.
  187. A second level escaping affects the whole filter description, which
  188. may contain the escaping characters @code{\'} or the special
  189. characters @code{[],;} used by the filtergraph description.
  190. Finally, when you specify a filtergraph on a shell commandline, you
  191. need to perform a third level escaping for the shell special
  192. characters contained within it.
  193. For example, consider the following string to be embedded in
  194. the @ref{drawtext} filter description @option{text} value:
  195. @example
  196. this is a 'string': may contain one, or more, special characters
  197. @end example
  198. This string contains the @code{'} special escaping character, and the
  199. @code{:} special character, so it needs to be escaped in this way:
  200. @example
  201. text=this is a \'string\'\: may contain one, or more, special characters
  202. @end example
  203. A second level of escaping is required when embedding the filter
  204. description in a filtergraph description, in order to escape all the
  205. filtergraph special characters. Thus the example above becomes:
  206. @example
  207. drawtext=text=this is a \\\'string\\\'\\: may contain one\, or more\, special characters
  208. @end example
  209. (note that in addition to the @code{\'} escaping special characters,
  210. also @code{,} needs to be escaped).
  211. Finally an additional level of escaping is needed when writing the
  212. filtergraph description in a shell command, which depends on the
  213. escaping rules of the adopted shell. For example, assuming that
  214. @code{\} is special and needs to be escaped with another @code{\}, the
  215. previous string will finally result in:
  216. @example
  217. -vf "drawtext=text=this is a \\\\\\'string\\\\\\'\\\\: may contain one\\, or more\\, special characters"
  218. @end example
  219. @chapter Timeline editing
  220. Some filters support a generic @option{enable} option. For the filters
  221. supporting timeline editing, this option can be set to an expression which is
  222. evaluated before sending a frame to the filter. If the evaluation is non-zero,
  223. the filter will be enabled, otherwise the frame will be sent unchanged to the
  224. next filter in the filtergraph.
  225. The expression accepts the following values:
  226. @table @samp
  227. @item t
  228. timestamp expressed in seconds, NAN if the input timestamp is unknown
  229. @item n
  230. sequential number of the input frame, starting from 0
  231. @item pos
  232. the position in the file of the input frame, NAN if unknown
  233. @item w
  234. @item h
  235. width and height of the input frame if video
  236. @end table
  237. Additionally, these filters support an @option{enable} command that can be used
  238. to re-define the expression.
  239. Like any other filtering option, the @option{enable} option follows the same
  240. rules.
  241. For example, to enable a blur filter (@ref{smartblur}) from 10 seconds to 3
  242. minutes, and a @ref{curves} filter starting at 3 seconds:
  243. @example
  244. smartblur = enable='between(t,10,3*60)',
  245. curves = enable='gte(t,3)' : preset=cross_process
  246. @end example
  247. @c man end FILTERGRAPH DESCRIPTION
  248. @chapter Audio Filters
  249. @c man begin AUDIO FILTERS
  250. When you configure your FFmpeg build, you can disable any of the
  251. existing filters using @code{--disable-filters}.
  252. The configure output will show the audio filters included in your
  253. build.
  254. Below is a description of the currently available audio filters.
  255. @section acompressor
  256. A compressor is mainly used to reduce the dynamic range of a signal.
  257. Especially modern music is mostly compressed at a high ratio to
  258. improve the overall loudness. It's done to get the highest attention
  259. of a listener, "fatten" the sound and bring more "power" to the track.
  260. If a signal is compressed too much it may sound dull or "dead"
  261. afterwards or it may start to "pump" (which could be a powerful effect
  262. but can also destroy a track completely).
  263. The right compression is the key to reach a professional sound and is
  264. the high art of mixing and mastering. Because of its complex settings
  265. it may take a long time to get the right feeling for this kind of effect.
  266. Compression is done by detecting the volume above a chosen level
  267. @code{threshold} and dividing it by the factor set with @code{ratio}.
  268. So if you set the threshold to -12dB and your signal reaches -6dB a ratio
  269. of 2:1 will result in a signal at -9dB. Because an exact manipulation of
  270. the signal would cause distortion of the waveform the reduction can be
  271. levelled over the time. This is done by setting "Attack" and "Release".
  272. @code{attack} determines how long the signal has to rise above the threshold
  273. before any reduction will occur and @code{release} sets the time the signal
  274. has to fall below the threshold to reduce the reduction again. Shorter signals
  275. than the chosen attack time will be left untouched.
  276. The overall reduction of the signal can be made up afterwards with the
  277. @code{makeup} setting. So compressing the peaks of a signal about 6dB and
  278. raising the makeup to this level results in a signal twice as loud than the
  279. source. To gain a softer entry in the compression the @code{knee} flattens the
  280. hard edge at the threshold in the range of the chosen decibels.
  281. The filter accepts the following options:
  282. @table @option
  283. @item level_in
  284. Set input gain. Default is 1. Range is between 0.015625 and 64.
  285. @item threshold
  286. If a signal of second stream rises above this level it will affect the gain
  287. reduction of the first stream.
  288. By default it is 0.125. Range is between 0.00097563 and 1.
  289. @item ratio
  290. Set a ratio by which the signal is reduced. 1:2 means that if the level
  291. rose 4dB above the threshold, it will be only 2dB above after the reduction.
  292. Default is 2. Range is between 1 and 20.
  293. @item attack
  294. Amount of milliseconds the signal has to rise above the threshold before gain
  295. reduction starts. Default is 20. Range is between 0.01 and 2000.
  296. @item release
  297. Amount of milliseconds the signal has to fall below the threshold before
  298. reduction is decreased again. Default is 250. Range is between 0.01 and 9000.
  299. @item makeup
  300. Set the amount by how much signal will be amplified after processing.
  301. Default is 2. Range is from 1 and 64.
  302. @item knee
  303. Curve the sharp knee around the threshold to enter gain reduction more softly.
  304. Default is 2.82843. Range is between 1 and 8.
  305. @item link
  306. Choose if the @code{average} level between all channels of input stream
  307. or the louder(@code{maximum}) channel of input stream affects the
  308. reduction. Default is @code{average}.
  309. @item detection
  310. Should the exact signal be taken in case of @code{peak} or an RMS one in case
  311. of @code{rms}. Default is @code{rms} which is mostly smoother.
  312. @item mix
  313. How much to use compressed signal in output. Default is 1.
  314. Range is between 0 and 1.
  315. @end table
  316. @section acrossfade
  317. Apply cross fade from one input audio stream to another input audio stream.
  318. The cross fade is applied for specified duration near the end of first stream.
  319. The filter accepts the following options:
  320. @table @option
  321. @item nb_samples, ns
  322. Specify the number of samples for which the cross fade effect has to last.
  323. At the end of the cross fade effect the first input audio will be completely
  324. silent. Default is 44100.
  325. @item duration, d
  326. Specify the duration of the cross fade effect. See
  327. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  328. for the accepted syntax.
  329. By default the duration is determined by @var{nb_samples}.
  330. If set this option is used instead of @var{nb_samples}.
  331. @item overlap, o
  332. Should first stream end overlap with second stream start. Default is enabled.
  333. @item curve1
  334. Set curve for cross fade transition for first stream.
  335. @item curve2
  336. Set curve for cross fade transition for second stream.
  337. For description of available curve types see @ref{afade} filter description.
  338. @end table
  339. @subsection Examples
  340. @itemize
  341. @item
  342. Cross fade from one input to another:
  343. @example
  344. ffmpeg -i first.flac -i second.flac -filter_complex acrossfade=d=10:c1=exp:c2=exp output.flac
  345. @end example
  346. @item
  347. Cross fade from one input to another but without overlapping:
  348. @example
  349. ffmpeg -i first.flac -i second.flac -filter_complex acrossfade=d=10:o=0:c1=exp:c2=exp output.flac
  350. @end example
  351. @end itemize
  352. @section adelay
  353. Delay one or more audio channels.
  354. Samples in delayed channel are filled with silence.
  355. The filter accepts the following option:
  356. @table @option
  357. @item delays
  358. Set list of delays in milliseconds for each channel separated by '|'.
  359. At least one delay greater than 0 should be provided.
  360. Unused delays will be silently ignored. If number of given delays is
  361. smaller than number of channels all remaining channels will not be delayed.
  362. @end table
  363. @subsection Examples
  364. @itemize
  365. @item
  366. Delay first channel by 1.5 seconds, the third channel by 0.5 seconds and leave
  367. the second channel (and any other channels that may be present) unchanged.
  368. @example
  369. adelay=1500|0|500
  370. @end example
  371. @end itemize
  372. @section aecho
  373. Apply echoing to the input audio.
  374. Echoes are reflected sound and can occur naturally amongst mountains
  375. (and sometimes large buildings) when talking or shouting; digital echo
  376. effects emulate this behaviour and are often used to help fill out the
  377. sound of a single instrument or vocal. The time difference between the
  378. original signal and the reflection is the @code{delay}, and the
  379. loudness of the reflected signal is the @code{decay}.
  380. Multiple echoes can have different delays and decays.
  381. A description of the accepted parameters follows.
  382. @table @option
  383. @item in_gain
  384. Set input gain of reflected signal. Default is @code{0.6}.
  385. @item out_gain
  386. Set output gain of reflected signal. Default is @code{0.3}.
  387. @item delays
  388. Set list of time intervals in milliseconds between original signal and reflections
  389. separated by '|'. Allowed range for each @code{delay} is @code{(0 - 90000.0]}.
  390. Default is @code{1000}.
  391. @item decays
  392. Set list of loudnesses of reflected signals separated by '|'.
  393. Allowed range for each @code{decay} is @code{(0 - 1.0]}.
  394. Default is @code{0.5}.
  395. @end table
  396. @subsection Examples
  397. @itemize
  398. @item
  399. Make it sound as if there are twice as many instruments as are actually playing:
  400. @example
  401. aecho=0.8:0.88:60:0.4
  402. @end example
  403. @item
  404. If delay is very short, then it sound like a (metallic) robot playing music:
  405. @example
  406. aecho=0.8:0.88:6:0.4
  407. @end example
  408. @item
  409. A longer delay will sound like an open air concert in the mountains:
  410. @example
  411. aecho=0.8:0.9:1000:0.3
  412. @end example
  413. @item
  414. Same as above but with one more mountain:
  415. @example
  416. aecho=0.8:0.9:1000|1800:0.3|0.25
  417. @end example
  418. @end itemize
  419. @section aemphasis
  420. Audio emphasis filter creates or restores material directly taken from LPs or
  421. emphased CDs with different filter curves. E.g. to store music on vinyl the
  422. signal has to be altered by a filter first to even out the disadvantages of
  423. this recording medium.
  424. Once the material is played back the inverse filter has to be applied to
  425. restore the distortion of the frequency response.
  426. The filter accepts the following options:
  427. @table @option
  428. @item level_in
  429. Set input gain.
  430. @item level_out
  431. Set output gain.
  432. @item mode
  433. Set filter mode. For restoring material use @code{reproduction} mode, otherwise
  434. use @code{production} mode. Default is @code{reproduction} mode.
  435. @item type
  436. Set filter type. Selects medium. Can be one of the following:
  437. @table @option
  438. @item col
  439. select Columbia.
  440. @item emi
  441. select EMI.
  442. @item bsi
  443. select BSI (78RPM).
  444. @item riaa
  445. select RIAA.
  446. @item cd
  447. select Compact Disc (CD).
  448. @item 50fm
  449. select 50µs (FM).
  450. @item 75fm
  451. select 75µs (FM).
  452. @item 50kf
  453. select 50µs (FM-KF).
  454. @item 75kf
  455. select 75µs (FM-KF).
  456. @end table
  457. @end table
  458. @section aeval
  459. Modify an audio signal according to the specified expressions.
  460. This filter accepts one or more expressions (one for each channel),
  461. which are evaluated and used to modify a corresponding audio signal.
  462. It accepts the following parameters:
  463. @table @option
  464. @item exprs
  465. Set the '|'-separated expressions list for each separate channel. If
  466. the number of input channels is greater than the number of
  467. expressions, the last specified expression is used for the remaining
  468. output channels.
  469. @item channel_layout, c
  470. Set output channel layout. If not specified, the channel layout is
  471. specified by the number of expressions. If set to @samp{same}, it will
  472. use by default the same input channel layout.
  473. @end table
  474. Each expression in @var{exprs} can contain the following constants and functions:
  475. @table @option
  476. @item ch
  477. channel number of the current expression
  478. @item n
  479. number of the evaluated sample, starting from 0
  480. @item s
  481. sample rate
  482. @item t
  483. time of the evaluated sample expressed in seconds
  484. @item nb_in_channels
  485. @item nb_out_channels
  486. input and output number of channels
  487. @item val(CH)
  488. the value of input channel with number @var{CH}
  489. @end table
  490. Note: this filter is slow. For faster processing you should use a
  491. dedicated filter.
  492. @subsection Examples
  493. @itemize
  494. @item
  495. Half volume:
  496. @example
  497. aeval=val(ch)/2:c=same
  498. @end example
  499. @item
  500. Invert phase of the second channel:
  501. @example
  502. aeval=val(0)|-val(1)
  503. @end example
  504. @end itemize
  505. @anchor{afade}
  506. @section afade
  507. Apply fade-in/out effect to input audio.
  508. A description of the accepted parameters follows.
  509. @table @option
  510. @item type, t
  511. Specify the effect type, can be either @code{in} for fade-in, or
  512. @code{out} for a fade-out effect. Default is @code{in}.
  513. @item start_sample, ss
  514. Specify the number of the start sample for starting to apply the fade
  515. effect. Default is 0.
  516. @item nb_samples, ns
  517. Specify the number of samples for which the fade effect has to last. At
  518. the end of the fade-in effect the output audio will have the same
  519. volume as the input audio, at the end of the fade-out transition
  520. the output audio will be silence. Default is 44100.
  521. @item start_time, st
  522. Specify the start time of the fade effect. Default is 0.
  523. The value must be specified as a time duration; see
  524. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  525. for the accepted syntax.
  526. If set this option is used instead of @var{start_sample}.
  527. @item duration, d
  528. Specify the duration of the fade effect. See
  529. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  530. for the accepted syntax.
  531. At the end of the fade-in effect the output audio will have the same
  532. volume as the input audio, at the end of the fade-out transition
  533. the output audio will be silence.
  534. By default the duration is determined by @var{nb_samples}.
  535. If set this option is used instead of @var{nb_samples}.
  536. @item curve
  537. Set curve for fade transition.
  538. It accepts the following values:
  539. @table @option
  540. @item tri
  541. select triangular, linear slope (default)
  542. @item qsin
  543. select quarter of sine wave
  544. @item hsin
  545. select half of sine wave
  546. @item esin
  547. select exponential sine wave
  548. @item log
  549. select logarithmic
  550. @item ipar
  551. select inverted parabola
  552. @item qua
  553. select quadratic
  554. @item cub
  555. select cubic
  556. @item squ
  557. select square root
  558. @item cbr
  559. select cubic root
  560. @item par
  561. select parabola
  562. @item exp
  563. select exponential
  564. @item iqsin
  565. select inverted quarter of sine wave
  566. @item ihsin
  567. select inverted half of sine wave
  568. @item dese
  569. select double-exponential seat
  570. @item desi
  571. select double-exponential sigmoid
  572. @end table
  573. @end table
  574. @subsection Examples
  575. @itemize
  576. @item
  577. Fade in first 15 seconds of audio:
  578. @example
  579. afade=t=in:ss=0:d=15
  580. @end example
  581. @item
  582. Fade out last 25 seconds of a 900 seconds audio:
  583. @example
  584. afade=t=out:st=875:d=25
  585. @end example
  586. @end itemize
  587. @section afftfilt
  588. Apply arbitrary expressions to samples in frequency domain.
  589. @table @option
  590. @item real
  591. Set frequency domain real expression for each separate channel separated
  592. by '|'. Default is "1".
  593. If the number of input channels is greater than the number of
  594. expressions, the last specified expression is used for the remaining
  595. output channels.
  596. @item imag
  597. Set frequency domain imaginary expression for each separate channel
  598. separated by '|'. If not set, @var{real} option is used.
  599. Each expression in @var{real} and @var{imag} can contain the following
  600. constants:
  601. @table @option
  602. @item sr
  603. sample rate
  604. @item b
  605. current frequency bin number
  606. @item nb
  607. number of available bins
  608. @item ch
  609. channel number of the current expression
  610. @item chs
  611. number of channels
  612. @item pts
  613. current frame pts
  614. @end table
  615. @item win_size
  616. Set window size.
  617. It accepts the following values:
  618. @table @samp
  619. @item w16
  620. @item w32
  621. @item w64
  622. @item w128
  623. @item w256
  624. @item w512
  625. @item w1024
  626. @item w2048
  627. @item w4096
  628. @item w8192
  629. @item w16384
  630. @item w32768
  631. @item w65536
  632. @end table
  633. Default is @code{w4096}
  634. @item win_func
  635. Set window function. Default is @code{hann}.
  636. @item overlap
  637. Set window overlap. If set to 1, the recommended overlap for selected
  638. window function will be picked. Default is @code{0.75}.
  639. @end table
  640. @subsection Examples
  641. @itemize
  642. @item
  643. Leave almost only low frequencies in audio:
  644. @example
  645. afftfilt="1-clip((b/nb)*b,0,1)"
  646. @end example
  647. @end itemize
  648. @anchor{aformat}
  649. @section aformat
  650. Set output format constraints for the input audio. The framework will
  651. negotiate the most appropriate format to minimize conversions.
  652. It accepts the following parameters:
  653. @table @option
  654. @item sample_fmts
  655. A '|'-separated list of requested sample formats.
  656. @item sample_rates
  657. A '|'-separated list of requested sample rates.
  658. @item channel_layouts
  659. A '|'-separated list of requested channel layouts.
  660. See @ref{channel layout syntax,,the Channel Layout section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  661. for the required syntax.
  662. @end table
  663. If a parameter is omitted, all values are allowed.
  664. Force the output to either unsigned 8-bit or signed 16-bit stereo
  665. @example
  666. aformat=sample_fmts=u8|s16:channel_layouts=stereo
  667. @end example
  668. @section agate
  669. A gate is mainly used to reduce lower parts of a signal. This kind of signal
  670. processing reduces disturbing noise between useful signals.
  671. Gating is done by detecting the volume below a chosen level @var{threshold}
  672. and divide it by the factor set with @var{ratio}. The bottom of the noise
  673. floor is set via @var{range}. Because an exact manipulation of the signal
  674. would cause distortion of the waveform the reduction can be levelled over
  675. time. This is done by setting @var{attack} and @var{release}.
  676. @var{attack} determines how long the signal has to fall below the threshold
  677. before any reduction will occur and @var{release} sets the time the signal
  678. has to raise above the threshold to reduce the reduction again.
  679. Shorter signals than the chosen attack time will be left untouched.
  680. @table @option
  681. @item level_in
  682. Set input level before filtering.
  683. Default is 1. Allowed range is from 0.015625 to 64.
  684. @item range
  685. Set the level of gain reduction when the signal is below the threshold.
  686. Default is 0.06125. Allowed range is from 0 to 1.
  687. @item threshold
  688. If a signal rises above this level the gain reduction is released.
  689. Default is 0.125. Allowed range is from 0 to 1.
  690. @item ratio
  691. Set a ratio about which the signal is reduced.
  692. Default is 2. Allowed range is from 1 to 9000.
  693. @item attack
  694. Amount of milliseconds the signal has to rise above the threshold before gain
  695. reduction stops.
  696. Default is 20 milliseconds. Allowed range is from 0.01 to 9000.
  697. @item release
  698. Amount of milliseconds the signal has to fall below the threshold before the
  699. reduction is increased again. Default is 250 milliseconds.
  700. Allowed range is from 0.01 to 9000.
  701. @item makeup
  702. Set amount of amplification of signal after processing.
  703. Default is 1. Allowed range is from 1 to 64.
  704. @item knee
  705. Curve the sharp knee around the threshold to enter gain reduction more softly.
  706. Default is 2.828427125. Allowed range is from 1 to 8.
  707. @item detection
  708. Choose if exact signal should be taken for detection or an RMS like one.
  709. Default is rms. Can be peak or rms.
  710. @item link
  711. Choose if the average level between all channels or the louder channel affects
  712. the reduction.
  713. Default is average. Can be average or maximum.
  714. @end table
  715. @section alimiter
  716. The limiter prevents input signal from raising over a desired threshold.
  717. This limiter uses lookahead technology to prevent your signal from distorting.
  718. It means that there is a small delay after signal is processed. Keep in mind
  719. that the delay it produces is the attack time you set.
  720. The filter accepts the following options:
  721. @table @option
  722. @item level_in
  723. Set input gain. Default is 1.
  724. @item level_out
  725. Set output gain. Default is 1.
  726. @item limit
  727. Don't let signals above this level pass the limiter. Default is 1.
  728. @item attack
  729. The limiter will reach its attenuation level in this amount of time in
  730. milliseconds. Default is 5 milliseconds.
  731. @item release
  732. Come back from limiting to attenuation 1.0 in this amount of milliseconds.
  733. Default is 50 milliseconds.
  734. @item asc
  735. When gain reduction is always needed ASC takes care of releasing to an
  736. average reduction level rather than reaching a reduction of 0 in the release
  737. time.
  738. @item asc_level
  739. Select how much the release time is affected by ASC, 0 means nearly no changes
  740. in release time while 1 produces higher release times.
  741. @item level
  742. Auto level output signal. Default is enabled.
  743. This normalizes audio back to 0dB if enabled.
  744. @end table
  745. Depending on picked setting it is recommended to upsample input 2x or 4x times
  746. with @ref{aresample} before applying this filter.
  747. @section allpass
  748. Apply a two-pole all-pass filter with central frequency (in Hz)
  749. @var{frequency}, and filter-width @var{width}.
  750. An all-pass filter changes the audio's frequency to phase relationship
  751. without changing its frequency to amplitude relationship.
  752. The filter accepts the following options:
  753. @table @option
  754. @item frequency, f
  755. Set frequency in Hz.
  756. @item width_type
  757. Set method to specify band-width of filter.
  758. @table @option
  759. @item h
  760. Hz
  761. @item q
  762. Q-Factor
  763. @item o
  764. octave
  765. @item s
  766. slope
  767. @end table
  768. @item width, w
  769. Specify the band-width of a filter in width_type units.
  770. @end table
  771. @anchor{amerge}
  772. @section amerge
  773. Merge two or more audio streams into a single multi-channel stream.
  774. The filter accepts the following options:
  775. @table @option
  776. @item inputs
  777. Set the number of inputs. Default is 2.
  778. @end table
  779. If the channel layouts of the inputs are disjoint, and therefore compatible,
  780. the channel layout of the output will be set accordingly and the channels
  781. will be reordered as necessary. If the channel layouts of the inputs are not
  782. disjoint, the output will have all the channels of the first input then all
  783. the channels of the second input, in that order, and the channel layout of
  784. the output will be the default value corresponding to the total number of
  785. channels.
  786. For example, if the first input is in 2.1 (FL+FR+LF) and the second input
  787. is FC+BL+BR, then the output will be in 5.1, with the channels in the
  788. following order: a1, a2, b1, a3, b2, b3 (a1 is the first channel of the
  789. first input, b1 is the first channel of the second input).
  790. On the other hand, if both input are in stereo, the output channels will be
  791. in the default order: a1, a2, b1, b2, and the channel layout will be
  792. arbitrarily set to 4.0, which may or may not be the expected value.
  793. All inputs must have the same sample rate, and format.
  794. If inputs do not have the same duration, the output will stop with the
  795. shortest.
  796. @subsection Examples
  797. @itemize
  798. @item
  799. Merge two mono files into a stereo stream:
  800. @example
  801. amovie=left.wav [l] ; amovie=right.mp3 [r] ; [l] [r] amerge
  802. @end example
  803. @item
  804. Multiple merges assuming 1 video stream and 6 audio streams in @file{input.mkv}:
  805. @example
  806. ffmpeg -i input.mkv -filter_complex "[0:1][0:2][0:3][0:4][0:5][0:6] amerge=inputs=6" -c:a pcm_s16le output.mkv
  807. @end example
  808. @end itemize
  809. @section amix
  810. Mixes multiple audio inputs into a single output.
  811. Note that this filter only supports float samples (the @var{amerge}
  812. and @var{pan} audio filters support many formats). If the @var{amix}
  813. input has integer samples then @ref{aresample} will be automatically
  814. inserted to perform the conversion to float samples.
  815. For example
  816. @example
  817. ffmpeg -i INPUT1 -i INPUT2 -i INPUT3 -filter_complex amix=inputs=3:duration=first:dropout_transition=3 OUTPUT
  818. @end example
  819. will mix 3 input audio streams to a single output with the same duration as the
  820. first input and a dropout transition time of 3 seconds.
  821. It accepts the following parameters:
  822. @table @option
  823. @item inputs
  824. The number of inputs. If unspecified, it defaults to 2.
  825. @item duration
  826. How to determine the end-of-stream.
  827. @table @option
  828. @item longest
  829. The duration of the longest input. (default)
  830. @item shortest
  831. The duration of the shortest input.
  832. @item first
  833. The duration of the first input.
  834. @end table
  835. @item dropout_transition
  836. The transition time, in seconds, for volume renormalization when an input
  837. stream ends. The default value is 2 seconds.
  838. @end table
  839. @section anequalizer
  840. High-order parametric multiband equalizer for each channel.
  841. It accepts the following parameters:
  842. @table @option
  843. @item params
  844. This option string is in format:
  845. "c@var{chn} f=@var{cf} w=@var{w} g=@var{g} t=@var{f} | ..."
  846. Each equalizer band is separated by '|'.
  847. @table @option
  848. @item chn
  849. Set channel number to which equalization will be applied.
  850. If input doesn't have that channel the entry is ignored.
  851. @item cf
  852. Set central frequency for band.
  853. If input doesn't have that frequency the entry is ignored.
  854. @item w
  855. Set band width in hertz.
  856. @item g
  857. Set band gain in dB.
  858. @item f
  859. Set filter type for band, optional, can be:
  860. @table @samp
  861. @item 0
  862. Butterworth, this is default.
  863. @item 1
  864. Chebyshev type 1.
  865. @item 2
  866. Chebyshev type 2.
  867. @end table
  868. @end table
  869. @item curves
  870. With this option activated frequency response of anequalizer is displayed
  871. in video stream.
  872. @item size
  873. Set video stream size. Only useful if curves option is activated.
  874. @item mgain
  875. Set max gain that will be displayed. Only useful if curves option is activated.
  876. Setting this to reasonable value allows to display gain which is derived from
  877. neighbour bands which are too close to each other and thus produce higher gain
  878. when both are activated.
  879. @item fscale
  880. Set frequency scale used to draw frequency response in video output.
  881. Can be linear or logarithmic. Default is logarithmic.
  882. @item colors
  883. Set color for each channel curve which is going to be displayed in video stream.
  884. This is list of color names separated by space or by '|'.
  885. Unrecognised or missing colors will be replaced by white color.
  886. @end table
  887. @subsection Examples
  888. @itemize
  889. @item
  890. Lower gain by 10 of central frequency 200Hz and width 100 Hz
  891. for first 2 channels using Chebyshev type 1 filter:
  892. @example
  893. anequalizer=c0 f=200 w=100 g=-10 t=1|c1 f=200 w=100 g=-10 t=1
  894. @end example
  895. @end itemize
  896. @subsection Commands
  897. This filter supports the following commands:
  898. @table @option
  899. @item change
  900. Alter existing filter parameters.
  901. Syntax for the commands is : "@var{fN}|f=@var{freq}|w=@var{width}|g=@var{gain}"
  902. @var{fN} is existing filter number, starting from 0, if no such filter is available
  903. error is returned.
  904. @var{freq} set new frequency parameter.
  905. @var{width} set new width parameter in herz.
  906. @var{gain} set new gain parameter in dB.
  907. Full filter invocation with asendcmd may look like this:
  908. asendcmd=c='4.0 anequalizer change 0|f=200|w=50|g=1',anequalizer=...
  909. @end table
  910. @section anull
  911. Pass the audio source unchanged to the output.
  912. @section apad
  913. Pad the end of an audio stream with silence.
  914. This can be used together with @command{ffmpeg} @option{-shortest} to
  915. extend audio streams to the same length as the video stream.
  916. A description of the accepted options follows.
  917. @table @option
  918. @item packet_size
  919. Set silence packet size. Default value is 4096.
  920. @item pad_len
  921. Set the number of samples of silence to add to the end. After the
  922. value is reached, the stream is terminated. This option is mutually
  923. exclusive with @option{whole_len}.
  924. @item whole_len
  925. Set the minimum total number of samples in the output audio stream. If
  926. the value is longer than the input audio length, silence is added to
  927. the end, until the value is reached. This option is mutually exclusive
  928. with @option{pad_len}.
  929. @end table
  930. If neither the @option{pad_len} nor the @option{whole_len} option is
  931. set, the filter will add silence to the end of the input stream
  932. indefinitely.
  933. @subsection Examples
  934. @itemize
  935. @item
  936. Add 1024 samples of silence to the end of the input:
  937. @example
  938. apad=pad_len=1024
  939. @end example
  940. @item
  941. Make sure the audio output will contain at least 10000 samples, pad
  942. the input with silence if required:
  943. @example
  944. apad=whole_len=10000
  945. @end example
  946. @item
  947. Use @command{ffmpeg} to pad the audio input with silence, so that the
  948. video stream will always result the shortest and will be converted
  949. until the end in the output file when using the @option{shortest}
  950. option:
  951. @example
  952. ffmpeg -i VIDEO -i AUDIO -filter_complex "[1:0]apad" -shortest OUTPUT
  953. @end example
  954. @end itemize
  955. @section aphaser
  956. Add a phasing effect to the input audio.
  957. A phaser filter creates series of peaks and troughs in the frequency spectrum.
  958. The position of the peaks and troughs are modulated so that they vary over time, creating a sweeping effect.
  959. A description of the accepted parameters follows.
  960. @table @option
  961. @item in_gain
  962. Set input gain. Default is 0.4.
  963. @item out_gain
  964. Set output gain. Default is 0.74
  965. @item delay
  966. Set delay in milliseconds. Default is 3.0.
  967. @item decay
  968. Set decay. Default is 0.4.
  969. @item speed
  970. Set modulation speed in Hz. Default is 0.5.
  971. @item type
  972. Set modulation type. Default is triangular.
  973. It accepts the following values:
  974. @table @samp
  975. @item triangular, t
  976. @item sinusoidal, s
  977. @end table
  978. @end table
  979. @section apulsator
  980. Audio pulsator is something between an autopanner and a tremolo.
  981. But it can produce funny stereo effects as well. Pulsator changes the volume
  982. of the left and right channel based on a LFO (low frequency oscillator) with
  983. different waveforms and shifted phases.
  984. This filter have the ability to define an offset between left and right
  985. channel. An offset of 0 means that both LFO shapes match each other.
  986. The left and right channel are altered equally - a conventional tremolo.
  987. An offset of 50% means that the shape of the right channel is exactly shifted
  988. in phase (or moved backwards about half of the frequency) - pulsator acts as
  989. an autopanner. At 1 both curves match again. Every setting in between moves the
  990. phase shift gapless between all stages and produces some "bypassing" sounds with
  991. sine and triangle waveforms. The more you set the offset near 1 (starting from
  992. the 0.5) the faster the signal passes from the left to the right speaker.
  993. The filter accepts the following options:
  994. @table @option
  995. @item level_in
  996. Set input gain. By default it is 1. Range is [0.015625 - 64].
  997. @item level_out
  998. Set output gain. By default it is 1. Range is [0.015625 - 64].
  999. @item mode
  1000. Set waveform shape the LFO will use. Can be one of: sine, triangle, square,
  1001. sawup or sawdown. Default is sine.
  1002. @item amount
  1003. Set modulation. Define how much of original signal is affected by the LFO.
  1004. @item offset_l
  1005. Set left channel offset. Default is 0. Allowed range is [0 - 1].
  1006. @item offset_r
  1007. Set right channel offset. Default is 0.5. Allowed range is [0 - 1].
  1008. @item width
  1009. Set pulse width. Default is 1. Allowed range is [0 - 2].
  1010. @item timing
  1011. Set possible timing mode. Can be one of: bpm, ms or hz. Default is hz.
  1012. @item bpm
  1013. Set bpm. Default is 120. Allowed range is [30 - 300]. Only used if timing
  1014. is set to bpm.
  1015. @item ms
  1016. Set ms. Default is 500. Allowed range is [10 - 2000]. Only used if timing
  1017. is set to ms.
  1018. @item hz
  1019. Set frequency in Hz. Default is 2. Allowed range is [0.01 - 100]. Only used
  1020. if timing is set to hz.
  1021. @end table
  1022. @anchor{aresample}
  1023. @section aresample
  1024. Resample the input audio to the specified parameters, using the
  1025. libswresample library. If none are specified then the filter will
  1026. automatically convert between its input and output.
  1027. This filter is also able to stretch/squeeze the audio data to make it match
  1028. the timestamps or to inject silence / cut out audio to make it match the
  1029. timestamps, do a combination of both or do neither.
  1030. The filter accepts the syntax
  1031. [@var{sample_rate}:]@var{resampler_options}, where @var{sample_rate}
  1032. expresses a sample rate and @var{resampler_options} is a list of
  1033. @var{key}=@var{value} pairs, separated by ":". See the
  1034. ffmpeg-resampler manual for the complete list of supported options.
  1035. @subsection Examples
  1036. @itemize
  1037. @item
  1038. Resample the input audio to 44100Hz:
  1039. @example
  1040. aresample=44100
  1041. @end example
  1042. @item
  1043. Stretch/squeeze samples to the given timestamps, with a maximum of 1000
  1044. samples per second compensation:
  1045. @example
  1046. aresample=async=1000
  1047. @end example
  1048. @end itemize
  1049. @section asetnsamples
  1050. Set the number of samples per each output audio frame.
  1051. The last output packet may contain a different number of samples, as
  1052. the filter will flush all the remaining samples when the input audio
  1053. signal its end.
  1054. The filter accepts the following options:
  1055. @table @option
  1056. @item nb_out_samples, n
  1057. Set the number of frames per each output audio frame. The number is
  1058. intended as the number of samples @emph{per each channel}.
  1059. Default value is 1024.
  1060. @item pad, p
  1061. If set to 1, the filter will pad the last audio frame with zeroes, so
  1062. that the last frame will contain the same number of samples as the
  1063. previous ones. Default value is 1.
  1064. @end table
  1065. For example, to set the number of per-frame samples to 1234 and
  1066. disable padding for the last frame, use:
  1067. @example
  1068. asetnsamples=n=1234:p=0
  1069. @end example
  1070. @section asetrate
  1071. Set the sample rate without altering the PCM data.
  1072. This will result in a change of speed and pitch.
  1073. The filter accepts the following options:
  1074. @table @option
  1075. @item sample_rate, r
  1076. Set the output sample rate. Default is 44100 Hz.
  1077. @end table
  1078. @section ashowinfo
  1079. Show a line containing various information for each input audio frame.
  1080. The input audio is not modified.
  1081. The shown line contains a sequence of key/value pairs of the form
  1082. @var{key}:@var{value}.
  1083. The following values are shown in the output:
  1084. @table @option
  1085. @item n
  1086. The (sequential) number of the input frame, starting from 0.
  1087. @item pts
  1088. The presentation timestamp of the input frame, in time base units; the time base
  1089. depends on the filter input pad, and is usually 1/@var{sample_rate}.
  1090. @item pts_time
  1091. The presentation timestamp of the input frame in seconds.
  1092. @item pos
  1093. position of the frame in the input stream, -1 if this information in
  1094. unavailable and/or meaningless (for example in case of synthetic audio)
  1095. @item fmt
  1096. The sample format.
  1097. @item chlayout
  1098. The channel layout.
  1099. @item rate
  1100. The sample rate for the audio frame.
  1101. @item nb_samples
  1102. The number of samples (per channel) in the frame.
  1103. @item checksum
  1104. The Adler-32 checksum (printed in hexadecimal) of the audio data. For planar
  1105. audio, the data is treated as if all the planes were concatenated.
  1106. @item plane_checksums
  1107. A list of Adler-32 checksums for each data plane.
  1108. @end table
  1109. @anchor{astats}
  1110. @section astats
  1111. Display time domain statistical information about the audio channels.
  1112. Statistics are calculated and displayed for each audio channel and,
  1113. where applicable, an overall figure is also given.
  1114. It accepts the following option:
  1115. @table @option
  1116. @item length
  1117. Short window length in seconds, used for peak and trough RMS measurement.
  1118. Default is @code{0.05} (50 milliseconds). Allowed range is @code{[0.1 - 10]}.
  1119. @item metadata
  1120. Set metadata injection. All the metadata keys are prefixed with @code{lavfi.astats.X},
  1121. where @code{X} is channel number starting from 1 or string @code{Overall}. Default is
  1122. disabled.
  1123. Available keys for each channel are:
  1124. DC_offset
  1125. Min_level
  1126. Max_level
  1127. Min_difference
  1128. Max_difference
  1129. Mean_difference
  1130. Peak_level
  1131. RMS_peak
  1132. RMS_trough
  1133. Crest_factor
  1134. Flat_factor
  1135. Peak_count
  1136. Bit_depth
  1137. and for Overall:
  1138. DC_offset
  1139. Min_level
  1140. Max_level
  1141. Min_difference
  1142. Max_difference
  1143. Mean_difference
  1144. Peak_level
  1145. RMS_level
  1146. RMS_peak
  1147. RMS_trough
  1148. Flat_factor
  1149. Peak_count
  1150. Bit_depth
  1151. Number_of_samples
  1152. For example full key look like this @code{lavfi.astats.1.DC_offset} or
  1153. this @code{lavfi.astats.Overall.Peak_count}.
  1154. For description what each key means read below.
  1155. @item reset
  1156. Set number of frame after which stats are going to be recalculated.
  1157. Default is disabled.
  1158. @end table
  1159. A description of each shown parameter follows:
  1160. @table @option
  1161. @item DC offset
  1162. Mean amplitude displacement from zero.
  1163. @item Min level
  1164. Minimal sample level.
  1165. @item Max level
  1166. Maximal sample level.
  1167. @item Min difference
  1168. Minimal difference between two consecutive samples.
  1169. @item Max difference
  1170. Maximal difference between two consecutive samples.
  1171. @item Mean difference
  1172. Mean difference between two consecutive samples.
  1173. The average of each difference between two consecutive samples.
  1174. @item Peak level dB
  1175. @item RMS level dB
  1176. Standard peak and RMS level measured in dBFS.
  1177. @item RMS peak dB
  1178. @item RMS trough dB
  1179. Peak and trough values for RMS level measured over a short window.
  1180. @item Crest factor
  1181. Standard ratio of peak to RMS level (note: not in dB).
  1182. @item Flat factor
  1183. Flatness (i.e. consecutive samples with the same value) of the signal at its peak levels
  1184. (i.e. either @var{Min level} or @var{Max level}).
  1185. @item Peak count
  1186. Number of occasions (not the number of samples) that the signal attained either
  1187. @var{Min level} or @var{Max level}.
  1188. @item Bit depth
  1189. Overall bit depth of audio. Number of bits used for each sample.
  1190. @end table
  1191. @section asyncts
  1192. Synchronize audio data with timestamps by squeezing/stretching it and/or
  1193. dropping samples/adding silence when needed.
  1194. This filter is not built by default, please use @ref{aresample} to do squeezing/stretching.
  1195. It accepts the following parameters:
  1196. @table @option
  1197. @item compensate
  1198. Enable stretching/squeezing the data to make it match the timestamps. Disabled
  1199. by default. When disabled, time gaps are covered with silence.
  1200. @item min_delta
  1201. The minimum difference between timestamps and audio data (in seconds) to trigger
  1202. adding/dropping samples. The default value is 0.1. If you get an imperfect
  1203. sync with this filter, try setting this parameter to 0.
  1204. @item max_comp
  1205. The maximum compensation in samples per second. Only relevant with compensate=1.
  1206. The default value is 500.
  1207. @item first_pts
  1208. Assume that the first PTS should be this value. The time base is 1 / sample
  1209. rate. This allows for padding/trimming at the start of the stream. By default,
  1210. no assumption is made about the first frame's expected PTS, so no padding or
  1211. trimming is done. For example, this could be set to 0 to pad the beginning with
  1212. silence if an audio stream starts after the video stream or to trim any samples
  1213. with a negative PTS due to encoder delay.
  1214. @end table
  1215. @section atempo
  1216. Adjust audio tempo.
  1217. The filter accepts exactly one parameter, the audio tempo. If not
  1218. specified then the filter will assume nominal 1.0 tempo. Tempo must
  1219. be in the [0.5, 2.0] range.
  1220. @subsection Examples
  1221. @itemize
  1222. @item
  1223. Slow down audio to 80% tempo:
  1224. @example
  1225. atempo=0.8
  1226. @end example
  1227. @item
  1228. To speed up audio to 125% tempo:
  1229. @example
  1230. atempo=1.25
  1231. @end example
  1232. @end itemize
  1233. @section atrim
  1234. Trim the input so that the output contains one continuous subpart of the input.
  1235. It accepts the following parameters:
  1236. @table @option
  1237. @item start
  1238. Timestamp (in seconds) of the start of the section to keep. I.e. the audio
  1239. sample with the timestamp @var{start} will be the first sample in the output.
  1240. @item end
  1241. Specify time of the first audio sample that will be dropped, i.e. the
  1242. audio sample immediately preceding the one with the timestamp @var{end} will be
  1243. the last sample in the output.
  1244. @item start_pts
  1245. Same as @var{start}, except this option sets the start timestamp in samples
  1246. instead of seconds.
  1247. @item end_pts
  1248. Same as @var{end}, except this option sets the end timestamp in samples instead
  1249. of seconds.
  1250. @item duration
  1251. The maximum duration of the output in seconds.
  1252. @item start_sample
  1253. The number of the first sample that should be output.
  1254. @item end_sample
  1255. The number of the first sample that should be dropped.
  1256. @end table
  1257. @option{start}, @option{end}, and @option{duration} are expressed as time
  1258. duration specifications; see
  1259. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}.
  1260. Note that the first two sets of the start/end options and the @option{duration}
  1261. option look at the frame timestamp, while the _sample options simply count the
  1262. samples that pass through the filter. So start/end_pts and start/end_sample will
  1263. give different results when the timestamps are wrong, inexact or do not start at
  1264. zero. Also note that this filter does not modify the timestamps. If you wish
  1265. to have the output timestamps start at zero, insert the asetpts filter after the
  1266. atrim filter.
  1267. If multiple start or end options are set, this filter tries to be greedy and
  1268. keep all samples that match at least one of the specified constraints. To keep
  1269. only the part that matches all the constraints at once, chain multiple atrim
  1270. filters.
  1271. The defaults are such that all the input is kept. So it is possible to set e.g.
  1272. just the end values to keep everything before the specified time.
  1273. Examples:
  1274. @itemize
  1275. @item
  1276. Drop everything except the second minute of input:
  1277. @example
  1278. ffmpeg -i INPUT -af atrim=60:120
  1279. @end example
  1280. @item
  1281. Keep only the first 1000 samples:
  1282. @example
  1283. ffmpeg -i INPUT -af atrim=end_sample=1000
  1284. @end example
  1285. @end itemize
  1286. @section bandpass
  1287. Apply a two-pole Butterworth band-pass filter with central
  1288. frequency @var{frequency}, and (3dB-point) band-width width.
  1289. The @var{csg} option selects a constant skirt gain (peak gain = Q)
  1290. instead of the default: constant 0dB peak gain.
  1291. The filter roll off at 6dB per octave (20dB per decade).
  1292. The filter accepts the following options:
  1293. @table @option
  1294. @item frequency, f
  1295. Set the filter's central frequency. Default is @code{3000}.
  1296. @item csg
  1297. Constant skirt gain if set to 1. Defaults to 0.
  1298. @item width_type
  1299. Set method to specify band-width of filter.
  1300. @table @option
  1301. @item h
  1302. Hz
  1303. @item q
  1304. Q-Factor
  1305. @item o
  1306. octave
  1307. @item s
  1308. slope
  1309. @end table
  1310. @item width, w
  1311. Specify the band-width of a filter in width_type units.
  1312. @end table
  1313. @section bandreject
  1314. Apply a two-pole Butterworth band-reject filter with central
  1315. frequency @var{frequency}, and (3dB-point) band-width @var{width}.
  1316. The filter roll off at 6dB per octave (20dB per decade).
  1317. The filter accepts the following options:
  1318. @table @option
  1319. @item frequency, f
  1320. Set the filter's central frequency. Default is @code{3000}.
  1321. @item width_type
  1322. Set method to specify band-width of filter.
  1323. @table @option
  1324. @item h
  1325. Hz
  1326. @item q
  1327. Q-Factor
  1328. @item o
  1329. octave
  1330. @item s
  1331. slope
  1332. @end table
  1333. @item width, w
  1334. Specify the band-width of a filter in width_type units.
  1335. @end table
  1336. @section bass
  1337. Boost or cut the bass (lower) frequencies of the audio using a two-pole
  1338. shelving filter with a response similar to that of a standard
  1339. hi-fi's tone-controls. This is also known as shelving equalisation (EQ).
  1340. The filter accepts the following options:
  1341. @table @option
  1342. @item gain, g
  1343. Give the gain at 0 Hz. Its useful range is about -20
  1344. (for a large cut) to +20 (for a large boost).
  1345. Beware of clipping when using a positive gain.
  1346. @item frequency, f
  1347. Set the filter's central frequency and so can be used
  1348. to extend or reduce the frequency range to be boosted or cut.
  1349. The default value is @code{100} Hz.
  1350. @item width_type
  1351. Set method to specify band-width of filter.
  1352. @table @option
  1353. @item h
  1354. Hz
  1355. @item q
  1356. Q-Factor
  1357. @item o
  1358. octave
  1359. @item s
  1360. slope
  1361. @end table
  1362. @item width, w
  1363. Determine how steep is the filter's shelf transition.
  1364. @end table
  1365. @section biquad
  1366. Apply a biquad IIR filter with the given coefficients.
  1367. Where @var{b0}, @var{b1}, @var{b2} and @var{a0}, @var{a1}, @var{a2}
  1368. are the numerator and denominator coefficients respectively.
  1369. @section bs2b
  1370. Bauer stereo to binaural transformation, which improves headphone listening of
  1371. stereo audio records.
  1372. It accepts the following parameters:
  1373. @table @option
  1374. @item profile
  1375. Pre-defined crossfeed level.
  1376. @table @option
  1377. @item default
  1378. Default level (fcut=700, feed=50).
  1379. @item cmoy
  1380. Chu Moy circuit (fcut=700, feed=60).
  1381. @item jmeier
  1382. Jan Meier circuit (fcut=650, feed=95).
  1383. @end table
  1384. @item fcut
  1385. Cut frequency (in Hz).
  1386. @item feed
  1387. Feed level (in Hz).
  1388. @end table
  1389. @section channelmap
  1390. Remap input channels to new locations.
  1391. It accepts the following parameters:
  1392. @table @option
  1393. @item channel_layout
  1394. The channel layout of the output stream.
  1395. @item map
  1396. Map channels from input to output. The argument is a '|'-separated list of
  1397. mappings, each in the @code{@var{in_channel}-@var{out_channel}} or
  1398. @var{in_channel} form. @var{in_channel} can be either the name of the input
  1399. channel (e.g. FL for front left) or its index in the input channel layout.
  1400. @var{out_channel} is the name of the output channel or its index in the output
  1401. channel layout. If @var{out_channel} is not given then it is implicitly an
  1402. index, starting with zero and increasing by one for each mapping.
  1403. @end table
  1404. If no mapping is present, the filter will implicitly map input channels to
  1405. output channels, preserving indices.
  1406. For example, assuming a 5.1+downmix input MOV file,
  1407. @example
  1408. ffmpeg -i in.mov -filter 'channelmap=map=DL-FL|DR-FR' out.wav
  1409. @end example
  1410. will create an output WAV file tagged as stereo from the downmix channels of
  1411. the input.
  1412. To fix a 5.1 WAV improperly encoded in AAC's native channel order
  1413. @example
  1414. ffmpeg -i in.wav -filter 'channelmap=1|2|0|5|3|4:5.1' out.wav
  1415. @end example
  1416. @section channelsplit
  1417. Split each channel from an input audio stream into a separate output stream.
  1418. It accepts the following parameters:
  1419. @table @option
  1420. @item channel_layout
  1421. The channel layout of the input stream. The default is "stereo".
  1422. @end table
  1423. For example, assuming a stereo input MP3 file,
  1424. @example
  1425. ffmpeg -i in.mp3 -filter_complex channelsplit out.mkv
  1426. @end example
  1427. will create an output Matroska file with two audio streams, one containing only
  1428. the left channel and the other the right channel.
  1429. Split a 5.1 WAV file into per-channel files:
  1430. @example
  1431. ffmpeg -i in.wav -filter_complex
  1432. 'channelsplit=channel_layout=5.1[FL][FR][FC][LFE][SL][SR]'
  1433. -map '[FL]' front_left.wav -map '[FR]' front_right.wav -map '[FC]'
  1434. front_center.wav -map '[LFE]' lfe.wav -map '[SL]' side_left.wav -map '[SR]'
  1435. side_right.wav
  1436. @end example
  1437. @section chorus
  1438. Add a chorus effect to the audio.
  1439. Can make a single vocal sound like a chorus, but can also be applied to instrumentation.
  1440. Chorus resembles an echo effect with a short delay, but whereas with echo the delay is
  1441. constant, with chorus, it is varied using using sinusoidal or triangular modulation.
  1442. The modulation depth defines the range the modulated delay is played before or after
  1443. the delay. Hence the delayed sound will sound slower or faster, that is the delayed
  1444. sound tuned around the original one, like in a chorus where some vocals are slightly
  1445. off key.
  1446. It accepts the following parameters:
  1447. @table @option
  1448. @item in_gain
  1449. Set input gain. Default is 0.4.
  1450. @item out_gain
  1451. Set output gain. Default is 0.4.
  1452. @item delays
  1453. Set delays. A typical delay is around 40ms to 60ms.
  1454. @item decays
  1455. Set decays.
  1456. @item speeds
  1457. Set speeds.
  1458. @item depths
  1459. Set depths.
  1460. @end table
  1461. @subsection Examples
  1462. @itemize
  1463. @item
  1464. A single delay:
  1465. @example
  1466. chorus=0.7:0.9:55:0.4:0.25:2
  1467. @end example
  1468. @item
  1469. Two delays:
  1470. @example
  1471. chorus=0.6:0.9:50|60:0.4|0.32:0.25|0.4:2|1.3
  1472. @end example
  1473. @item
  1474. Fuller sounding chorus with three delays:
  1475. @example
  1476. chorus=0.5:0.9:50|60|40:0.4|0.32|0.3:0.25|0.4|0.3:2|2.3|1.3
  1477. @end example
  1478. @end itemize
  1479. @section compand
  1480. Compress or expand the audio's dynamic range.
  1481. It accepts the following parameters:
  1482. @table @option
  1483. @item attacks
  1484. @item decays
  1485. A list of times in seconds for each channel over which the instantaneous level
  1486. of the input signal is averaged to determine its volume. @var{attacks} refers to
  1487. increase of volume and @var{decays} refers to decrease of volume. For most
  1488. situations, the attack time (response to the audio getting louder) should be
  1489. shorter than the decay time, because the human ear is more sensitive to sudden
  1490. loud audio than sudden soft audio. A typical value for attack is 0.3 seconds and
  1491. a typical value for decay is 0.8 seconds.
  1492. If specified number of attacks & decays is lower than number of channels, the last
  1493. set attack/decay will be used for all remaining channels.
  1494. @item points
  1495. A list of points for the transfer function, specified in dB relative to the
  1496. maximum possible signal amplitude. Each key points list must be defined using
  1497. the following syntax: @code{x0/y0|x1/y1|x2/y2|....} or
  1498. @code{x0/y0 x1/y1 x2/y2 ....}
  1499. The input values must be in strictly increasing order but the transfer function
  1500. does not have to be monotonically rising. The point @code{0/0} is assumed but
  1501. may be overridden (by @code{0/out-dBn}). Typical values for the transfer
  1502. function are @code{-70/-70|-60/-20}.
  1503. @item soft-knee
  1504. Set the curve radius in dB for all joints. It defaults to 0.01.
  1505. @item gain
  1506. Set the additional gain in dB to be applied at all points on the transfer
  1507. function. This allows for easy adjustment of the overall gain.
  1508. It defaults to 0.
  1509. @item volume
  1510. Set an initial volume, in dB, to be assumed for each channel when filtering
  1511. starts. This permits the user to supply a nominal level initially, so that, for
  1512. example, a very large gain is not applied to initial signal levels before the
  1513. companding has begun to operate. A typical value for audio which is initially
  1514. quiet is -90 dB. It defaults to 0.
  1515. @item delay
  1516. Set a delay, in seconds. The input audio is analyzed immediately, but audio is
  1517. delayed before being fed to the volume adjuster. Specifying a delay
  1518. approximately equal to the attack/decay times allows the filter to effectively
  1519. operate in predictive rather than reactive mode. It defaults to 0.
  1520. @end table
  1521. @subsection Examples
  1522. @itemize
  1523. @item
  1524. Make music with both quiet and loud passages suitable for listening to in a
  1525. noisy environment:
  1526. @example
  1527. compand=.3|.3:1|1:-90/-60|-60/-40|-40/-30|-20/-20:6:0:-90:0.2
  1528. @end example
  1529. Another example for audio with whisper and explosion parts:
  1530. @example
  1531. compand=0|0:1|1:-90/-900|-70/-70|-30/-9|0/-3:6:0:0:0
  1532. @end example
  1533. @item
  1534. A noise gate for when the noise is at a lower level than the signal:
  1535. @example
  1536. compand=.1|.1:.2|.2:-900/-900|-50.1/-900|-50/-50:.01:0:-90:.1
  1537. @end example
  1538. @item
  1539. Here is another noise gate, this time for when the noise is at a higher level
  1540. than the signal (making it, in some ways, similar to squelch):
  1541. @example
  1542. compand=.1|.1:.1|.1:-45.1/-45.1|-45/-900|0/-900:.01:45:-90:.1
  1543. @end example
  1544. @item
  1545. 2:1 compression starting at -6dB:
  1546. @example
  1547. compand=points=-80/-80|-6/-6|0/-3.8|20/3.5
  1548. @end example
  1549. @item
  1550. 2:1 compression starting at -9dB:
  1551. @example
  1552. compand=points=-80/-80|-9/-9|0/-5.3|20/2.9
  1553. @end example
  1554. @item
  1555. 2:1 compression starting at -12dB:
  1556. @example
  1557. compand=points=-80/-80|-12/-12|0/-6.8|20/1.9
  1558. @end example
  1559. @item
  1560. 2:1 compression starting at -18dB:
  1561. @example
  1562. compand=points=-80/-80|-18/-18|0/-9.8|20/0.7
  1563. @end example
  1564. @item
  1565. 3:1 compression starting at -15dB:
  1566. @example
  1567. compand=points=-80/-80|-15/-15|0/-10.8|20/-5.2
  1568. @end example
  1569. @item
  1570. Compressor/Gate:
  1571. @example
  1572. compand=points=-80/-105|-62/-80|-15.4/-15.4|0/-12|20/-7.6
  1573. @end example
  1574. @item
  1575. Expander:
  1576. @example
  1577. compand=attacks=0:points=-80/-169|-54/-80|-49.5/-64.6|-41.1/-41.1|-25.8/-15|-10.8/-4.5|0/0|20/8.3
  1578. @end example
  1579. @item
  1580. Hard limiter at -6dB:
  1581. @example
  1582. compand=attacks=0:points=-80/-80|-6/-6|20/-6
  1583. @end example
  1584. @item
  1585. Hard limiter at -12dB:
  1586. @example
  1587. compand=attacks=0:points=-80/-80|-12/-12|20/-12
  1588. @end example
  1589. @item
  1590. Hard noise gate at -35 dB:
  1591. @example
  1592. compand=attacks=0:points=-80/-115|-35.1/-80|-35/-35|20/20
  1593. @end example
  1594. @item
  1595. Soft limiter:
  1596. @example
  1597. compand=attacks=0:points=-80/-80|-12.4/-12.4|-6/-8|0/-6.8|20/-2.8
  1598. @end example
  1599. @end itemize
  1600. @section compensationdelay
  1601. Compensation Delay Line is a metric based delay to compensate differing
  1602. positions of microphones or speakers.
  1603. For example, you have recorded guitar with two microphones placed in
  1604. different location. Because the front of sound wave has fixed speed in
  1605. normal conditions, the phasing of microphones can vary and depends on
  1606. their location and interposition. The best sound mix can be achieved when
  1607. these microphones are in phase (synchronized). Note that distance of
  1608. ~30 cm between microphones makes one microphone to capture signal in
  1609. antiphase to another microphone. That makes the final mix sounding moody.
  1610. This filter helps to solve phasing problems by adding different delays
  1611. to each microphone track and make them synchronized.
  1612. The best result can be reached when you take one track as base and
  1613. synchronize other tracks one by one with it.
  1614. Remember that synchronization/delay tolerance depends on sample rate, too.
  1615. Higher sample rates will give more tolerance.
  1616. It accepts the following parameters:
  1617. @table @option
  1618. @item mm
  1619. Set millimeters distance. This is compensation distance for fine tuning.
  1620. Default is 0.
  1621. @item cm
  1622. Set cm distance. This is compensation distance for tightening distance setup.
  1623. Default is 0.
  1624. @item m
  1625. Set meters distance. This is compensation distance for hard distance setup.
  1626. Default is 0.
  1627. @item dry
  1628. Set dry amount. Amount of unprocessed (dry) signal.
  1629. Default is 0.
  1630. @item wet
  1631. Set wet amount. Amount of processed (wet) signal.
  1632. Default is 1.
  1633. @item temp
  1634. Set temperature degree in Celsius. This is the temperature of the environment.
  1635. Default is 20.
  1636. @end table
  1637. @section dcshift
  1638. Apply a DC shift to the audio.
  1639. This can be useful to remove a DC offset (caused perhaps by a hardware problem
  1640. in the recording chain) from the audio. The effect of a DC offset is reduced
  1641. headroom and hence volume. The @ref{astats} filter can be used to determine if
  1642. a signal has a DC offset.
  1643. @table @option
  1644. @item shift
  1645. Set the DC shift, allowed range is [-1, 1]. It indicates the amount to shift
  1646. the audio.
  1647. @item limitergain
  1648. Optional. It should have a value much less than 1 (e.g. 0.05 or 0.02) and is
  1649. used to prevent clipping.
  1650. @end table
  1651. @section dynaudnorm
  1652. Dynamic Audio Normalizer.
  1653. This filter applies a certain amount of gain to the input audio in order
  1654. to bring its peak magnitude to a target level (e.g. 0 dBFS). However, in
  1655. contrast to more "simple" normalization algorithms, the Dynamic Audio
  1656. Normalizer *dynamically* re-adjusts the gain factor to the input audio.
  1657. This allows for applying extra gain to the "quiet" sections of the audio
  1658. while avoiding distortions or clipping the "loud" sections. In other words:
  1659. The Dynamic Audio Normalizer will "even out" the volume of quiet and loud
  1660. sections, in the sense that the volume of each section is brought to the
  1661. same target level. Note, however, that the Dynamic Audio Normalizer achieves
  1662. this goal *without* applying "dynamic range compressing". It will retain 100%
  1663. of the dynamic range *within* each section of the audio file.
  1664. @table @option
  1665. @item f
  1666. Set the frame length in milliseconds. In range from 10 to 8000 milliseconds.
  1667. Default is 500 milliseconds.
  1668. The Dynamic Audio Normalizer processes the input audio in small chunks,
  1669. referred to as frames. This is required, because a peak magnitude has no
  1670. meaning for just a single sample value. Instead, we need to determine the
  1671. peak magnitude for a contiguous sequence of sample values. While a "standard"
  1672. normalizer would simply use the peak magnitude of the complete file, the
  1673. Dynamic Audio Normalizer determines the peak magnitude individually for each
  1674. frame. The length of a frame is specified in milliseconds. By default, the
  1675. Dynamic Audio Normalizer uses a frame length of 500 milliseconds, which has
  1676. been found to give good results with most files.
  1677. Note that the exact frame length, in number of samples, will be determined
  1678. automatically, based on the sampling rate of the individual input audio file.
  1679. @item g
  1680. Set the Gaussian filter window size. In range from 3 to 301, must be odd
  1681. number. Default is 31.
  1682. Probably the most important parameter of the Dynamic Audio Normalizer is the
  1683. @code{window size} of the Gaussian smoothing filter. The filter's window size
  1684. is specified in frames, centered around the current frame. For the sake of
  1685. simplicity, this must be an odd number. Consequently, the default value of 31
  1686. takes into account the current frame, as well as the 15 preceding frames and
  1687. the 15 subsequent frames. Using a larger window results in a stronger
  1688. smoothing effect and thus in less gain variation, i.e. slower gain
  1689. adaptation. Conversely, using a smaller window results in a weaker smoothing
  1690. effect and thus in more gain variation, i.e. faster gain adaptation.
  1691. In other words, the more you increase this value, the more the Dynamic Audio
  1692. Normalizer will behave like a "traditional" normalization filter. On the
  1693. contrary, the more you decrease this value, the more the Dynamic Audio
  1694. Normalizer will behave like a dynamic range compressor.
  1695. @item p
  1696. Set the target peak value. This specifies the highest permissible magnitude
  1697. level for the normalized audio input. This filter will try to approach the
  1698. target peak magnitude as closely as possible, but at the same time it also
  1699. makes sure that the normalized signal will never exceed the peak magnitude.
  1700. A frame's maximum local gain factor is imposed directly by the target peak
  1701. magnitude. The default value is 0.95 and thus leaves a headroom of 5%*.
  1702. It is not recommended to go above this value.
  1703. @item m
  1704. Set the maximum gain factor. In range from 1.0 to 100.0. Default is 10.0.
  1705. The Dynamic Audio Normalizer determines the maximum possible (local) gain
  1706. factor for each input frame, i.e. the maximum gain factor that does not
  1707. result in clipping or distortion. The maximum gain factor is determined by
  1708. the frame's highest magnitude sample. However, the Dynamic Audio Normalizer
  1709. additionally bounds the frame's maximum gain factor by a predetermined
  1710. (global) maximum gain factor. This is done in order to avoid excessive gain
  1711. factors in "silent" or almost silent frames. By default, the maximum gain
  1712. factor is 10.0, For most inputs the default value should be sufficient and
  1713. it usually is not recommended to increase this value. Though, for input
  1714. with an extremely low overall volume level, it may be necessary to allow even
  1715. higher gain factors. Note, however, that the Dynamic Audio Normalizer does
  1716. not simply apply a "hard" threshold (i.e. cut off values above the threshold).
  1717. Instead, a "sigmoid" threshold function will be applied. This way, the
  1718. gain factors will smoothly approach the threshold value, but never exceed that
  1719. value.
  1720. @item r
  1721. Set the target RMS. In range from 0.0 to 1.0. Default is 0.0 - disabled.
  1722. By default, the Dynamic Audio Normalizer performs "peak" normalization.
  1723. This means that the maximum local gain factor for each frame is defined
  1724. (only) by the frame's highest magnitude sample. This way, the samples can
  1725. be amplified as much as possible without exceeding the maximum signal
  1726. level, i.e. without clipping. Optionally, however, the Dynamic Audio
  1727. Normalizer can also take into account the frame's root mean square,
  1728. abbreviated RMS. In electrical engineering, the RMS is commonly used to
  1729. determine the power of a time-varying signal. It is therefore considered
  1730. that the RMS is a better approximation of the "perceived loudness" than
  1731. just looking at the signal's peak magnitude. Consequently, by adjusting all
  1732. frames to a constant RMS value, a uniform "perceived loudness" can be
  1733. established. If a target RMS value has been specified, a frame's local gain
  1734. factor is defined as the factor that would result in exactly that RMS value.
  1735. Note, however, that the maximum local gain factor is still restricted by the
  1736. frame's highest magnitude sample, in order to prevent clipping.
  1737. @item n
  1738. Enable channels coupling. By default is enabled.
  1739. By default, the Dynamic Audio Normalizer will amplify all channels by the same
  1740. amount. This means the same gain factor will be applied to all channels, i.e.
  1741. the maximum possible gain factor is determined by the "loudest" channel.
  1742. However, in some recordings, it may happen that the volume of the different
  1743. channels is uneven, e.g. one channel may be "quieter" than the other one(s).
  1744. In this case, this option can be used to disable the channel coupling. This way,
  1745. the gain factor will be determined independently for each channel, depending
  1746. only on the individual channel's highest magnitude sample. This allows for
  1747. harmonizing the volume of the different channels.
  1748. @item c
  1749. Enable DC bias correction. By default is disabled.
  1750. An audio signal (in the time domain) is a sequence of sample values.
  1751. In the Dynamic Audio Normalizer these sample values are represented in the
  1752. -1.0 to 1.0 range, regardless of the original input format. Normally, the
  1753. audio signal, or "waveform", should be centered around the zero point.
  1754. That means if we calculate the mean value of all samples in a file, or in a
  1755. single frame, then the result should be 0.0 or at least very close to that
  1756. value. If, however, there is a significant deviation of the mean value from
  1757. 0.0, in either positive or negative direction, this is referred to as a
  1758. DC bias or DC offset. Since a DC bias is clearly undesirable, the Dynamic
  1759. Audio Normalizer provides optional DC bias correction.
  1760. With DC bias correction enabled, the Dynamic Audio Normalizer will determine
  1761. the mean value, or "DC correction" offset, of each input frame and subtract
  1762. that value from all of the frame's sample values which ensures those samples
  1763. are centered around 0.0 again. Also, in order to avoid "gaps" at the frame
  1764. boundaries, the DC correction offset values will be interpolated smoothly
  1765. between neighbouring frames.
  1766. @item b
  1767. Enable alternative boundary mode. By default is disabled.
  1768. The Dynamic Audio Normalizer takes into account a certain neighbourhood
  1769. around each frame. This includes the preceding frames as well as the
  1770. subsequent frames. However, for the "boundary" frames, located at the very
  1771. beginning and at the very end of the audio file, not all neighbouring
  1772. frames are available. In particular, for the first few frames in the audio
  1773. file, the preceding frames are not known. And, similarly, for the last few
  1774. frames in the audio file, the subsequent frames are not known. Thus, the
  1775. question arises which gain factors should be assumed for the missing frames
  1776. in the "boundary" region. The Dynamic Audio Normalizer implements two modes
  1777. to deal with this situation. The default boundary mode assumes a gain factor
  1778. of exactly 1.0 for the missing frames, resulting in a smooth "fade in" and
  1779. "fade out" at the beginning and at the end of the input, respectively.
  1780. @item s
  1781. Set the compress factor. In range from 0.0 to 30.0. Default is 0.0.
  1782. By default, the Dynamic Audio Normalizer does not apply "traditional"
  1783. compression. This means that signal peaks will not be pruned and thus the
  1784. full dynamic range will be retained within each local neighbourhood. However,
  1785. in some cases it may be desirable to combine the Dynamic Audio Normalizer's
  1786. normalization algorithm with a more "traditional" compression.
  1787. For this purpose, the Dynamic Audio Normalizer provides an optional compression
  1788. (thresholding) function. If (and only if) the compression feature is enabled,
  1789. all input frames will be processed by a soft knee thresholding function prior
  1790. to the actual normalization process. Put simply, the thresholding function is
  1791. going to prune all samples whose magnitude exceeds a certain threshold value.
  1792. However, the Dynamic Audio Normalizer does not simply apply a fixed threshold
  1793. value. Instead, the threshold value will be adjusted for each individual
  1794. frame.
  1795. In general, smaller parameters result in stronger compression, and vice versa.
  1796. Values below 3.0 are not recommended, because audible distortion may appear.
  1797. @end table
  1798. @section earwax
  1799. Make audio easier to listen to on headphones.
  1800. This filter adds `cues' to 44.1kHz stereo (i.e. audio CD format) audio
  1801. so that when listened to on headphones the stereo image is moved from
  1802. inside your head (standard for headphones) to outside and in front of
  1803. the listener (standard for speakers).
  1804. Ported from SoX.
  1805. @section equalizer
  1806. Apply a two-pole peaking equalisation (EQ) filter. With this
  1807. filter, the signal-level at and around a selected frequency can
  1808. be increased or decreased, whilst (unlike bandpass and bandreject
  1809. filters) that at all other frequencies is unchanged.
  1810. In order to produce complex equalisation curves, this filter can
  1811. be given several times, each with a different central frequency.
  1812. The filter accepts the following options:
  1813. @table @option
  1814. @item frequency, f
  1815. Set the filter's central frequency in Hz.
  1816. @item width_type
  1817. Set method to specify band-width of filter.
  1818. @table @option
  1819. @item h
  1820. Hz
  1821. @item q
  1822. Q-Factor
  1823. @item o
  1824. octave
  1825. @item s
  1826. slope
  1827. @end table
  1828. @item width, w
  1829. Specify the band-width of a filter in width_type units.
  1830. @item gain, g
  1831. Set the required gain or attenuation in dB.
  1832. Beware of clipping when using a positive gain.
  1833. @end table
  1834. @subsection Examples
  1835. @itemize
  1836. @item
  1837. Attenuate 10 dB at 1000 Hz, with a bandwidth of 200 Hz:
  1838. @example
  1839. equalizer=f=1000:width_type=h:width=200:g=-10
  1840. @end example
  1841. @item
  1842. Apply 2 dB gain at 1000 Hz with Q 1 and attenuate 5 dB at 100 Hz with Q 2:
  1843. @example
  1844. equalizer=f=1000:width_type=q:width=1:g=2,equalizer=f=100:width_type=q:width=2:g=-5
  1845. @end example
  1846. @end itemize
  1847. @section extrastereo
  1848. Linearly increases the difference between left and right channels which
  1849. adds some sort of "live" effect to playback.
  1850. The filter accepts the following option:
  1851. @table @option
  1852. @item m
  1853. Sets the difference coefficient (default: 2.5). 0.0 means mono sound
  1854. (average of both channels), with 1.0 sound will be unchanged, with
  1855. -1.0 left and right channels will be swapped.
  1856. @item c
  1857. Enable clipping. By default is enabled.
  1858. @end table
  1859. @section firequalizer
  1860. Apply FIR Equalization using arbitrary frequency response.
  1861. The filter accepts the following option:
  1862. @table @option
  1863. @item gain
  1864. Set gain curve equation (in dB). The expression can contain variables:
  1865. @table @option
  1866. @item f
  1867. the evaluated frequency
  1868. @item sr
  1869. sample rate
  1870. @item ch
  1871. channel number, set to 0 when multichannels evaluation is disabled
  1872. @item chid
  1873. channel id, see libavutil/channel_layout.h, set to the first channel id when
  1874. multichannels evaluation is disabled
  1875. @item chs
  1876. number of channels
  1877. @item chlayout
  1878. channel_layout, see libavutil/channel_layout.h
  1879. @end table
  1880. and functions:
  1881. @table @option
  1882. @item gain_interpolate(f)
  1883. interpolate gain on frequency f based on gain_entry
  1884. @end table
  1885. This option is also available as command. Default is @code{gain_interpolate(f)}.
  1886. @item gain_entry
  1887. Set gain entry for gain_interpolate function. The expression can
  1888. contain functions:
  1889. @table @option
  1890. @item entry(f, g)
  1891. store gain entry at frequency f with value g
  1892. @end table
  1893. This option is also available as command.
  1894. @item delay
  1895. Set filter delay in seconds. Higher value means more accurate.
  1896. Default is @code{0.01}.
  1897. @item accuracy
  1898. Set filter accuracy in Hz. Lower value means more accurate.
  1899. Default is @code{5}.
  1900. @item wfunc
  1901. Set window function. Acceptable values are:
  1902. @table @option
  1903. @item rectangular
  1904. rectangular window, useful when gain curve is already smooth
  1905. @item hann
  1906. hann window (default)
  1907. @item hamming
  1908. hamming window
  1909. @item blackman
  1910. blackman window
  1911. @item nuttall3
  1912. 3-terms continuous 1st derivative nuttall window
  1913. @item mnuttall3
  1914. minimum 3-terms discontinuous nuttall window
  1915. @item nuttall
  1916. 4-terms continuous 1st derivative nuttall window
  1917. @item bnuttall
  1918. minimum 4-terms discontinuous nuttall (blackman-nuttall) window
  1919. @item bharris
  1920. blackman-harris window
  1921. @end table
  1922. @item fixed
  1923. If enabled, use fixed number of audio samples. This improves speed when
  1924. filtering with large delay. Default is disabled.
  1925. @item multi
  1926. Enable multichannels evaluation on gain. Default is disabled.
  1927. @end table
  1928. @subsection Examples
  1929. @itemize
  1930. @item
  1931. lowpass at 1000 Hz:
  1932. @example
  1933. firequalizer=gain='if(lt(f,1000), 0, -INF)'
  1934. @end example
  1935. @item
  1936. lowpass at 1000 Hz with gain_entry:
  1937. @example
  1938. firequalizer=gain_entry='entry(1000,0); entry(1001, -INF)'
  1939. @end example
  1940. @item
  1941. custom equalization:
  1942. @example
  1943. firequalizer=gain_entry='entry(100,0); entry(400, -4); entry(1000, -6); entry(2000, 0)'
  1944. @end example
  1945. @item
  1946. higher delay:
  1947. @example
  1948. firequalizer=delay=0.1:fixed=on
  1949. @end example
  1950. @item
  1951. lowpass on left channel, highpass on right channel:
  1952. @example
  1953. firequalizer=gain='if(eq(chid,1), gain_interpolate(f), if(eq(chid,2), gain_interpolate(1e6+f), 0))'
  1954. :gain_entry='entry(1000, 0); entry(1001,-INF); entry(1e6+1000,0)':multi=on
  1955. @end example
  1956. @end itemize
  1957. @section flanger
  1958. Apply a flanging effect to the audio.
  1959. The filter accepts the following options:
  1960. @table @option
  1961. @item delay
  1962. Set base delay in milliseconds. Range from 0 to 30. Default value is 0.
  1963. @item depth
  1964. Set added swep delay in milliseconds. Range from 0 to 10. Default value is 2.
  1965. @item regen
  1966. Set percentage regeneration (delayed signal feedback). Range from -95 to 95.
  1967. Default value is 0.
  1968. @item width
  1969. Set percentage of delayed signal mixed with original. Range from 0 to 100.
  1970. Default value is 71.
  1971. @item speed
  1972. Set sweeps per second (Hz). Range from 0.1 to 10. Default value is 0.5.
  1973. @item shape
  1974. Set swept wave shape, can be @var{triangular} or @var{sinusoidal}.
  1975. Default value is @var{sinusoidal}.
  1976. @item phase
  1977. Set swept wave percentage-shift for multi channel. Range from 0 to 100.
  1978. Default value is 25.
  1979. @item interp
  1980. Set delay-line interpolation, @var{linear} or @var{quadratic}.
  1981. Default is @var{linear}.
  1982. @end table
  1983. @section highpass
  1984. Apply a high-pass filter with 3dB point frequency.
  1985. The filter can be either single-pole, or double-pole (the default).
  1986. The filter roll off at 6dB per pole per octave (20dB per pole per decade).
  1987. The filter accepts the following options:
  1988. @table @option
  1989. @item frequency, f
  1990. Set frequency in Hz. Default is 3000.
  1991. @item poles, p
  1992. Set number of poles. Default is 2.
  1993. @item width_type
  1994. Set method to specify band-width of filter.
  1995. @table @option
  1996. @item h
  1997. Hz
  1998. @item q
  1999. Q-Factor
  2000. @item o
  2001. octave
  2002. @item s
  2003. slope
  2004. @end table
  2005. @item width, w
  2006. Specify the band-width of a filter in width_type units.
  2007. Applies only to double-pole filter.
  2008. The default is 0.707q and gives a Butterworth response.
  2009. @end table
  2010. @section join
  2011. Join multiple input streams into one multi-channel stream.
  2012. It accepts the following parameters:
  2013. @table @option
  2014. @item inputs
  2015. The number of input streams. It defaults to 2.
  2016. @item channel_layout
  2017. The desired output channel layout. It defaults to stereo.
  2018. @item map
  2019. Map channels from inputs to output. The argument is a '|'-separated list of
  2020. mappings, each in the @code{@var{input_idx}.@var{in_channel}-@var{out_channel}}
  2021. form. @var{input_idx} is the 0-based index of the input stream. @var{in_channel}
  2022. can be either the name of the input channel (e.g. FL for front left) or its
  2023. index in the specified input stream. @var{out_channel} is the name of the output
  2024. channel.
  2025. @end table
  2026. The filter will attempt to guess the mappings when they are not specified
  2027. explicitly. It does so by first trying to find an unused matching input channel
  2028. and if that fails it picks the first unused input channel.
  2029. Join 3 inputs (with properly set channel layouts):
  2030. @example
  2031. ffmpeg -i INPUT1 -i INPUT2 -i INPUT3 -filter_complex join=inputs=3 OUTPUT
  2032. @end example
  2033. Build a 5.1 output from 6 single-channel streams:
  2034. @example
  2035. ffmpeg -i fl -i fr -i fc -i sl -i sr -i lfe -filter_complex
  2036. 'join=inputs=6:channel_layout=5.1:map=0.0-FL|1.0-FR|2.0-FC|3.0-SL|4.0-SR|5.0-LFE'
  2037. out
  2038. @end example
  2039. @section ladspa
  2040. Load a LADSPA (Linux Audio Developer's Simple Plugin API) plugin.
  2041. To enable compilation of this filter you need to configure FFmpeg with
  2042. @code{--enable-ladspa}.
  2043. @table @option
  2044. @item file, f
  2045. Specifies the name of LADSPA plugin library to load. If the environment
  2046. variable @env{LADSPA_PATH} is defined, the LADSPA plugin is searched in
  2047. each one of the directories specified by the colon separated list in
  2048. @env{LADSPA_PATH}, otherwise in the standard LADSPA paths, which are in
  2049. this order: @file{HOME/.ladspa/lib/}, @file{/usr/local/lib/ladspa/},
  2050. @file{/usr/lib/ladspa/}.
  2051. @item plugin, p
  2052. Specifies the plugin within the library. Some libraries contain only
  2053. one plugin, but others contain many of them. If this is not set filter
  2054. will list all available plugins within the specified library.
  2055. @item controls, c
  2056. Set the '|' separated list of controls which are zero or more floating point
  2057. values that determine the behavior of the loaded plugin (for example delay,
  2058. threshold or gain).
  2059. Controls need to be defined using the following syntax:
  2060. c0=@var{value0}|c1=@var{value1}|c2=@var{value2}|..., where
  2061. @var{valuei} is the value set on the @var{i}-th control.
  2062. Alternatively they can be also defined using the following syntax:
  2063. @var{value0}|@var{value1}|@var{value2}|..., where
  2064. @var{valuei} is the value set on the @var{i}-th control.
  2065. If @option{controls} is set to @code{help}, all available controls and
  2066. their valid ranges are printed.
  2067. @item sample_rate, s
  2068. Specify the sample rate, default to 44100. Only used if plugin have
  2069. zero inputs.
  2070. @item nb_samples, n
  2071. Set the number of samples per channel per each output frame, default
  2072. is 1024. Only used if plugin have zero inputs.
  2073. @item duration, d
  2074. Set the minimum duration of the sourced audio. See
  2075. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  2076. for the accepted syntax.
  2077. Note that the resulting duration may be greater than the specified duration,
  2078. as the generated audio is always cut at the end of a complete frame.
  2079. If not specified, or the expressed duration is negative, the audio is
  2080. supposed to be generated forever.
  2081. Only used if plugin have zero inputs.
  2082. @end table
  2083. @subsection Examples
  2084. @itemize
  2085. @item
  2086. List all available plugins within amp (LADSPA example plugin) library:
  2087. @example
  2088. ladspa=file=amp
  2089. @end example
  2090. @item
  2091. List all available controls and their valid ranges for @code{vcf_notch}
  2092. plugin from @code{VCF} library:
  2093. @example
  2094. ladspa=f=vcf:p=vcf_notch:c=help
  2095. @end example
  2096. @item
  2097. Simulate low quality audio equipment using @code{Computer Music Toolkit} (CMT)
  2098. plugin library:
  2099. @example
  2100. ladspa=file=cmt:plugin=lofi:controls=c0=22|c1=12|c2=12
  2101. @end example
  2102. @item
  2103. Add reverberation to the audio using TAP-plugins
  2104. (Tom's Audio Processing plugins):
  2105. @example
  2106. ladspa=file=tap_reverb:tap_reverb
  2107. @end example
  2108. @item
  2109. Generate white noise, with 0.2 amplitude:
  2110. @example
  2111. ladspa=file=cmt:noise_source_white:c=c0=.2
  2112. @end example
  2113. @item
  2114. Generate 20 bpm clicks using plugin @code{C* Click - Metronome} from the
  2115. @code{C* Audio Plugin Suite} (CAPS) library:
  2116. @example
  2117. ladspa=file=caps:Click:c=c1=20'
  2118. @end example
  2119. @item
  2120. Apply @code{C* Eq10X2 - Stereo 10-band equaliser} effect:
  2121. @example
  2122. ladspa=caps:Eq10X2:c=c0=-48|c9=-24|c3=12|c4=2
  2123. @end example
  2124. @item
  2125. Increase volume by 20dB using fast lookahead limiter from Steve Harris
  2126. @code{SWH Plugins} collection:
  2127. @example
  2128. ladspa=fast_lookahead_limiter_1913:fastLookaheadLimiter:20|0|2
  2129. @end example
  2130. @item
  2131. Attenuate low frequencies using Multiband EQ from Steve Harris
  2132. @code{SWH Plugins} collection:
  2133. @example
  2134. ladspa=mbeq_1197:mbeq:-24|-24|-24|0|0|0|0|0|0|0|0|0|0|0|0
  2135. @end example
  2136. @end itemize
  2137. @subsection Commands
  2138. This filter supports the following commands:
  2139. @table @option
  2140. @item cN
  2141. Modify the @var{N}-th control value.
  2142. If the specified value is not valid, it is ignored and prior one is kept.
  2143. @end table
  2144. @section lowpass
  2145. Apply a low-pass filter with 3dB point frequency.
  2146. The filter can be either single-pole or double-pole (the default).
  2147. The filter roll off at 6dB per pole per octave (20dB per pole per decade).
  2148. The filter accepts the following options:
  2149. @table @option
  2150. @item frequency, f
  2151. Set frequency in Hz. Default is 500.
  2152. @item poles, p
  2153. Set number of poles. Default is 2.
  2154. @item width_type
  2155. Set method to specify band-width of filter.
  2156. @table @option
  2157. @item h
  2158. Hz
  2159. @item q
  2160. Q-Factor
  2161. @item o
  2162. octave
  2163. @item s
  2164. slope
  2165. @end table
  2166. @item width, w
  2167. Specify the band-width of a filter in width_type units.
  2168. Applies only to double-pole filter.
  2169. The default is 0.707q and gives a Butterworth response.
  2170. @end table
  2171. @anchor{pan}
  2172. @section pan
  2173. Mix channels with specific gain levels. The filter accepts the output
  2174. channel layout followed by a set of channels definitions.
  2175. This filter is also designed to efficiently remap the channels of an audio
  2176. stream.
  2177. The filter accepts parameters of the form:
  2178. "@var{l}|@var{outdef}|@var{outdef}|..."
  2179. @table @option
  2180. @item l
  2181. output channel layout or number of channels
  2182. @item outdef
  2183. output channel specification, of the form:
  2184. "@var{out_name}=[@var{gain}*]@var{in_name}[+[@var{gain}*]@var{in_name}...]"
  2185. @item out_name
  2186. output channel to define, either a channel name (FL, FR, etc.) or a channel
  2187. number (c0, c1, etc.)
  2188. @item gain
  2189. multiplicative coefficient for the channel, 1 leaving the volume unchanged
  2190. @item in_name
  2191. input channel to use, see out_name for details; it is not possible to mix
  2192. named and numbered input channels
  2193. @end table
  2194. If the `=' in a channel specification is replaced by `<', then the gains for
  2195. that specification will be renormalized so that the total is 1, thus
  2196. avoiding clipping noise.
  2197. @subsection Mixing examples
  2198. For example, if you want to down-mix from stereo to mono, but with a bigger
  2199. factor for the left channel:
  2200. @example
  2201. pan=1c|c0=0.9*c0+0.1*c1
  2202. @end example
  2203. A customized down-mix to stereo that works automatically for 3-, 4-, 5- and
  2204. 7-channels surround:
  2205. @example
  2206. pan=stereo| FL < FL + 0.5*FC + 0.6*BL + 0.6*SL | FR < FR + 0.5*FC + 0.6*BR + 0.6*SR
  2207. @end example
  2208. Note that @command{ffmpeg} integrates a default down-mix (and up-mix) system
  2209. that should be preferred (see "-ac" option) unless you have very specific
  2210. needs.
  2211. @subsection Remapping examples
  2212. The channel remapping will be effective if, and only if:
  2213. @itemize
  2214. @item gain coefficients are zeroes or ones,
  2215. @item only one input per channel output,
  2216. @end itemize
  2217. If all these conditions are satisfied, the filter will notify the user ("Pure
  2218. channel mapping detected"), and use an optimized and lossless method to do the
  2219. remapping.
  2220. For example, if you have a 5.1 source and want a stereo audio stream by
  2221. dropping the extra channels:
  2222. @example
  2223. pan="stereo| c0=FL | c1=FR"
  2224. @end example
  2225. Given the same source, you can also switch front left and front right channels
  2226. and keep the input channel layout:
  2227. @example
  2228. pan="5.1| c0=c1 | c1=c0 | c2=c2 | c3=c3 | c4=c4 | c5=c5"
  2229. @end example
  2230. If the input is a stereo audio stream, you can mute the front left channel (and
  2231. still keep the stereo channel layout) with:
  2232. @example
  2233. pan="stereo|c1=c1"
  2234. @end example
  2235. Still with a stereo audio stream input, you can copy the right channel in both
  2236. front left and right:
  2237. @example
  2238. pan="stereo| c0=FR | c1=FR"
  2239. @end example
  2240. @section replaygain
  2241. ReplayGain scanner filter. This filter takes an audio stream as an input and
  2242. outputs it unchanged.
  2243. At end of filtering it displays @code{track_gain} and @code{track_peak}.
  2244. @section resample
  2245. Convert the audio sample format, sample rate and channel layout. It is
  2246. not meant to be used directly.
  2247. @section rubberband
  2248. Apply time-stretching and pitch-shifting with librubberband.
  2249. The filter accepts the following options:
  2250. @table @option
  2251. @item tempo
  2252. Set tempo scale factor.
  2253. @item pitch
  2254. Set pitch scale factor.
  2255. @item transients
  2256. Set transients detector.
  2257. Possible values are:
  2258. @table @var
  2259. @item crisp
  2260. @item mixed
  2261. @item smooth
  2262. @end table
  2263. @item detector
  2264. Set detector.
  2265. Possible values are:
  2266. @table @var
  2267. @item compound
  2268. @item percussive
  2269. @item soft
  2270. @end table
  2271. @item phase
  2272. Set phase.
  2273. Possible values are:
  2274. @table @var
  2275. @item laminar
  2276. @item independent
  2277. @end table
  2278. @item window
  2279. Set processing window size.
  2280. Possible values are:
  2281. @table @var
  2282. @item standard
  2283. @item short
  2284. @item long
  2285. @end table
  2286. @item smoothing
  2287. Set smoothing.
  2288. Possible values are:
  2289. @table @var
  2290. @item off
  2291. @item on
  2292. @end table
  2293. @item formant
  2294. Enable formant preservation when shift pitching.
  2295. Possible values are:
  2296. @table @var
  2297. @item shifted
  2298. @item preserved
  2299. @end table
  2300. @item pitchq
  2301. Set pitch quality.
  2302. Possible values are:
  2303. @table @var
  2304. @item quality
  2305. @item speed
  2306. @item consistency
  2307. @end table
  2308. @item channels
  2309. Set channels.
  2310. Possible values are:
  2311. @table @var
  2312. @item apart
  2313. @item together
  2314. @end table
  2315. @end table
  2316. @section sidechaincompress
  2317. This filter acts like normal compressor but has the ability to compress
  2318. detected signal using second input signal.
  2319. It needs two input streams and returns one output stream.
  2320. First input stream will be processed depending on second stream signal.
  2321. The filtered signal then can be filtered with other filters in later stages of
  2322. processing. See @ref{pan} and @ref{amerge} filter.
  2323. The filter accepts the following options:
  2324. @table @option
  2325. @item level_in
  2326. Set input gain. Default is 1. Range is between 0.015625 and 64.
  2327. @item threshold
  2328. If a signal of second stream raises above this level it will affect the gain
  2329. reduction of first stream.
  2330. By default is 0.125. Range is between 0.00097563 and 1.
  2331. @item ratio
  2332. Set a ratio about which the signal is reduced. 1:2 means that if the level
  2333. raised 4dB above the threshold, it will be only 2dB above after the reduction.
  2334. Default is 2. Range is between 1 and 20.
  2335. @item attack
  2336. Amount of milliseconds the signal has to rise above the threshold before gain
  2337. reduction starts. Default is 20. Range is between 0.01 and 2000.
  2338. @item release
  2339. Amount of milliseconds the signal has to fall below the threshold before
  2340. reduction is decreased again. Default is 250. Range is between 0.01 and 9000.
  2341. @item makeup
  2342. Set the amount by how much signal will be amplified after processing.
  2343. Default is 2. Range is from 1 and 64.
  2344. @item knee
  2345. Curve the sharp knee around the threshold to enter gain reduction more softly.
  2346. Default is 2.82843. Range is between 1 and 8.
  2347. @item link
  2348. Choose if the @code{average} level between all channels of side-chain stream
  2349. or the louder(@code{maximum}) channel of side-chain stream affects the
  2350. reduction. Default is @code{average}.
  2351. @item detection
  2352. Should the exact signal be taken in case of @code{peak} or an RMS one in case
  2353. of @code{rms}. Default is @code{rms} which is mainly smoother.
  2354. @item level_sc
  2355. Set sidechain gain. Default is 1. Range is between 0.015625 and 64.
  2356. @item mix
  2357. How much to use compressed signal in output. Default is 1.
  2358. Range is between 0 and 1.
  2359. @end table
  2360. @subsection Examples
  2361. @itemize
  2362. @item
  2363. Full ffmpeg example taking 2 audio inputs, 1st input to be compressed
  2364. depending on the signal of 2nd input and later compressed signal to be
  2365. merged with 2nd input:
  2366. @example
  2367. ffmpeg -i main.flac -i sidechain.flac -filter_complex "[1:a]asplit=2[sc][mix];[0:a][sc]sidechaincompress[compr];[compr][mix]amerge"
  2368. @end example
  2369. @end itemize
  2370. @section sidechaingate
  2371. A sidechain gate acts like a normal (wideband) gate but has the ability to
  2372. filter the detected signal before sending it to the gain reduction stage.
  2373. Normally a gate uses the full range signal to detect a level above the
  2374. threshold.
  2375. For example: If you cut all lower frequencies from your sidechain signal
  2376. the gate will decrease the volume of your track only if not enough highs
  2377. appear. With this technique you are able to reduce the resonation of a
  2378. natural drum or remove "rumbling" of muted strokes from a heavily distorted
  2379. guitar.
  2380. It needs two input streams and returns one output stream.
  2381. First input stream will be processed depending on second stream signal.
  2382. The filter accepts the following options:
  2383. @table @option
  2384. @item level_in
  2385. Set input level before filtering.
  2386. Default is 1. Allowed range is from 0.015625 to 64.
  2387. @item range
  2388. Set the level of gain reduction when the signal is below the threshold.
  2389. Default is 0.06125. Allowed range is from 0 to 1.
  2390. @item threshold
  2391. If a signal rises above this level the gain reduction is released.
  2392. Default is 0.125. Allowed range is from 0 to 1.
  2393. @item ratio
  2394. Set a ratio about which the signal is reduced.
  2395. Default is 2. Allowed range is from 1 to 9000.
  2396. @item attack
  2397. Amount of milliseconds the signal has to rise above the threshold before gain
  2398. reduction stops.
  2399. Default is 20 milliseconds. Allowed range is from 0.01 to 9000.
  2400. @item release
  2401. Amount of milliseconds the signal has to fall below the threshold before the
  2402. reduction is increased again. Default is 250 milliseconds.
  2403. Allowed range is from 0.01 to 9000.
  2404. @item makeup
  2405. Set amount of amplification of signal after processing.
  2406. Default is 1. Allowed range is from 1 to 64.
  2407. @item knee
  2408. Curve the sharp knee around the threshold to enter gain reduction more softly.
  2409. Default is 2.828427125. Allowed range is from 1 to 8.
  2410. @item detection
  2411. Choose if exact signal should be taken for detection or an RMS like one.
  2412. Default is rms. Can be peak or rms.
  2413. @item link
  2414. Choose if the average level between all channels or the louder channel affects
  2415. the reduction.
  2416. Default is average. Can be average or maximum.
  2417. @item level_sc
  2418. Set sidechain gain. Default is 1. Range is from 0.015625 to 64.
  2419. @end table
  2420. @section silencedetect
  2421. Detect silence in an audio stream.
  2422. This filter logs a message when it detects that the input audio volume is less
  2423. or equal to a noise tolerance value for a duration greater or equal to the
  2424. minimum detected noise duration.
  2425. The printed times and duration are expressed in seconds.
  2426. The filter accepts the following options:
  2427. @table @option
  2428. @item duration, d
  2429. Set silence duration until notification (default is 2 seconds).
  2430. @item noise, n
  2431. Set noise tolerance. Can be specified in dB (in case "dB" is appended to the
  2432. specified value) or amplitude ratio. Default is -60dB, or 0.001.
  2433. @end table
  2434. @subsection Examples
  2435. @itemize
  2436. @item
  2437. Detect 5 seconds of silence with -50dB noise tolerance:
  2438. @example
  2439. silencedetect=n=-50dB:d=5
  2440. @end example
  2441. @item
  2442. Complete example with @command{ffmpeg} to detect silence with 0.0001 noise
  2443. tolerance in @file{silence.mp3}:
  2444. @example
  2445. ffmpeg -i silence.mp3 -af silencedetect=noise=0.0001 -f null -
  2446. @end example
  2447. @end itemize
  2448. @section silenceremove
  2449. Remove silence from the beginning, middle or end of the audio.
  2450. The filter accepts the following options:
  2451. @table @option
  2452. @item start_periods
  2453. This value is used to indicate if audio should be trimmed at beginning of
  2454. the audio. A value of zero indicates no silence should be trimmed from the
  2455. beginning. When specifying a non-zero value, it trims audio up until it
  2456. finds non-silence. Normally, when trimming silence from beginning of audio
  2457. the @var{start_periods} will be @code{1} but it can be increased to higher
  2458. values to trim all audio up to specific count of non-silence periods.
  2459. Default value is @code{0}.
  2460. @item start_duration
  2461. Specify the amount of time that non-silence must be detected before it stops
  2462. trimming audio. By increasing the duration, bursts of noises can be treated
  2463. as silence and trimmed off. Default value is @code{0}.
  2464. @item start_threshold
  2465. This indicates what sample value should be treated as silence. For digital
  2466. audio, a value of @code{0} may be fine but for audio recorded from analog,
  2467. you may wish to increase the value to account for background noise.
  2468. Can be specified in dB (in case "dB" is appended to the specified value)
  2469. or amplitude ratio. Default value is @code{0}.
  2470. @item stop_periods
  2471. Set the count for trimming silence from the end of audio.
  2472. To remove silence from the middle of a file, specify a @var{stop_periods}
  2473. that is negative. This value is then treated as a positive value and is
  2474. used to indicate the effect should restart processing as specified by
  2475. @var{start_periods}, making it suitable for removing periods of silence
  2476. in the middle of the audio.
  2477. Default value is @code{0}.
  2478. @item stop_duration
  2479. Specify a duration of silence that must exist before audio is not copied any
  2480. more. By specifying a higher duration, silence that is wanted can be left in
  2481. the audio.
  2482. Default value is @code{0}.
  2483. @item stop_threshold
  2484. This is the same as @option{start_threshold} but for trimming silence from
  2485. the end of audio.
  2486. Can be specified in dB (in case "dB" is appended to the specified value)
  2487. or amplitude ratio. Default value is @code{0}.
  2488. @item leave_silence
  2489. This indicate that @var{stop_duration} length of audio should be left intact
  2490. at the beginning of each period of silence.
  2491. For example, if you want to remove long pauses between words but do not want
  2492. to remove the pauses completely. Default value is @code{0}.
  2493. @item detection
  2494. Set how is silence detected. Can be @code{rms} or @code{peak}. Second is faster
  2495. and works better with digital silence which is exactly 0.
  2496. Default value is @code{rms}.
  2497. @item window
  2498. Set ratio used to calculate size of window for detecting silence.
  2499. Default value is @code{0.02}. Allowed range is from @code{0} to @code{10}.
  2500. @end table
  2501. @subsection Examples
  2502. @itemize
  2503. @item
  2504. The following example shows how this filter can be used to start a recording
  2505. that does not contain the delay at the start which usually occurs between
  2506. pressing the record button and the start of the performance:
  2507. @example
  2508. silenceremove=1:5:0.02
  2509. @end example
  2510. @item
  2511. Trim all silence encountered from begining to end where there is more than 1
  2512. second of silence in audio:
  2513. @example
  2514. silenceremove=0:0:0:-1:1:-90dB
  2515. @end example
  2516. @end itemize
  2517. @section sofalizer
  2518. SOFAlizer uses head-related transfer functions (HRTFs) to create virtual
  2519. loudspeakers around the user for binaural listening via headphones (audio
  2520. formats up to 9 channels supported).
  2521. The HRTFs are stored in SOFA files (see @url{http://www.sofacoustics.org/} for a database).
  2522. SOFAlizer is developed at the Acoustics Research Institute (ARI) of the
  2523. Austrian Academy of Sciences.
  2524. To enable compilation of this filter you need to configure FFmpeg with
  2525. @code{--enable-netcdf}.
  2526. The filter accepts the following options:
  2527. @table @option
  2528. @item sofa
  2529. Set the SOFA file used for rendering.
  2530. @item gain
  2531. Set gain applied to audio. Value is in dB. Default is 0.
  2532. @item rotation
  2533. Set rotation of virtual loudspeakers in deg. Default is 0.
  2534. @item elevation
  2535. Set elevation of virtual speakers in deg. Default is 0.
  2536. @item radius
  2537. Set distance in meters between loudspeakers and the listener with near-field
  2538. HRTFs. Default is 1.
  2539. @item type
  2540. Set processing type. Can be @var{time} or @var{freq}. @var{time} is
  2541. processing audio in time domain which is slow.
  2542. @var{freq} is processing audio in frequency domain which is fast.
  2543. Default is @var{freq}.
  2544. @item speakers
  2545. Set custom positions of virtual loudspeakers. Syntax for this option is:
  2546. <CH> <AZIM> <ELEV>[|<CH> <AZIM> <ELEV>|...].
  2547. Each virtual loudspeaker is described with short channel name following with
  2548. azimuth and elevation in degreees.
  2549. Each virtual loudspeaker description is separated by '|'.
  2550. For example to override front left and front right channel positions use:
  2551. 'speakers=FL 45 15|FR 345 15'.
  2552. Descriptions with unrecognised channel names are ignored.
  2553. @end table
  2554. @subsection Examples
  2555. @itemize
  2556. @item
  2557. Using ClubFritz6 sofa file:
  2558. @example
  2559. sofalizer=sofa=/path/to/ClubFritz6.sofa:type=freq:radius=1
  2560. @end example
  2561. @item
  2562. Using ClubFritz12 sofa file and bigger radius with small rotation:
  2563. @example
  2564. sofalizer=sofa=/path/to/ClubFritz12.sofa:type=freq:radius=2:rotation=5
  2565. @end example
  2566. @item
  2567. Similar as above but with custom speaker positions for front left, front right, rear left and rear right
  2568. and also with custom gain:
  2569. @example
  2570. "sofalizer=sofa=/path/to/ClubFritz6.sofa:type=freq:radius=2:speakers=FL 45|FR 315|RL 135|RR 225:gain=28"
  2571. @end example
  2572. @end itemize
  2573. @section stereotools
  2574. This filter has some handy utilities to manage stereo signals, for converting
  2575. M/S stereo recordings to L/R signal while having control over the parameters
  2576. or spreading the stereo image of master track.
  2577. The filter accepts the following options:
  2578. @table @option
  2579. @item level_in
  2580. Set input level before filtering for both channels. Defaults is 1.
  2581. Allowed range is from 0.015625 to 64.
  2582. @item level_out
  2583. Set output level after filtering for both channels. Defaults is 1.
  2584. Allowed range is from 0.015625 to 64.
  2585. @item balance_in
  2586. Set input balance between both channels. Default is 0.
  2587. Allowed range is from -1 to 1.
  2588. @item balance_out
  2589. Set output balance between both channels. Default is 0.
  2590. Allowed range is from -1 to 1.
  2591. @item softclip
  2592. Enable softclipping. Results in analog distortion instead of harsh digital 0dB
  2593. clipping. Disabled by default.
  2594. @item mutel
  2595. Mute the left channel. Disabled by default.
  2596. @item muter
  2597. Mute the right channel. Disabled by default.
  2598. @item phasel
  2599. Change the phase of the left channel. Disabled by default.
  2600. @item phaser
  2601. Change the phase of the right channel. Disabled by default.
  2602. @item mode
  2603. Set stereo mode. Available values are:
  2604. @table @samp
  2605. @item lr>lr
  2606. Left/Right to Left/Right, this is default.
  2607. @item lr>ms
  2608. Left/Right to Mid/Side.
  2609. @item ms>lr
  2610. Mid/Side to Left/Right.
  2611. @item lr>ll
  2612. Left/Right to Left/Left.
  2613. @item lr>rr
  2614. Left/Right to Right/Right.
  2615. @item lr>l+r
  2616. Left/Right to Left + Right.
  2617. @item lr>rl
  2618. Left/Right to Right/Left.
  2619. @end table
  2620. @item slev
  2621. Set level of side signal. Default is 1.
  2622. Allowed range is from 0.015625 to 64.
  2623. @item sbal
  2624. Set balance of side signal. Default is 0.
  2625. Allowed range is from -1 to 1.
  2626. @item mlev
  2627. Set level of the middle signal. Default is 1.
  2628. Allowed range is from 0.015625 to 64.
  2629. @item mpan
  2630. Set middle signal pan. Default is 0. Allowed range is from -1 to 1.
  2631. @item base
  2632. Set stereo base between mono and inversed channels. Default is 0.
  2633. Allowed range is from -1 to 1.
  2634. @item delay
  2635. Set delay in milliseconds how much to delay left from right channel and
  2636. vice versa. Default is 0. Allowed range is from -20 to 20.
  2637. @item sclevel
  2638. Set S/C level. Default is 1. Allowed range is from 1 to 100.
  2639. @item phase
  2640. Set the stereo phase in degrees. Default is 0. Allowed range is from 0 to 360.
  2641. @end table
  2642. @subsection Examples
  2643. @itemize
  2644. @item
  2645. Apply karaoke like effect:
  2646. @example
  2647. stereotools=mlev=0.015625
  2648. @end example
  2649. @item
  2650. Convert M/S signal to L/R:
  2651. @example
  2652. "stereotools=mode=ms>lr"
  2653. @end example
  2654. @end itemize
  2655. @section stereowiden
  2656. This filter enhance the stereo effect by suppressing signal common to both
  2657. channels and by delaying the signal of left into right and vice versa,
  2658. thereby widening the stereo effect.
  2659. The filter accepts the following options:
  2660. @table @option
  2661. @item delay
  2662. Time in milliseconds of the delay of left signal into right and vice versa.
  2663. Default is 20 milliseconds.
  2664. @item feedback
  2665. Amount of gain in delayed signal into right and vice versa. Gives a delay
  2666. effect of left signal in right output and vice versa which gives widening
  2667. effect. Default is 0.3.
  2668. @item crossfeed
  2669. Cross feed of left into right with inverted phase. This helps in suppressing
  2670. the mono. If the value is 1 it will cancel all the signal common to both
  2671. channels. Default is 0.3.
  2672. @item drymix
  2673. Set level of input signal of original channel. Default is 0.8.
  2674. @end table
  2675. @section treble
  2676. Boost or cut treble (upper) frequencies of the audio using a two-pole
  2677. shelving filter with a response similar to that of a standard
  2678. hi-fi's tone-controls. This is also known as shelving equalisation (EQ).
  2679. The filter accepts the following options:
  2680. @table @option
  2681. @item gain, g
  2682. Give the gain at whichever is the lower of ~22 kHz and the
  2683. Nyquist frequency. Its useful range is about -20 (for a large cut)
  2684. to +20 (for a large boost). Beware of clipping when using a positive gain.
  2685. @item frequency, f
  2686. Set the filter's central frequency and so can be used
  2687. to extend or reduce the frequency range to be boosted or cut.
  2688. The default value is @code{3000} Hz.
  2689. @item width_type
  2690. Set method to specify band-width of filter.
  2691. @table @option
  2692. @item h
  2693. Hz
  2694. @item q
  2695. Q-Factor
  2696. @item o
  2697. octave
  2698. @item s
  2699. slope
  2700. @end table
  2701. @item width, w
  2702. Determine how steep is the filter's shelf transition.
  2703. @end table
  2704. @section tremolo
  2705. Sinusoidal amplitude modulation.
  2706. The filter accepts the following options:
  2707. @table @option
  2708. @item f
  2709. Modulation frequency in Hertz. Modulation frequencies in the subharmonic range
  2710. (20 Hz or lower) will result in a tremolo effect.
  2711. This filter may also be used as a ring modulator by specifying
  2712. a modulation frequency higher than 20 Hz.
  2713. Range is 0.1 - 20000.0. Default value is 5.0 Hz.
  2714. @item d
  2715. Depth of modulation as a percentage. Range is 0.0 - 1.0.
  2716. Default value is 0.5.
  2717. @end table
  2718. @section vibrato
  2719. Sinusoidal phase modulation.
  2720. The filter accepts the following options:
  2721. @table @option
  2722. @item f
  2723. Modulation frequency in Hertz.
  2724. Range is 0.1 - 20000.0. Default value is 5.0 Hz.
  2725. @item d
  2726. Depth of modulation as a percentage. Range is 0.0 - 1.0.
  2727. Default value is 0.5.
  2728. @end table
  2729. @section volume
  2730. Adjust the input audio volume.
  2731. It accepts the following parameters:
  2732. @table @option
  2733. @item volume
  2734. Set audio volume expression.
  2735. Output values are clipped to the maximum value.
  2736. The output audio volume is given by the relation:
  2737. @example
  2738. @var{output_volume} = @var{volume} * @var{input_volume}
  2739. @end example
  2740. The default value for @var{volume} is "1.0".
  2741. @item precision
  2742. This parameter represents the mathematical precision.
  2743. It determines which input sample formats will be allowed, which affects the
  2744. precision of the volume scaling.
  2745. @table @option
  2746. @item fixed
  2747. 8-bit fixed-point; this limits input sample format to U8, S16, and S32.
  2748. @item float
  2749. 32-bit floating-point; this limits input sample format to FLT. (default)
  2750. @item double
  2751. 64-bit floating-point; this limits input sample format to DBL.
  2752. @end table
  2753. @item replaygain
  2754. Choose the behaviour on encountering ReplayGain side data in input frames.
  2755. @table @option
  2756. @item drop
  2757. Remove ReplayGain side data, ignoring its contents (the default).
  2758. @item ignore
  2759. Ignore ReplayGain side data, but leave it in the frame.
  2760. @item track
  2761. Prefer the track gain, if present.
  2762. @item album
  2763. Prefer the album gain, if present.
  2764. @end table
  2765. @item replaygain_preamp
  2766. Pre-amplification gain in dB to apply to the selected replaygain gain.
  2767. Default value for @var{replaygain_preamp} is 0.0.
  2768. @item eval
  2769. Set when the volume expression is evaluated.
  2770. It accepts the following values:
  2771. @table @samp
  2772. @item once
  2773. only evaluate expression once during the filter initialization, or
  2774. when the @samp{volume} command is sent
  2775. @item frame
  2776. evaluate expression for each incoming frame
  2777. @end table
  2778. Default value is @samp{once}.
  2779. @end table
  2780. The volume expression can contain the following parameters.
  2781. @table @option
  2782. @item n
  2783. frame number (starting at zero)
  2784. @item nb_channels
  2785. number of channels
  2786. @item nb_consumed_samples
  2787. number of samples consumed by the filter
  2788. @item nb_samples
  2789. number of samples in the current frame
  2790. @item pos
  2791. original frame position in the file
  2792. @item pts
  2793. frame PTS
  2794. @item sample_rate
  2795. sample rate
  2796. @item startpts
  2797. PTS at start of stream
  2798. @item startt
  2799. time at start of stream
  2800. @item t
  2801. frame time
  2802. @item tb
  2803. timestamp timebase
  2804. @item volume
  2805. last set volume value
  2806. @end table
  2807. Note that when @option{eval} is set to @samp{once} only the
  2808. @var{sample_rate} and @var{tb} variables are available, all other
  2809. variables will evaluate to NAN.
  2810. @subsection Commands
  2811. This filter supports the following commands:
  2812. @table @option
  2813. @item volume
  2814. Modify the volume expression.
  2815. The command accepts the same syntax of the corresponding option.
  2816. If the specified expression is not valid, it is kept at its current
  2817. value.
  2818. @item replaygain_noclip
  2819. Prevent clipping by limiting the gain applied.
  2820. Default value for @var{replaygain_noclip} is 1.
  2821. @end table
  2822. @subsection Examples
  2823. @itemize
  2824. @item
  2825. Halve the input audio volume:
  2826. @example
  2827. volume=volume=0.5
  2828. volume=volume=1/2
  2829. volume=volume=-6.0206dB
  2830. @end example
  2831. In all the above example the named key for @option{volume} can be
  2832. omitted, for example like in:
  2833. @example
  2834. volume=0.5
  2835. @end example
  2836. @item
  2837. Increase input audio power by 6 decibels using fixed-point precision:
  2838. @example
  2839. volume=volume=6dB:precision=fixed
  2840. @end example
  2841. @item
  2842. Fade volume after time 10 with an annihilation period of 5 seconds:
  2843. @example
  2844. volume='if(lt(t,10),1,max(1-(t-10)/5,0))':eval=frame
  2845. @end example
  2846. @end itemize
  2847. @section volumedetect
  2848. Detect the volume of the input video.
  2849. The filter has no parameters. The input is not modified. Statistics about
  2850. the volume will be printed in the log when the input stream end is reached.
  2851. In particular it will show the mean volume (root mean square), maximum
  2852. volume (on a per-sample basis), and the beginning of a histogram of the
  2853. registered volume values (from the maximum value to a cumulated 1/1000 of
  2854. the samples).
  2855. All volumes are in decibels relative to the maximum PCM value.
  2856. @subsection Examples
  2857. Here is an excerpt of the output:
  2858. @example
  2859. [Parsed_volumedetect_0 @ 0xa23120] mean_volume: -27 dB
  2860. [Parsed_volumedetect_0 @ 0xa23120] max_volume: -4 dB
  2861. [Parsed_volumedetect_0 @ 0xa23120] histogram_4db: 6
  2862. [Parsed_volumedetect_0 @ 0xa23120] histogram_5db: 62
  2863. [Parsed_volumedetect_0 @ 0xa23120] histogram_6db: 286
  2864. [Parsed_volumedetect_0 @ 0xa23120] histogram_7db: 1042
  2865. [Parsed_volumedetect_0 @ 0xa23120] histogram_8db: 2551
  2866. [Parsed_volumedetect_0 @ 0xa23120] histogram_9db: 4609
  2867. [Parsed_volumedetect_0 @ 0xa23120] histogram_10db: 8409
  2868. @end example
  2869. It means that:
  2870. @itemize
  2871. @item
  2872. The mean square energy is approximately -27 dB, or 10^-2.7.
  2873. @item
  2874. The largest sample is at -4 dB, or more precisely between -4 dB and -5 dB.
  2875. @item
  2876. There are 6 samples at -4 dB, 62 at -5 dB, 286 at -6 dB, etc.
  2877. @end itemize
  2878. In other words, raising the volume by +4 dB does not cause any clipping,
  2879. raising it by +5 dB causes clipping for 6 samples, etc.
  2880. @c man end AUDIO FILTERS
  2881. @chapter Audio Sources
  2882. @c man begin AUDIO SOURCES
  2883. Below is a description of the currently available audio sources.
  2884. @section abuffer
  2885. Buffer audio frames, and make them available to the filter chain.
  2886. This source is mainly intended for a programmatic use, in particular
  2887. through the interface defined in @file{libavfilter/asrc_abuffer.h}.
  2888. It accepts the following parameters:
  2889. @table @option
  2890. @item time_base
  2891. The timebase which will be used for timestamps of submitted frames. It must be
  2892. either a floating-point number or in @var{numerator}/@var{denominator} form.
  2893. @item sample_rate
  2894. The sample rate of the incoming audio buffers.
  2895. @item sample_fmt
  2896. The sample format of the incoming audio buffers.
  2897. Either a sample format name or its corresponding integer representation from
  2898. the enum AVSampleFormat in @file{libavutil/samplefmt.h}
  2899. @item channel_layout
  2900. The channel layout of the incoming audio buffers.
  2901. Either a channel layout name from channel_layout_map in
  2902. @file{libavutil/channel_layout.c} or its corresponding integer representation
  2903. from the AV_CH_LAYOUT_* macros in @file{libavutil/channel_layout.h}
  2904. @item channels
  2905. The number of channels of the incoming audio buffers.
  2906. If both @var{channels} and @var{channel_layout} are specified, then they
  2907. must be consistent.
  2908. @end table
  2909. @subsection Examples
  2910. @example
  2911. abuffer=sample_rate=44100:sample_fmt=s16p:channel_layout=stereo
  2912. @end example
  2913. will instruct the source to accept planar 16bit signed stereo at 44100Hz.
  2914. Since the sample format with name "s16p" corresponds to the number
  2915. 6 and the "stereo" channel layout corresponds to the value 0x3, this is
  2916. equivalent to:
  2917. @example
  2918. abuffer=sample_rate=44100:sample_fmt=6:channel_layout=0x3
  2919. @end example
  2920. @section aevalsrc
  2921. Generate an audio signal specified by an expression.
  2922. This source accepts in input one or more expressions (one for each
  2923. channel), which are evaluated and used to generate a corresponding
  2924. audio signal.
  2925. This source accepts the following options:
  2926. @table @option
  2927. @item exprs
  2928. Set the '|'-separated expressions list for each separate channel. In case the
  2929. @option{channel_layout} option is not specified, the selected channel layout
  2930. depends on the number of provided expressions. Otherwise the last
  2931. specified expression is applied to the remaining output channels.
  2932. @item channel_layout, c
  2933. Set the channel layout. The number of channels in the specified layout
  2934. must be equal to the number of specified expressions.
  2935. @item duration, d
  2936. Set the minimum duration of the sourced audio. See
  2937. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  2938. for the accepted syntax.
  2939. Note that the resulting duration may be greater than the specified
  2940. duration, as the generated audio is always cut at the end of a
  2941. complete frame.
  2942. If not specified, or the expressed duration is negative, the audio is
  2943. supposed to be generated forever.
  2944. @item nb_samples, n
  2945. Set the number of samples per channel per each output frame,
  2946. default to 1024.
  2947. @item sample_rate, s
  2948. Specify the sample rate, default to 44100.
  2949. @end table
  2950. Each expression in @var{exprs} can contain the following constants:
  2951. @table @option
  2952. @item n
  2953. number of the evaluated sample, starting from 0
  2954. @item t
  2955. time of the evaluated sample expressed in seconds, starting from 0
  2956. @item s
  2957. sample rate
  2958. @end table
  2959. @subsection Examples
  2960. @itemize
  2961. @item
  2962. Generate silence:
  2963. @example
  2964. aevalsrc=0
  2965. @end example
  2966. @item
  2967. Generate a sin signal with frequency of 440 Hz, set sample rate to
  2968. 8000 Hz:
  2969. @example
  2970. aevalsrc="sin(440*2*PI*t):s=8000"
  2971. @end example
  2972. @item
  2973. Generate a two channels signal, specify the channel layout (Front
  2974. Center + Back Center) explicitly:
  2975. @example
  2976. aevalsrc="sin(420*2*PI*t)|cos(430*2*PI*t):c=FC|BC"
  2977. @end example
  2978. @item
  2979. Generate white noise:
  2980. @example
  2981. aevalsrc="-2+random(0)"
  2982. @end example
  2983. @item
  2984. Generate an amplitude modulated signal:
  2985. @example
  2986. aevalsrc="sin(10*2*PI*t)*sin(880*2*PI*t)"
  2987. @end example
  2988. @item
  2989. Generate 2.5 Hz binaural beats on a 360 Hz carrier:
  2990. @example
  2991. aevalsrc="0.1*sin(2*PI*(360-2.5/2)*t) | 0.1*sin(2*PI*(360+2.5/2)*t)"
  2992. @end example
  2993. @end itemize
  2994. @section anullsrc
  2995. The null audio source, return unprocessed audio frames. It is mainly useful
  2996. as a template and to be employed in analysis / debugging tools, or as
  2997. the source for filters which ignore the input data (for example the sox
  2998. synth filter).
  2999. This source accepts the following options:
  3000. @table @option
  3001. @item channel_layout, cl
  3002. Specifies the channel layout, and can be either an integer or a string
  3003. representing a channel layout. The default value of @var{channel_layout}
  3004. is "stereo".
  3005. Check the channel_layout_map definition in
  3006. @file{libavutil/channel_layout.c} for the mapping between strings and
  3007. channel layout values.
  3008. @item sample_rate, r
  3009. Specifies the sample rate, and defaults to 44100.
  3010. @item nb_samples, n
  3011. Set the number of samples per requested frames.
  3012. @end table
  3013. @subsection Examples
  3014. @itemize
  3015. @item
  3016. Set the sample rate to 48000 Hz and the channel layout to AV_CH_LAYOUT_MONO.
  3017. @example
  3018. anullsrc=r=48000:cl=4
  3019. @end example
  3020. @item
  3021. Do the same operation with a more obvious syntax:
  3022. @example
  3023. anullsrc=r=48000:cl=mono
  3024. @end example
  3025. @end itemize
  3026. All the parameters need to be explicitly defined.
  3027. @section flite
  3028. Synthesize a voice utterance using the libflite library.
  3029. To enable compilation of this filter you need to configure FFmpeg with
  3030. @code{--enable-libflite}.
  3031. Note that the flite library is not thread-safe.
  3032. The filter accepts the following options:
  3033. @table @option
  3034. @item list_voices
  3035. If set to 1, list the names of the available voices and exit
  3036. immediately. Default value is 0.
  3037. @item nb_samples, n
  3038. Set the maximum number of samples per frame. Default value is 512.
  3039. @item textfile
  3040. Set the filename containing the text to speak.
  3041. @item text
  3042. Set the text to speak.
  3043. @item voice, v
  3044. Set the voice to use for the speech synthesis. Default value is
  3045. @code{kal}. See also the @var{list_voices} option.
  3046. @end table
  3047. @subsection Examples
  3048. @itemize
  3049. @item
  3050. Read from file @file{speech.txt}, and synthesize the text using the
  3051. standard flite voice:
  3052. @example
  3053. flite=textfile=speech.txt
  3054. @end example
  3055. @item
  3056. Read the specified text selecting the @code{slt} voice:
  3057. @example
  3058. flite=text='So fare thee well, poor devil of a Sub-Sub, whose commentator I am':voice=slt
  3059. @end example
  3060. @item
  3061. Input text to ffmpeg:
  3062. @example
  3063. ffmpeg -f lavfi -i flite=text='So fare thee well, poor devil of a Sub-Sub, whose commentator I am':voice=slt
  3064. @end example
  3065. @item
  3066. Make @file{ffplay} speak the specified text, using @code{flite} and
  3067. the @code{lavfi} device:
  3068. @example
  3069. ffplay -f lavfi flite=text='No more be grieved for which that thou hast done.'
  3070. @end example
  3071. @end itemize
  3072. For more information about libflite, check:
  3073. @url{http://www.speech.cs.cmu.edu/flite/}
  3074. @section anoisesrc
  3075. Generate a noise audio signal.
  3076. The filter accepts the following options:
  3077. @table @option
  3078. @item sample_rate, r
  3079. Specify the sample rate. Default value is 48000 Hz.
  3080. @item amplitude, a
  3081. Specify the amplitude (0.0 - 1.0) of the generated audio stream. Default value
  3082. is 1.0.
  3083. @item duration, d
  3084. Specify the duration of the generated audio stream. Not specifying this option
  3085. results in noise with an infinite length.
  3086. @item color, colour, c
  3087. Specify the color of noise. Available noise colors are white, pink, and brown.
  3088. Default color is white.
  3089. @item seed, s
  3090. Specify a value used to seed the PRNG.
  3091. @item nb_samples, n
  3092. Set the number of samples per each output frame, default is 1024.
  3093. @end table
  3094. @subsection Examples
  3095. @itemize
  3096. @item
  3097. Generate 60 seconds of pink noise, with a 44.1 kHz sampling rate and an amplitude of 0.5:
  3098. @example
  3099. anoisesrc=d=60:c=pink:r=44100:a=0.5
  3100. @end example
  3101. @end itemize
  3102. @section sine
  3103. Generate an audio signal made of a sine wave with amplitude 1/8.
  3104. The audio signal is bit-exact.
  3105. The filter accepts the following options:
  3106. @table @option
  3107. @item frequency, f
  3108. Set the carrier frequency. Default is 440 Hz.
  3109. @item beep_factor, b
  3110. Enable a periodic beep every second with frequency @var{beep_factor} times
  3111. the carrier frequency. Default is 0, meaning the beep is disabled.
  3112. @item sample_rate, r
  3113. Specify the sample rate, default is 44100.
  3114. @item duration, d
  3115. Specify the duration of the generated audio stream.
  3116. @item samples_per_frame
  3117. Set the number of samples per output frame.
  3118. The expression can contain the following constants:
  3119. @table @option
  3120. @item n
  3121. The (sequential) number of the output audio frame, starting from 0.
  3122. @item pts
  3123. The PTS (Presentation TimeStamp) of the output audio frame,
  3124. expressed in @var{TB} units.
  3125. @item t
  3126. The PTS of the output audio frame, expressed in seconds.
  3127. @item TB
  3128. The timebase of the output audio frames.
  3129. @end table
  3130. Default is @code{1024}.
  3131. @end table
  3132. @subsection Examples
  3133. @itemize
  3134. @item
  3135. Generate a simple 440 Hz sine wave:
  3136. @example
  3137. sine
  3138. @end example
  3139. @item
  3140. Generate a 220 Hz sine wave with a 880 Hz beep each second, for 5 seconds:
  3141. @example
  3142. sine=220:4:d=5
  3143. sine=f=220:b=4:d=5
  3144. sine=frequency=220:beep_factor=4:duration=5
  3145. @end example
  3146. @item
  3147. Generate a 1 kHz sine wave following @code{1602,1601,1602,1601,1602} NTSC
  3148. pattern:
  3149. @example
  3150. sine=1000:samples_per_frame='st(0,mod(n,5)); 1602-not(not(eq(ld(0),1)+eq(ld(0),3)))'
  3151. @end example
  3152. @end itemize
  3153. @c man end AUDIO SOURCES
  3154. @chapter Audio Sinks
  3155. @c man begin AUDIO SINKS
  3156. Below is a description of the currently available audio sinks.
  3157. @section abuffersink
  3158. Buffer audio frames, and make them available to the end of filter chain.
  3159. This sink is mainly intended for programmatic use, in particular
  3160. through the interface defined in @file{libavfilter/buffersink.h}
  3161. or the options system.
  3162. It accepts a pointer to an AVABufferSinkContext structure, which
  3163. defines the incoming buffers' formats, to be passed as the opaque
  3164. parameter to @code{avfilter_init_filter} for initialization.
  3165. @section anullsink
  3166. Null audio sink; do absolutely nothing with the input audio. It is
  3167. mainly useful as a template and for use in analysis / debugging
  3168. tools.
  3169. @c man end AUDIO SINKS
  3170. @chapter Video Filters
  3171. @c man begin VIDEO FILTERS
  3172. When you configure your FFmpeg build, you can disable any of the
  3173. existing filters using @code{--disable-filters}.
  3174. The configure output will show the video filters included in your
  3175. build.
  3176. Below is a description of the currently available video filters.
  3177. @section alphaextract
  3178. Extract the alpha component from the input as a grayscale video. This
  3179. is especially useful with the @var{alphamerge} filter.
  3180. @section alphamerge
  3181. Add or replace the alpha component of the primary input with the
  3182. grayscale value of a second input. This is intended for use with
  3183. @var{alphaextract} to allow the transmission or storage of frame
  3184. sequences that have alpha in a format that doesn't support an alpha
  3185. channel.
  3186. For example, to reconstruct full frames from a normal YUV-encoded video
  3187. and a separate video created with @var{alphaextract}, you might use:
  3188. @example
  3189. movie=in_alpha.mkv [alpha]; [in][alpha] alphamerge [out]
  3190. @end example
  3191. Since this filter is designed for reconstruction, it operates on frame
  3192. sequences without considering timestamps, and terminates when either
  3193. input reaches end of stream. This will cause problems if your encoding
  3194. pipeline drops frames. If you're trying to apply an image as an
  3195. overlay to a video stream, consider the @var{overlay} filter instead.
  3196. @section ass
  3197. Same as the @ref{subtitles} filter, except that it doesn't require libavcodec
  3198. and libavformat to work. On the other hand, it is limited to ASS (Advanced
  3199. Substation Alpha) subtitles files.
  3200. This filter accepts the following option in addition to the common options from
  3201. the @ref{subtitles} filter:
  3202. @table @option
  3203. @item shaping
  3204. Set the shaping engine
  3205. Available values are:
  3206. @table @samp
  3207. @item auto
  3208. The default libass shaping engine, which is the best available.
  3209. @item simple
  3210. Fast, font-agnostic shaper that can do only substitutions
  3211. @item complex
  3212. Slower shaper using OpenType for substitutions and positioning
  3213. @end table
  3214. The default is @code{auto}.
  3215. @end table
  3216. @section atadenoise
  3217. Apply an Adaptive Temporal Averaging Denoiser to the video input.
  3218. The filter accepts the following options:
  3219. @table @option
  3220. @item 0a
  3221. Set threshold A for 1st plane. Default is 0.02.
  3222. Valid range is 0 to 0.3.
  3223. @item 0b
  3224. Set threshold B for 1st plane. Default is 0.04.
  3225. Valid range is 0 to 5.
  3226. @item 1a
  3227. Set threshold A for 2nd plane. Default is 0.02.
  3228. Valid range is 0 to 0.3.
  3229. @item 1b
  3230. Set threshold B for 2nd plane. Default is 0.04.
  3231. Valid range is 0 to 5.
  3232. @item 2a
  3233. Set threshold A for 3rd plane. Default is 0.02.
  3234. Valid range is 0 to 0.3.
  3235. @item 2b
  3236. Set threshold B for 3rd plane. Default is 0.04.
  3237. Valid range is 0 to 5.
  3238. Threshold A is designed to react on abrupt changes in the input signal and
  3239. threshold B is designed to react on continuous changes in the input signal.
  3240. @item s
  3241. Set number of frames filter will use for averaging. Default is 33. Must be odd
  3242. number in range [5, 129].
  3243. @end table
  3244. @section bbox
  3245. Compute the bounding box for the non-black pixels in the input frame
  3246. luminance plane.
  3247. This filter computes the bounding box containing all the pixels with a
  3248. luminance value greater than the minimum allowed value.
  3249. The parameters describing the bounding box are printed on the filter
  3250. log.
  3251. The filter accepts the following option:
  3252. @table @option
  3253. @item min_val
  3254. Set the minimal luminance value. Default is @code{16}.
  3255. @end table
  3256. @section blackdetect
  3257. Detect video intervals that are (almost) completely black. Can be
  3258. useful to detect chapter transitions, commercials, or invalid
  3259. recordings. Output lines contains the time for the start, end and
  3260. duration of the detected black interval expressed in seconds.
  3261. In order to display the output lines, you need to set the loglevel at
  3262. least to the AV_LOG_INFO value.
  3263. The filter accepts the following options:
  3264. @table @option
  3265. @item black_min_duration, d
  3266. Set the minimum detected black duration expressed in seconds. It must
  3267. be a non-negative floating point number.
  3268. Default value is 2.0.
  3269. @item picture_black_ratio_th, pic_th
  3270. Set the threshold for considering a picture "black".
  3271. Express the minimum value for the ratio:
  3272. @example
  3273. @var{nb_black_pixels} / @var{nb_pixels}
  3274. @end example
  3275. for which a picture is considered black.
  3276. Default value is 0.98.
  3277. @item pixel_black_th, pix_th
  3278. Set the threshold for considering a pixel "black".
  3279. The threshold expresses the maximum pixel luminance value for which a
  3280. pixel is considered "black". The provided value is scaled according to
  3281. the following equation:
  3282. @example
  3283. @var{absolute_threshold} = @var{luminance_minimum_value} + @var{pixel_black_th} * @var{luminance_range_size}
  3284. @end example
  3285. @var{luminance_range_size} and @var{luminance_minimum_value} depend on
  3286. the input video format, the range is [0-255] for YUV full-range
  3287. formats and [16-235] for YUV non full-range formats.
  3288. Default value is 0.10.
  3289. @end table
  3290. The following example sets the maximum pixel threshold to the minimum
  3291. value, and detects only black intervals of 2 or more seconds:
  3292. @example
  3293. blackdetect=d=2:pix_th=0.00
  3294. @end example
  3295. @section blackframe
  3296. Detect frames that are (almost) completely black. Can be useful to
  3297. detect chapter transitions or commercials. Output lines consist of
  3298. the frame number of the detected frame, the percentage of blackness,
  3299. the position in the file if known or -1 and the timestamp in seconds.
  3300. In order to display the output lines, you need to set the loglevel at
  3301. least to the AV_LOG_INFO value.
  3302. It accepts the following parameters:
  3303. @table @option
  3304. @item amount
  3305. The percentage of the pixels that have to be below the threshold; it defaults to
  3306. @code{98}.
  3307. @item threshold, thresh
  3308. The threshold below which a pixel value is considered black; it defaults to
  3309. @code{32}.
  3310. @end table
  3311. @section blend, tblend
  3312. Blend two video frames into each other.
  3313. The @code{blend} filter takes two input streams and outputs one
  3314. stream, the first input is the "top" layer and second input is
  3315. "bottom" layer. Output terminates when shortest input terminates.
  3316. The @code{tblend} (time blend) filter takes two consecutive frames
  3317. from one single stream, and outputs the result obtained by blending
  3318. the new frame on top of the old frame.
  3319. A description of the accepted options follows.
  3320. @table @option
  3321. @item c0_mode
  3322. @item c1_mode
  3323. @item c2_mode
  3324. @item c3_mode
  3325. @item all_mode
  3326. Set blend mode for specific pixel component or all pixel components in case
  3327. of @var{all_mode}. Default value is @code{normal}.
  3328. Available values for component modes are:
  3329. @table @samp
  3330. @item addition
  3331. @item addition128
  3332. @item and
  3333. @item average
  3334. @item burn
  3335. @item darken
  3336. @item difference
  3337. @item difference128
  3338. @item divide
  3339. @item dodge
  3340. @item freeze
  3341. @item exclusion
  3342. @item glow
  3343. @item hardlight
  3344. @item hardmix
  3345. @item heat
  3346. @item lighten
  3347. @item linearlight
  3348. @item multiply
  3349. @item multiply128
  3350. @item negation
  3351. @item normal
  3352. @item or
  3353. @item overlay
  3354. @item phoenix
  3355. @item pinlight
  3356. @item reflect
  3357. @item screen
  3358. @item softlight
  3359. @item subtract
  3360. @item vividlight
  3361. @item xor
  3362. @end table
  3363. @item c0_opacity
  3364. @item c1_opacity
  3365. @item c2_opacity
  3366. @item c3_opacity
  3367. @item all_opacity
  3368. Set blend opacity for specific pixel component or all pixel components in case
  3369. of @var{all_opacity}. Only used in combination with pixel component blend modes.
  3370. @item c0_expr
  3371. @item c1_expr
  3372. @item c2_expr
  3373. @item c3_expr
  3374. @item all_expr
  3375. Set blend expression for specific pixel component or all pixel components in case
  3376. of @var{all_expr}. Note that related mode options will be ignored if those are set.
  3377. The expressions can use the following variables:
  3378. @table @option
  3379. @item N
  3380. The sequential number of the filtered frame, starting from @code{0}.
  3381. @item X
  3382. @item Y
  3383. the coordinates of the current sample
  3384. @item W
  3385. @item H
  3386. the width and height of currently filtered plane
  3387. @item SW
  3388. @item SH
  3389. Width and height scale depending on the currently filtered plane. It is the
  3390. ratio between the corresponding luma plane number of pixels and the current
  3391. plane ones. E.g. for YUV4:2:0 the values are @code{1,1} for the luma plane, and
  3392. @code{0.5,0.5} for chroma planes.
  3393. @item T
  3394. Time of the current frame, expressed in seconds.
  3395. @item TOP, A
  3396. Value of pixel component at current location for first video frame (top layer).
  3397. @item BOTTOM, B
  3398. Value of pixel component at current location for second video frame (bottom layer).
  3399. @end table
  3400. @item shortest
  3401. Force termination when the shortest input terminates. Default is
  3402. @code{0}. This option is only defined for the @code{blend} filter.
  3403. @item repeatlast
  3404. Continue applying the last bottom frame after the end of the stream. A value of
  3405. @code{0} disable the filter after the last frame of the bottom layer is reached.
  3406. Default is @code{1}. This option is only defined for the @code{blend} filter.
  3407. @end table
  3408. @subsection Examples
  3409. @itemize
  3410. @item
  3411. Apply transition from bottom layer to top layer in first 10 seconds:
  3412. @example
  3413. blend=all_expr='A*(if(gte(T,10),1,T/10))+B*(1-(if(gte(T,10),1,T/10)))'
  3414. @end example
  3415. @item
  3416. Apply 1x1 checkerboard effect:
  3417. @example
  3418. blend=all_expr='if(eq(mod(X,2),mod(Y,2)),A,B)'
  3419. @end example
  3420. @item
  3421. Apply uncover left effect:
  3422. @example
  3423. blend=all_expr='if(gte(N*SW+X,W),A,B)'
  3424. @end example
  3425. @item
  3426. Apply uncover down effect:
  3427. @example
  3428. blend=all_expr='if(gte(Y-N*SH,0),A,B)'
  3429. @end example
  3430. @item
  3431. Apply uncover up-left effect:
  3432. @example
  3433. blend=all_expr='if(gte(T*SH*40+Y,H)*gte((T*40*SW+X)*W/H,W),A,B)'
  3434. @end example
  3435. @item
  3436. Split diagonally video and shows top and bottom layer on each side:
  3437. @example
  3438. blend=all_expr=if(gt(X,Y*(W/H)),A,B)
  3439. @end example
  3440. @item
  3441. Display differences between the current and the previous frame:
  3442. @example
  3443. tblend=all_mode=difference128
  3444. @end example
  3445. @end itemize
  3446. @section bwdif
  3447. Deinterlace the input video ("bwdif" stands for "Bob Weaver
  3448. Deinterlacing Filter").
  3449. Motion adaptive deinterlacing based on yadif with the use of w3fdif and cubic
  3450. interpolation algorithms.
  3451. It accepts the following parameters:
  3452. @table @option
  3453. @item mode
  3454. The interlacing mode to adopt. It accepts one of the following values:
  3455. @table @option
  3456. @item 0, send_frame
  3457. Output one frame for each frame.
  3458. @item 1, send_field
  3459. Output one frame for each field.
  3460. @end table
  3461. The default value is @code{send_field}.
  3462. @item parity
  3463. The picture field parity assumed for the input interlaced video. It accepts one
  3464. of the following values:
  3465. @table @option
  3466. @item 0, tff
  3467. Assume the top field is first.
  3468. @item 1, bff
  3469. Assume the bottom field is first.
  3470. @item -1, auto
  3471. Enable automatic detection of field parity.
  3472. @end table
  3473. The default value is @code{auto}.
  3474. If the interlacing is unknown or the decoder does not export this information,
  3475. top field first will be assumed.
  3476. @item deint
  3477. Specify which frames to deinterlace. Accept one of the following
  3478. values:
  3479. @table @option
  3480. @item 0, all
  3481. Deinterlace all frames.
  3482. @item 1, interlaced
  3483. Only deinterlace frames marked as interlaced.
  3484. @end table
  3485. The default value is @code{all}.
  3486. @end table
  3487. @section boxblur
  3488. Apply a boxblur algorithm to the input video.
  3489. It accepts the following parameters:
  3490. @table @option
  3491. @item luma_radius, lr
  3492. @item luma_power, lp
  3493. @item chroma_radius, cr
  3494. @item chroma_power, cp
  3495. @item alpha_radius, ar
  3496. @item alpha_power, ap
  3497. @end table
  3498. A description of the accepted options follows.
  3499. @table @option
  3500. @item luma_radius, lr
  3501. @item chroma_radius, cr
  3502. @item alpha_radius, ar
  3503. Set an expression for the box radius in pixels used for blurring the
  3504. corresponding input plane.
  3505. The radius value must be a non-negative number, and must not be
  3506. greater than the value of the expression @code{min(w,h)/2} for the
  3507. luma and alpha planes, and of @code{min(cw,ch)/2} for the chroma
  3508. planes.
  3509. Default value for @option{luma_radius} is "2". If not specified,
  3510. @option{chroma_radius} and @option{alpha_radius} default to the
  3511. corresponding value set for @option{luma_radius}.
  3512. The expressions can contain the following constants:
  3513. @table @option
  3514. @item w
  3515. @item h
  3516. The input width and height in pixels.
  3517. @item cw
  3518. @item ch
  3519. The input chroma image width and height in pixels.
  3520. @item hsub
  3521. @item vsub
  3522. The horizontal and vertical chroma subsample values. For example, for the
  3523. pixel format "yuv422p", @var{hsub} is 2 and @var{vsub} is 1.
  3524. @end table
  3525. @item luma_power, lp
  3526. @item chroma_power, cp
  3527. @item alpha_power, ap
  3528. Specify how many times the boxblur filter is applied to the
  3529. corresponding plane.
  3530. Default value for @option{luma_power} is 2. If not specified,
  3531. @option{chroma_power} and @option{alpha_power} default to the
  3532. corresponding value set for @option{luma_power}.
  3533. A value of 0 will disable the effect.
  3534. @end table
  3535. @subsection Examples
  3536. @itemize
  3537. @item
  3538. Apply a boxblur filter with the luma, chroma, and alpha radii
  3539. set to 2:
  3540. @example
  3541. boxblur=luma_radius=2:luma_power=1
  3542. boxblur=2:1
  3543. @end example
  3544. @item
  3545. Set the luma radius to 2, and alpha and chroma radius to 0:
  3546. @example
  3547. boxblur=2:1:cr=0:ar=0
  3548. @end example
  3549. @item
  3550. Set the luma and chroma radii to a fraction of the video dimension:
  3551. @example
  3552. boxblur=luma_radius=min(h\,w)/10:luma_power=1:chroma_radius=min(cw\,ch)/10:chroma_power=1
  3553. @end example
  3554. @end itemize
  3555. @section chromakey
  3556. YUV colorspace color/chroma keying.
  3557. The filter accepts the following options:
  3558. @table @option
  3559. @item color
  3560. The color which will be replaced with transparency.
  3561. @item similarity
  3562. Similarity percentage with the key color.
  3563. 0.01 matches only the exact key color, while 1.0 matches everything.
  3564. @item blend
  3565. Blend percentage.
  3566. 0.0 makes pixels either fully transparent, or not transparent at all.
  3567. Higher values result in semi-transparent pixels, with a higher transparency
  3568. the more similar the pixels color is to the key color.
  3569. @item yuv
  3570. Signals that the color passed is already in YUV instead of RGB.
  3571. Litteral colors like "green" or "red" don't make sense with this enabled anymore.
  3572. This can be used to pass exact YUV values as hexadecimal numbers.
  3573. @end table
  3574. @subsection Examples
  3575. @itemize
  3576. @item
  3577. Make every green pixel in the input image transparent:
  3578. @example
  3579. ffmpeg -i input.png -vf chromakey=green out.png
  3580. @end example
  3581. @item
  3582. Overlay a greenscreen-video on top of a static black background.
  3583. @example
  3584. ffmpeg -f lavfi -i color=c=black:s=1280x720 -i video.mp4 -shortest -filter_complex "[1:v]chromakey=0x70de77:0.1:0.2[ckout];[0:v][ckout]overlay[out]" -map "[out]" output.mkv
  3585. @end example
  3586. @end itemize
  3587. @section ciescope
  3588. Display CIE color diagram with pixels overlaid onto it.
  3589. The filter acccepts the following options:
  3590. @table @option
  3591. @item system
  3592. Set color system.
  3593. @table @samp
  3594. @item ntsc, 470m
  3595. @item ebu, 470bg
  3596. @item smpte
  3597. @item 240m
  3598. @item apple
  3599. @item widergb
  3600. @item cie1931
  3601. @item rec709, hdtv
  3602. @item uhdtv, rec2020
  3603. @end table
  3604. @item cie
  3605. Set CIE system.
  3606. @table @samp
  3607. @item xyy
  3608. @item ucs
  3609. @item luv
  3610. @end table
  3611. @item gamuts
  3612. Set what gamuts to draw.
  3613. See @code{system} option for avaiable values.
  3614. @item size, s
  3615. Set ciescope size, by default set to 512.
  3616. @item intensity, i
  3617. Set intensity used to map input pixel values to CIE diagram.
  3618. @item contrast
  3619. Set contrast used to draw tongue colors that are out of active color system gamut.
  3620. @item corrgamma
  3621. Correct gamma displayed on scope, by default enabled.
  3622. @item showwhite
  3623. Show white point on CIE diagram, by default disabled.
  3624. @item gamma
  3625. Set input gamma. Used only with XYZ input color space.
  3626. @end table
  3627. @section codecview
  3628. Visualize information exported by some codecs.
  3629. Some codecs can export information through frames using side-data or other
  3630. means. For example, some MPEG based codecs export motion vectors through the
  3631. @var{export_mvs} flag in the codec @option{flags2} option.
  3632. The filter accepts the following option:
  3633. @table @option
  3634. @item mv
  3635. Set motion vectors to visualize.
  3636. Available flags for @var{mv} are:
  3637. @table @samp
  3638. @item pf
  3639. forward predicted MVs of P-frames
  3640. @item bf
  3641. forward predicted MVs of B-frames
  3642. @item bb
  3643. backward predicted MVs of B-frames
  3644. @end table
  3645. @item qp
  3646. Display quantization parameters using the chroma planes
  3647. @end table
  3648. @subsection Examples
  3649. @itemize
  3650. @item
  3651. Visualizes multi-directionals MVs from P and B-Frames using @command{ffplay}:
  3652. @example
  3653. ffplay -flags2 +export_mvs input.mpg -vf codecview=mv=pf+bf+bb
  3654. @end example
  3655. @end itemize
  3656. @section colorbalance
  3657. Modify intensity of primary colors (red, green and blue) of input frames.
  3658. The filter allows an input frame to be adjusted in the shadows, midtones or highlights
  3659. regions for the red-cyan, green-magenta or blue-yellow balance.
  3660. A positive adjustment value shifts the balance towards the primary color, a negative
  3661. value towards the complementary color.
  3662. The filter accepts the following options:
  3663. @table @option
  3664. @item rs
  3665. @item gs
  3666. @item bs
  3667. Adjust red, green and blue shadows (darkest pixels).
  3668. @item rm
  3669. @item gm
  3670. @item bm
  3671. Adjust red, green and blue midtones (medium pixels).
  3672. @item rh
  3673. @item gh
  3674. @item bh
  3675. Adjust red, green and blue highlights (brightest pixels).
  3676. Allowed ranges for options are @code{[-1.0, 1.0]}. Defaults are @code{0}.
  3677. @end table
  3678. @subsection Examples
  3679. @itemize
  3680. @item
  3681. Add red color cast to shadows:
  3682. @example
  3683. colorbalance=rs=.3
  3684. @end example
  3685. @end itemize
  3686. @section colorkey
  3687. RGB colorspace color keying.
  3688. The filter accepts the following options:
  3689. @table @option
  3690. @item color
  3691. The color which will be replaced with transparency.
  3692. @item similarity
  3693. Similarity percentage with the key color.
  3694. 0.01 matches only the exact key color, while 1.0 matches everything.
  3695. @item blend
  3696. Blend percentage.
  3697. 0.0 makes pixels either fully transparent, or not transparent at all.
  3698. Higher values result in semi-transparent pixels, with a higher transparency
  3699. the more similar the pixels color is to the key color.
  3700. @end table
  3701. @subsection Examples
  3702. @itemize
  3703. @item
  3704. Make every green pixel in the input image transparent:
  3705. @example
  3706. ffmpeg -i input.png -vf colorkey=green out.png
  3707. @end example
  3708. @item
  3709. Overlay a greenscreen-video on top of a static background image.
  3710. @example
  3711. ffmpeg -i background.png -i video.mp4 -filter_complex "[1:v]colorkey=0x3BBD1E:0.3:0.2[ckout];[0:v][ckout]overlay[out]" -map "[out]" output.flv
  3712. @end example
  3713. @end itemize
  3714. @section colorlevels
  3715. Adjust video input frames using levels.
  3716. The filter accepts the following options:
  3717. @table @option
  3718. @item rimin
  3719. @item gimin
  3720. @item bimin
  3721. @item aimin
  3722. Adjust red, green, blue and alpha input black point.
  3723. Allowed ranges for options are @code{[-1.0, 1.0]}. Defaults are @code{0}.
  3724. @item rimax
  3725. @item gimax
  3726. @item bimax
  3727. @item aimax
  3728. Adjust red, green, blue and alpha input white point.
  3729. Allowed ranges for options are @code{[-1.0, 1.0]}. Defaults are @code{1}.
  3730. Input levels are used to lighten highlights (bright tones), darken shadows
  3731. (dark tones), change the balance of bright and dark tones.
  3732. @item romin
  3733. @item gomin
  3734. @item bomin
  3735. @item aomin
  3736. Adjust red, green, blue and alpha output black point.
  3737. Allowed ranges for options are @code{[0, 1.0]}. Defaults are @code{0}.
  3738. @item romax
  3739. @item gomax
  3740. @item bomax
  3741. @item aomax
  3742. Adjust red, green, blue and alpha output white point.
  3743. Allowed ranges for options are @code{[0, 1.0]}. Defaults are @code{1}.
  3744. Output levels allows manual selection of a constrained output level range.
  3745. @end table
  3746. @subsection Examples
  3747. @itemize
  3748. @item
  3749. Make video output darker:
  3750. @example
  3751. colorlevels=rimin=0.058:gimin=0.058:bimin=0.058
  3752. @end example
  3753. @item
  3754. Increase contrast:
  3755. @example
  3756. colorlevels=rimin=0.039:gimin=0.039:bimin=0.039:rimax=0.96:gimax=0.96:bimax=0.96
  3757. @end example
  3758. @item
  3759. Make video output lighter:
  3760. @example
  3761. colorlevels=rimax=0.902:gimax=0.902:bimax=0.902
  3762. @end example
  3763. @item
  3764. Increase brightness:
  3765. @example
  3766. colorlevels=romin=0.5:gomin=0.5:bomin=0.5
  3767. @end example
  3768. @end itemize
  3769. @section colorchannelmixer
  3770. Adjust video input frames by re-mixing color channels.
  3771. This filter modifies a color channel by adding the values associated to
  3772. the other channels of the same pixels. For example if the value to
  3773. modify is red, the output value will be:
  3774. @example
  3775. @var{red}=@var{red}*@var{rr} + @var{blue}*@var{rb} + @var{green}*@var{rg} + @var{alpha}*@var{ra}
  3776. @end example
  3777. The filter accepts the following options:
  3778. @table @option
  3779. @item rr
  3780. @item rg
  3781. @item rb
  3782. @item ra
  3783. Adjust contribution of input red, green, blue and alpha channels for output red channel.
  3784. Default is @code{1} for @var{rr}, and @code{0} for @var{rg}, @var{rb} and @var{ra}.
  3785. @item gr
  3786. @item gg
  3787. @item gb
  3788. @item ga
  3789. Adjust contribution of input red, green, blue and alpha channels for output green channel.
  3790. Default is @code{1} for @var{gg}, and @code{0} for @var{gr}, @var{gb} and @var{ga}.
  3791. @item br
  3792. @item bg
  3793. @item bb
  3794. @item ba
  3795. Adjust contribution of input red, green, blue and alpha channels for output blue channel.
  3796. Default is @code{1} for @var{bb}, and @code{0} for @var{br}, @var{bg} and @var{ba}.
  3797. @item ar
  3798. @item ag
  3799. @item ab
  3800. @item aa
  3801. Adjust contribution of input red, green, blue and alpha channels for output alpha channel.
  3802. Default is @code{1} for @var{aa}, and @code{0} for @var{ar}, @var{ag} and @var{ab}.
  3803. Allowed ranges for options are @code{[-2.0, 2.0]}.
  3804. @end table
  3805. @subsection Examples
  3806. @itemize
  3807. @item
  3808. Convert source to grayscale:
  3809. @example
  3810. colorchannelmixer=.3:.4:.3:0:.3:.4:.3:0:.3:.4:.3
  3811. @end example
  3812. @item
  3813. Simulate sepia tones:
  3814. @example
  3815. colorchannelmixer=.393:.769:.189:0:.349:.686:.168:0:.272:.534:.131
  3816. @end example
  3817. @end itemize
  3818. @section colormatrix
  3819. Convert color matrix.
  3820. The filter accepts the following options:
  3821. @table @option
  3822. @item src
  3823. @item dst
  3824. Specify the source and destination color matrix. Both values must be
  3825. specified.
  3826. The accepted values are:
  3827. @table @samp
  3828. @item bt709
  3829. BT.709
  3830. @item bt601
  3831. BT.601
  3832. @item smpte240m
  3833. SMPTE-240M
  3834. @item fcc
  3835. FCC
  3836. @end table
  3837. @end table
  3838. For example to convert from BT.601 to SMPTE-240M, use the command:
  3839. @example
  3840. colormatrix=bt601:smpte240m
  3841. @end example
  3842. @section colorspace
  3843. Convert colorspace, transfer characteristics or color primaries.
  3844. The filter accepts the following options:
  3845. @table @option
  3846. @item all
  3847. Specify all color properties at once.
  3848. The accepted values are:
  3849. @table @samp
  3850. @item bt470m
  3851. BT.470M
  3852. @item bt470bg
  3853. BT.470BG
  3854. @item bt601-6-525
  3855. BT.601-6 525
  3856. @item bt601-6-625
  3857. BT.601-6 625
  3858. @item bt709
  3859. BT.709
  3860. @item smpte170m
  3861. SMPTE-170M
  3862. @item smpte240m
  3863. SMPTE-240M
  3864. @item bt2020
  3865. BT.2020
  3866. @end table
  3867. @item space
  3868. Specify output colorspace.
  3869. The accepted values are:
  3870. @table @samp
  3871. @item bt709
  3872. BT.709
  3873. @item fcc
  3874. FCC
  3875. @item bt470bg
  3876. BT.470BG or BT.601-6 625
  3877. @item smpte170m
  3878. SMPTE-170M or BT.601-6 525
  3879. @item smpte240m
  3880. SMPTE-240M
  3881. @item bt2020ncl
  3882. BT.2020 with non-constant luminance
  3883. @end table
  3884. @item trc
  3885. Specify output transfer characteristics.
  3886. The accepted values are:
  3887. @table @samp
  3888. @item bt709
  3889. BT.709
  3890. @item gamma22
  3891. Constant gamma of 2.2
  3892. @item gamma28
  3893. Constant gamma of 2.8
  3894. @item smpte170m
  3895. SMPTE-170M, BT.601-6 625 or BT.601-6 525
  3896. @item smpte240m
  3897. SMPTE-240M
  3898. @item bt2020-10
  3899. BT.2020 for 10-bits content
  3900. @item bt2020-12
  3901. BT.2020 for 12-bits content
  3902. @end table
  3903. @item prm
  3904. Specify output color primaries.
  3905. The accepted values are:
  3906. @table @samp
  3907. @item bt709
  3908. BT.709
  3909. @item bt470m
  3910. BT.470M
  3911. @item bt470bg
  3912. BT.470BG or BT.601-6 625
  3913. @item smpte170m
  3914. SMPTE-170M or BT.601-6 525
  3915. @item smpte240m
  3916. SMPTE-240M
  3917. @item bt2020
  3918. BT.2020
  3919. @end table
  3920. @item rng
  3921. Specify output color range.
  3922. The accepted values are:
  3923. @table @samp
  3924. @item mpeg
  3925. MPEG (restricted) range
  3926. @item jpeg
  3927. JPEG (full) range
  3928. @end table
  3929. @item format
  3930. Specify output color format.
  3931. The accepted values are:
  3932. @table @samp
  3933. @item yuv420p
  3934. YUV 4:2:0 planar 8-bits
  3935. @item yuv420p10
  3936. YUV 4:2:0 planar 10-bits
  3937. @item yuv420p12
  3938. YUV 4:2:0 planar 12-bits
  3939. @item yuv422p
  3940. YUV 4:2:2 planar 8-bits
  3941. @item yuv422p10
  3942. YUV 4:2:2 planar 10-bits
  3943. @item yuv422p12
  3944. YUV 4:2:2 planar 12-bits
  3945. @item yuv444p
  3946. YUV 4:4:4 planar 8-bits
  3947. @item yuv444p10
  3948. YUV 4:4:4 planar 10-bits
  3949. @item yuv444p12
  3950. YUV 4:4:4 planar 12-bits
  3951. @end table
  3952. @item fast
  3953. Do a fast conversion, which skips gamma/primary correction. This will take
  3954. significantly less CPU, but will be mathematically incorrect. To get output
  3955. compatible with that produced by the colormatrix filter, use fast=1.
  3956. @end table
  3957. The filter converts the transfer characteristics, color space and color
  3958. primaries to the specified user values. The output value, if not specified,
  3959. is set to a default value based on the "all" property. If that property is
  3960. also not specified, the filter will log an error. The output color range and
  3961. format default to the same value as the input color range and format. The
  3962. input transfer characteristics, color space, color primaries and color range
  3963. should be set on the input data. If any of these are missing, the filter will
  3964. log an error and no conversion will take place.
  3965. For example to convert the input to SMPTE-240M, use the command:
  3966. @example
  3967. colorspace=smpte240m
  3968. @end example
  3969. @section convolution
  3970. Apply convolution 3x3 or 5x5 filter.
  3971. The filter accepts the following options:
  3972. @table @option
  3973. @item 0m
  3974. @item 1m
  3975. @item 2m
  3976. @item 3m
  3977. Set matrix for each plane.
  3978. Matrix is sequence of 9 or 25 signed integers.
  3979. @item 0rdiv
  3980. @item 1rdiv
  3981. @item 2rdiv
  3982. @item 3rdiv
  3983. Set multiplier for calculated value for each plane.
  3984. @item 0bias
  3985. @item 1bias
  3986. @item 2bias
  3987. @item 3bias
  3988. Set bias for each plane. This value is added to the result of the multiplication.
  3989. Useful for making the overall image brighter or darker. Default is 0.0.
  3990. @end table
  3991. @subsection Examples
  3992. @itemize
  3993. @item
  3994. Apply sharpen:
  3995. @example
  3996. convolution="0 -1 0 -1 5 -1 0 -1 0:0 -1 0 -1 5 -1 0 -1 0:0 -1 0 -1 5 -1 0 -1 0:0 -1 0 -1 5 -1 0 -1 0"
  3997. @end example
  3998. @item
  3999. Apply blur:
  4000. @example
  4001. convolution="1 1 1 1 1 1 1 1 1:1 1 1 1 1 1 1 1 1:1 1 1 1 1 1 1 1 1:1 1 1 1 1 1 1 1 1:1/9:1/9:1/9:1/9"
  4002. @end example
  4003. @item
  4004. Apply edge enhance:
  4005. @example
  4006. convolution="0 0 0 -1 1 0 0 0 0:0 0 0 -1 1 0 0 0 0:0 0 0 -1 1 0 0 0 0:0 0 0 -1 1 0 0 0 0:5:1:1:1:0:128:128:128"
  4007. @end example
  4008. @item
  4009. Apply edge detect:
  4010. @example
  4011. convolution="0 1 0 1 -4 1 0 1 0:0 1 0 1 -4 1 0 1 0:0 1 0 1 -4 1 0 1 0:0 1 0 1 -4 1 0 1 0:5:5:5:1:0:128:128:128"
  4012. @end example
  4013. @item
  4014. Apply emboss:
  4015. @example
  4016. convolution="-2 -1 0 -1 1 1 0 1 2:-2 -1 0 -1 1 1 0 1 2:-2 -1 0 -1 1 1 0 1 2:-2 -1 0 -1 1 1 0 1 2"
  4017. @end example
  4018. @end itemize
  4019. @section copy
  4020. Copy the input source unchanged to the output. This is mainly useful for
  4021. testing purposes.
  4022. @anchor{coreimage}
  4023. @section coreimage
  4024. Video filtering on GPU using Apple's CoreImage API on OSX.
  4025. Hardware acceleration is based on an OpenGL context. Usually, this means it is
  4026. processed by video hardware. However, software-based OpenGL implementations
  4027. exist which means there is no guarantee for hardware processing. It depends on
  4028. the respective OSX.
  4029. There are many filters and image generators provided by Apple that come with a
  4030. large variety of options. The filter has to be referenced by its name along
  4031. with its options.
  4032. The coreimage filter accepts the following options:
  4033. @table @option
  4034. @item list_filters
  4035. List all available filters and generators along with all their respective
  4036. options as well as possible minimum and maximum values along with the default
  4037. values.
  4038. @example
  4039. list_filters=true
  4040. @end example
  4041. @item filter
  4042. Specify all filters by their respective name and options.
  4043. Use @var{list_filters} to determine all valid filter names and options.
  4044. Numerical options are specified by a float value and are automatically clamped
  4045. to their respective value range. Vector and color options have to be specified
  4046. by a list of space separated float values. Character escaping has to be done.
  4047. A special option name @code{default} is available to use default options for a
  4048. filter.
  4049. It is required to specify either @code{default} or at least one of the filter options.
  4050. All omitted options are used with their default values.
  4051. The syntax of the filter string is as follows:
  4052. @example
  4053. filter=<NAME>@@<OPTION>=<VALUE>[@@<OPTION>=<VALUE>][@@...][#<NAME>@@<OPTION>=<VALUE>[@@<OPTION>=<VALUE>][@@...]][#...]
  4054. @end example
  4055. @item output_rect
  4056. Specify a rectangle where the output of the filter chain is copied into the
  4057. input image. It is given by a list of space separated float values:
  4058. @example
  4059. output_rect=x\ y\ width\ height
  4060. @end example
  4061. If not given, the output rectangle equals the dimensions of the input image.
  4062. The output rectangle is automatically cropped at the borders of the input
  4063. image. Negative values are valid for each component.
  4064. @example
  4065. output_rect=25\ 25\ 100\ 100
  4066. @end example
  4067. @end table
  4068. Several filters can be chained for successive processing without GPU-HOST
  4069. transfers allowing for fast processing of complex filter chains.
  4070. Currently, only filters with zero (generators) or exactly one (filters) input
  4071. image and one output image are supported. Also, transition filters are not yet
  4072. usable as intended.
  4073. Some filters generate output images with additional padding depending on the
  4074. respective filter kernel. The padding is automatically removed to ensure the
  4075. filter output has the same size as the input image.
  4076. For image generators, the size of the output image is determined by the
  4077. previous output image of the filter chain or the input image of the whole
  4078. filterchain, respectively. The generators do not use the pixel information of
  4079. this image to generate their output. However, the generated output is
  4080. blended onto this image, resulting in partial or complete coverage of the
  4081. output image.
  4082. The @ref{coreimagesrc} video source can be used for generating input images
  4083. which are directly fed into the filter chain. By using it, providing input
  4084. images by another video source or an input video is not required.
  4085. @subsection Examples
  4086. @itemize
  4087. @item
  4088. List all filters available:
  4089. @example
  4090. coreimage=list_filters=true
  4091. @end example
  4092. @item
  4093. Use the CIBoxBlur filter with default options to blur an image:
  4094. @example
  4095. coreimage=filter=CIBoxBlur@@default
  4096. @end example
  4097. @item
  4098. Use a filter chain with CISepiaTone at default values and CIVignetteEffect with
  4099. its center at 100x100 and a radius of 50 pixels:
  4100. @example
  4101. coreimage=filter=CIBoxBlur@@default#CIVignetteEffect@@inputCenter=100\ 100@@inputRadius=50
  4102. @end example
  4103. @item
  4104. Use nullsrc and CIQRCodeGenerator to create a QR code for the FFmpeg homepage,
  4105. given as complete and escaped command-line for Apple's standard bash shell:
  4106. @example
  4107. ffmpeg -f lavfi -i nullsrc=s=100x100,coreimage=filter=CIQRCodeGenerator@@inputMessage=https\\\\\://FFmpeg.org/@@inputCorrectionLevel=H -frames:v 1 QRCode.png
  4108. @end example
  4109. @end itemize
  4110. @section crop
  4111. Crop the input video to given dimensions.
  4112. It accepts the following parameters:
  4113. @table @option
  4114. @item w, out_w
  4115. The width of the output video. It defaults to @code{iw}.
  4116. This expression is evaluated only once during the filter
  4117. configuration, or when the @samp{w} or @samp{out_w} command is sent.
  4118. @item h, out_h
  4119. The height of the output video. It defaults to @code{ih}.
  4120. This expression is evaluated only once during the filter
  4121. configuration, or when the @samp{h} or @samp{out_h} command is sent.
  4122. @item x
  4123. The horizontal position, in the input video, of the left edge of the output
  4124. video. It defaults to @code{(in_w-out_w)/2}.
  4125. This expression is evaluated per-frame.
  4126. @item y
  4127. The vertical position, in the input video, of the top edge of the output video.
  4128. It defaults to @code{(in_h-out_h)/2}.
  4129. This expression is evaluated per-frame.
  4130. @item keep_aspect
  4131. If set to 1 will force the output display aspect ratio
  4132. to be the same of the input, by changing the output sample aspect
  4133. ratio. It defaults to 0.
  4134. @end table
  4135. The @var{out_w}, @var{out_h}, @var{x}, @var{y} parameters are
  4136. expressions containing the following constants:
  4137. @table @option
  4138. @item x
  4139. @item y
  4140. The computed values for @var{x} and @var{y}. They are evaluated for
  4141. each new frame.
  4142. @item in_w
  4143. @item in_h
  4144. The input width and height.
  4145. @item iw
  4146. @item ih
  4147. These are the same as @var{in_w} and @var{in_h}.
  4148. @item out_w
  4149. @item out_h
  4150. The output (cropped) width and height.
  4151. @item ow
  4152. @item oh
  4153. These are the same as @var{out_w} and @var{out_h}.
  4154. @item a
  4155. same as @var{iw} / @var{ih}
  4156. @item sar
  4157. input sample aspect ratio
  4158. @item dar
  4159. input display aspect ratio, it is the same as (@var{iw} / @var{ih}) * @var{sar}
  4160. @item hsub
  4161. @item vsub
  4162. horizontal and vertical chroma subsample values. For example for the
  4163. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  4164. @item n
  4165. The number of the input frame, starting from 0.
  4166. @item pos
  4167. the position in the file of the input frame, NAN if unknown
  4168. @item t
  4169. The timestamp expressed in seconds. It's NAN if the input timestamp is unknown.
  4170. @end table
  4171. The expression for @var{out_w} may depend on the value of @var{out_h},
  4172. and the expression for @var{out_h} may depend on @var{out_w}, but they
  4173. cannot depend on @var{x} and @var{y}, as @var{x} and @var{y} are
  4174. evaluated after @var{out_w} and @var{out_h}.
  4175. The @var{x} and @var{y} parameters specify the expressions for the
  4176. position of the top-left corner of the output (non-cropped) area. They
  4177. are evaluated for each frame. If the evaluated value is not valid, it
  4178. is approximated to the nearest valid value.
  4179. The expression for @var{x} may depend on @var{y}, and the expression
  4180. for @var{y} may depend on @var{x}.
  4181. @subsection Examples
  4182. @itemize
  4183. @item
  4184. Crop area with size 100x100 at position (12,34).
  4185. @example
  4186. crop=100:100:12:34
  4187. @end example
  4188. Using named options, the example above becomes:
  4189. @example
  4190. crop=w=100:h=100:x=12:y=34
  4191. @end example
  4192. @item
  4193. Crop the central input area with size 100x100:
  4194. @example
  4195. crop=100:100
  4196. @end example
  4197. @item
  4198. Crop the central input area with size 2/3 of the input video:
  4199. @example
  4200. crop=2/3*in_w:2/3*in_h
  4201. @end example
  4202. @item
  4203. Crop the input video central square:
  4204. @example
  4205. crop=out_w=in_h
  4206. crop=in_h
  4207. @end example
  4208. @item
  4209. Delimit the rectangle with the top-left corner placed at position
  4210. 100:100 and the right-bottom corner corresponding to the right-bottom
  4211. corner of the input image.
  4212. @example
  4213. crop=in_w-100:in_h-100:100:100
  4214. @end example
  4215. @item
  4216. Crop 10 pixels from the left and right borders, and 20 pixels from
  4217. the top and bottom borders
  4218. @example
  4219. crop=in_w-2*10:in_h-2*20
  4220. @end example
  4221. @item
  4222. Keep only the bottom right quarter of the input image:
  4223. @example
  4224. crop=in_w/2:in_h/2:in_w/2:in_h/2
  4225. @end example
  4226. @item
  4227. Crop height for getting Greek harmony:
  4228. @example
  4229. crop=in_w:1/PHI*in_w
  4230. @end example
  4231. @item
  4232. Apply trembling effect:
  4233. @example
  4234. crop=in_w/2:in_h/2:(in_w-out_w)/2+((in_w-out_w)/2)*sin(n/10):(in_h-out_h)/2 +((in_h-out_h)/2)*sin(n/7)
  4235. @end example
  4236. @item
  4237. Apply erratic camera effect depending on timestamp:
  4238. @example
  4239. crop=in_w/2:in_h/2:(in_w-out_w)/2+((in_w-out_w)/2)*sin(t*10):(in_h-out_h)/2 +((in_h-out_h)/2)*sin(t*13)"
  4240. @end example
  4241. @item
  4242. Set x depending on the value of y:
  4243. @example
  4244. crop=in_w/2:in_h/2:y:10+10*sin(n/10)
  4245. @end example
  4246. @end itemize
  4247. @subsection Commands
  4248. This filter supports the following commands:
  4249. @table @option
  4250. @item w, out_w
  4251. @item h, out_h
  4252. @item x
  4253. @item y
  4254. Set width/height of the output video and the horizontal/vertical position
  4255. in the input video.
  4256. The command accepts the same syntax of the corresponding option.
  4257. If the specified expression is not valid, it is kept at its current
  4258. value.
  4259. @end table
  4260. @section cropdetect
  4261. Auto-detect the crop size.
  4262. It calculates the necessary cropping parameters and prints the
  4263. recommended parameters via the logging system. The detected dimensions
  4264. correspond to the non-black area of the input video.
  4265. It accepts the following parameters:
  4266. @table @option
  4267. @item limit
  4268. Set higher black value threshold, which can be optionally specified
  4269. from nothing (0) to everything (255 for 8bit based formats). An intensity
  4270. value greater to the set value is considered non-black. It defaults to 24.
  4271. You can also specify a value between 0.0 and 1.0 which will be scaled depending
  4272. on the bitdepth of the pixel format.
  4273. @item round
  4274. The value which the width/height should be divisible by. It defaults to
  4275. 16. The offset is automatically adjusted to center the video. Use 2 to
  4276. get only even dimensions (needed for 4:2:2 video). 16 is best when
  4277. encoding to most video codecs.
  4278. @item reset_count, reset
  4279. Set the counter that determines after how many frames cropdetect will
  4280. reset the previously detected largest video area and start over to
  4281. detect the current optimal crop area. Default value is 0.
  4282. This can be useful when channel logos distort the video area. 0
  4283. indicates 'never reset', and returns the largest area encountered during
  4284. playback.
  4285. @end table
  4286. @anchor{curves}
  4287. @section curves
  4288. Apply color adjustments using curves.
  4289. This filter is similar to the Adobe Photoshop and GIMP curves tools. Each
  4290. component (red, green and blue) has its values defined by @var{N} key points
  4291. tied from each other using a smooth curve. The x-axis represents the pixel
  4292. values from the input frame, and the y-axis the new pixel values to be set for
  4293. the output frame.
  4294. By default, a component curve is defined by the two points @var{(0;0)} and
  4295. @var{(1;1)}. This creates a straight line where each original pixel value is
  4296. "adjusted" to its own value, which means no change to the image.
  4297. The filter allows you to redefine these two points and add some more. A new
  4298. curve (using a natural cubic spline interpolation) will be define to pass
  4299. smoothly through all these new coordinates. The new defined points needs to be
  4300. strictly increasing over the x-axis, and their @var{x} and @var{y} values must
  4301. be in the @var{[0;1]} interval. If the computed curves happened to go outside
  4302. the vector spaces, the values will be clipped accordingly.
  4303. If there is no key point defined in @code{x=0}, the filter will automatically
  4304. insert a @var{(0;0)} point. In the same way, if there is no key point defined
  4305. in @code{x=1}, the filter will automatically insert a @var{(1;1)} point.
  4306. The filter accepts the following options:
  4307. @table @option
  4308. @item preset
  4309. Select one of the available color presets. This option can be used in addition
  4310. to the @option{r}, @option{g}, @option{b} parameters; in this case, the later
  4311. options takes priority on the preset values.
  4312. Available presets are:
  4313. @table @samp
  4314. @item none
  4315. @item color_negative
  4316. @item cross_process
  4317. @item darker
  4318. @item increase_contrast
  4319. @item lighter
  4320. @item linear_contrast
  4321. @item medium_contrast
  4322. @item negative
  4323. @item strong_contrast
  4324. @item vintage
  4325. @end table
  4326. Default is @code{none}.
  4327. @item master, m
  4328. Set the master key points. These points will define a second pass mapping. It
  4329. is sometimes called a "luminance" or "value" mapping. It can be used with
  4330. @option{r}, @option{g}, @option{b} or @option{all} since it acts like a
  4331. post-processing LUT.
  4332. @item red, r
  4333. Set the key points for the red component.
  4334. @item green, g
  4335. Set the key points for the green component.
  4336. @item blue, b
  4337. Set the key points for the blue component.
  4338. @item all
  4339. Set the key points for all components (not including master).
  4340. Can be used in addition to the other key points component
  4341. options. In this case, the unset component(s) will fallback on this
  4342. @option{all} setting.
  4343. @item psfile
  4344. Specify a Photoshop curves file (@code{.acv}) to import the settings from.
  4345. @end table
  4346. To avoid some filtergraph syntax conflicts, each key points list need to be
  4347. defined using the following syntax: @code{x0/y0 x1/y1 x2/y2 ...}.
  4348. @subsection Examples
  4349. @itemize
  4350. @item
  4351. Increase slightly the middle level of blue:
  4352. @example
  4353. curves=blue='0.5/0.58'
  4354. @end example
  4355. @item
  4356. Vintage effect:
  4357. @example
  4358. curves=r='0/0.11 .42/.51 1/0.95':g='0.50/0.48':b='0/0.22 .49/.44 1/0.8'
  4359. @end example
  4360. Here we obtain the following coordinates for each components:
  4361. @table @var
  4362. @item red
  4363. @code{(0;0.11) (0.42;0.51) (1;0.95)}
  4364. @item green
  4365. @code{(0;0) (0.50;0.48) (1;1)}
  4366. @item blue
  4367. @code{(0;0.22) (0.49;0.44) (1;0.80)}
  4368. @end table
  4369. @item
  4370. The previous example can also be achieved with the associated built-in preset:
  4371. @example
  4372. curves=preset=vintage
  4373. @end example
  4374. @item
  4375. Or simply:
  4376. @example
  4377. curves=vintage
  4378. @end example
  4379. @item
  4380. Use a Photoshop preset and redefine the points of the green component:
  4381. @example
  4382. curves=psfile='MyCurvesPresets/purple.acv':green='0.45/0.53'
  4383. @end example
  4384. @end itemize
  4385. @section datascope
  4386. Video data analysis filter.
  4387. This filter shows hexadecimal pixel values of part of video.
  4388. The filter accepts the following options:
  4389. @table @option
  4390. @item size, s
  4391. Set output video size.
  4392. @item x
  4393. Set x offset from where to pick pixels.
  4394. @item y
  4395. Set y offset from where to pick pixels.
  4396. @item mode
  4397. Set scope mode, can be one of the following:
  4398. @table @samp
  4399. @item mono
  4400. Draw hexadecimal pixel values with white color on black background.
  4401. @item color
  4402. Draw hexadecimal pixel values with input video pixel color on black
  4403. background.
  4404. @item color2
  4405. Draw hexadecimal pixel values on color background picked from input video,
  4406. the text color is picked in such way so its always visible.
  4407. @end table
  4408. @item axis
  4409. Draw rows and columns numbers on left and top of video.
  4410. @end table
  4411. @section dctdnoiz
  4412. Denoise frames using 2D DCT (frequency domain filtering).
  4413. This filter is not designed for real time.
  4414. The filter accepts the following options:
  4415. @table @option
  4416. @item sigma, s
  4417. Set the noise sigma constant.
  4418. This @var{sigma} defines a hard threshold of @code{3 * sigma}; every DCT
  4419. coefficient (absolute value) below this threshold with be dropped.
  4420. If you need a more advanced filtering, see @option{expr}.
  4421. Default is @code{0}.
  4422. @item overlap
  4423. Set number overlapping pixels for each block. Since the filter can be slow, you
  4424. may want to reduce this value, at the cost of a less effective filter and the
  4425. risk of various artefacts.
  4426. If the overlapping value doesn't permit processing the whole input width or
  4427. height, a warning will be displayed and according borders won't be denoised.
  4428. Default value is @var{blocksize}-1, which is the best possible setting.
  4429. @item expr, e
  4430. Set the coefficient factor expression.
  4431. For each coefficient of a DCT block, this expression will be evaluated as a
  4432. multiplier value for the coefficient.
  4433. If this is option is set, the @option{sigma} option will be ignored.
  4434. The absolute value of the coefficient can be accessed through the @var{c}
  4435. variable.
  4436. @item n
  4437. Set the @var{blocksize} using the number of bits. @code{1<<@var{n}} defines the
  4438. @var{blocksize}, which is the width and height of the processed blocks.
  4439. The default value is @var{3} (8x8) and can be raised to @var{4} for a
  4440. @var{blocksize} of 16x16. Note that changing this setting has huge consequences
  4441. on the speed processing. Also, a larger block size does not necessarily means a
  4442. better de-noising.
  4443. @end table
  4444. @subsection Examples
  4445. Apply a denoise with a @option{sigma} of @code{4.5}:
  4446. @example
  4447. dctdnoiz=4.5
  4448. @end example
  4449. The same operation can be achieved using the expression system:
  4450. @example
  4451. dctdnoiz=e='gte(c, 4.5*3)'
  4452. @end example
  4453. Violent denoise using a block size of @code{16x16}:
  4454. @example
  4455. dctdnoiz=15:n=4
  4456. @end example
  4457. @section deband
  4458. Remove banding artifacts from input video.
  4459. It works by replacing banded pixels with average value of referenced pixels.
  4460. The filter accepts the following options:
  4461. @table @option
  4462. @item 1thr
  4463. @item 2thr
  4464. @item 3thr
  4465. @item 4thr
  4466. Set banding detection threshold for each plane. Default is 0.02.
  4467. Valid range is 0.00003 to 0.5.
  4468. If difference between current pixel and reference pixel is less than threshold,
  4469. it will be considered as banded.
  4470. @item range, r
  4471. Banding detection range in pixels. Default is 16. If positive, random number
  4472. in range 0 to set value will be used. If negative, exact absolute value
  4473. will be used.
  4474. The range defines square of four pixels around current pixel.
  4475. @item direction, d
  4476. Set direction in radians from which four pixel will be compared. If positive,
  4477. random direction from 0 to set direction will be picked. If negative, exact of
  4478. absolute value will be picked. For example direction 0, -PI or -2*PI radians
  4479. will pick only pixels on same row and -PI/2 will pick only pixels on same
  4480. column.
  4481. @item blur
  4482. If enabled, current pixel is compared with average value of all four
  4483. surrounding pixels. The default is enabled. If disabled current pixel is
  4484. compared with all four surrounding pixels. The pixel is considered banded
  4485. if only all four differences with surrounding pixels are less than threshold.
  4486. @end table
  4487. @anchor{decimate}
  4488. @section decimate
  4489. Drop duplicated frames at regular intervals.
  4490. The filter accepts the following options:
  4491. @table @option
  4492. @item cycle
  4493. Set the number of frames from which one will be dropped. Setting this to
  4494. @var{N} means one frame in every batch of @var{N} frames will be dropped.
  4495. Default is @code{5}.
  4496. @item dupthresh
  4497. Set the threshold for duplicate detection. If the difference metric for a frame
  4498. is less than or equal to this value, then it is declared as duplicate. Default
  4499. is @code{1.1}
  4500. @item scthresh
  4501. Set scene change threshold. Default is @code{15}.
  4502. @item blockx
  4503. @item blocky
  4504. Set the size of the x and y-axis blocks used during metric calculations.
  4505. Larger blocks give better noise suppression, but also give worse detection of
  4506. small movements. Must be a power of two. Default is @code{32}.
  4507. @item ppsrc
  4508. Mark main input as a pre-processed input and activate clean source input
  4509. stream. This allows the input to be pre-processed with various filters to help
  4510. the metrics calculation while keeping the frame selection lossless. When set to
  4511. @code{1}, the first stream is for the pre-processed input, and the second
  4512. stream is the clean source from where the kept frames are chosen. Default is
  4513. @code{0}.
  4514. @item chroma
  4515. Set whether or not chroma is considered in the metric calculations. Default is
  4516. @code{1}.
  4517. @end table
  4518. @section deflate
  4519. Apply deflate effect to the video.
  4520. This filter replaces the pixel by the local(3x3) average by taking into account
  4521. only values lower than the pixel.
  4522. It accepts the following options:
  4523. @table @option
  4524. @item threshold0
  4525. @item threshold1
  4526. @item threshold2
  4527. @item threshold3
  4528. Limit the maximum change for each plane, default is 65535.
  4529. If 0, plane will remain unchanged.
  4530. @end table
  4531. @section dejudder
  4532. Remove judder produced by partially interlaced telecined content.
  4533. Judder can be introduced, for instance, by @ref{pullup} filter. If the original
  4534. source was partially telecined content then the output of @code{pullup,dejudder}
  4535. will have a variable frame rate. May change the recorded frame rate of the
  4536. container. Aside from that change, this filter will not affect constant frame
  4537. rate video.
  4538. The option available in this filter is:
  4539. @table @option
  4540. @item cycle
  4541. Specify the length of the window over which the judder repeats.
  4542. Accepts any integer greater than 1. Useful values are:
  4543. @table @samp
  4544. @item 4
  4545. If the original was telecined from 24 to 30 fps (Film to NTSC).
  4546. @item 5
  4547. If the original was telecined from 25 to 30 fps (PAL to NTSC).
  4548. @item 20
  4549. If a mixture of the two.
  4550. @end table
  4551. The default is @samp{4}.
  4552. @end table
  4553. @section delogo
  4554. Suppress a TV station logo by a simple interpolation of the surrounding
  4555. pixels. Just set a rectangle covering the logo and watch it disappear
  4556. (and sometimes something even uglier appear - your mileage may vary).
  4557. It accepts the following parameters:
  4558. @table @option
  4559. @item x
  4560. @item y
  4561. Specify the top left corner coordinates of the logo. They must be
  4562. specified.
  4563. @item w
  4564. @item h
  4565. Specify the width and height of the logo to clear. They must be
  4566. specified.
  4567. @item band, t
  4568. Specify the thickness of the fuzzy edge of the rectangle (added to
  4569. @var{w} and @var{h}). The default value is 1. This option is
  4570. deprecated, setting higher values should no longer be necessary and
  4571. is not recommended.
  4572. @item show
  4573. When set to 1, a green rectangle is drawn on the screen to simplify
  4574. finding the right @var{x}, @var{y}, @var{w}, and @var{h} parameters.
  4575. The default value is 0.
  4576. The rectangle is drawn on the outermost pixels which will be (partly)
  4577. replaced with interpolated values. The values of the next pixels
  4578. immediately outside this rectangle in each direction will be used to
  4579. compute the interpolated pixel values inside the rectangle.
  4580. @end table
  4581. @subsection Examples
  4582. @itemize
  4583. @item
  4584. Set a rectangle covering the area with top left corner coordinates 0,0
  4585. and size 100x77, and a band of size 10:
  4586. @example
  4587. delogo=x=0:y=0:w=100:h=77:band=10
  4588. @end example
  4589. @end itemize
  4590. @section deshake
  4591. Attempt to fix small changes in horizontal and/or vertical shift. This
  4592. filter helps remove camera shake from hand-holding a camera, bumping a
  4593. tripod, moving on a vehicle, etc.
  4594. The filter accepts the following options:
  4595. @table @option
  4596. @item x
  4597. @item y
  4598. @item w
  4599. @item h
  4600. Specify a rectangular area where to limit the search for motion
  4601. vectors.
  4602. If desired the search for motion vectors can be limited to a
  4603. rectangular area of the frame defined by its top left corner, width
  4604. and height. These parameters have the same meaning as the drawbox
  4605. filter which can be used to visualise the position of the bounding
  4606. box.
  4607. This is useful when simultaneous movement of subjects within the frame
  4608. might be confused for camera motion by the motion vector search.
  4609. If any or all of @var{x}, @var{y}, @var{w} and @var{h} are set to -1
  4610. then the full frame is used. This allows later options to be set
  4611. without specifying the bounding box for the motion vector search.
  4612. Default - search the whole frame.
  4613. @item rx
  4614. @item ry
  4615. Specify the maximum extent of movement in x and y directions in the
  4616. range 0-64 pixels. Default 16.
  4617. @item edge
  4618. Specify how to generate pixels to fill blanks at the edge of the
  4619. frame. Available values are:
  4620. @table @samp
  4621. @item blank, 0
  4622. Fill zeroes at blank locations
  4623. @item original, 1
  4624. Original image at blank locations
  4625. @item clamp, 2
  4626. Extruded edge value at blank locations
  4627. @item mirror, 3
  4628. Mirrored edge at blank locations
  4629. @end table
  4630. Default value is @samp{mirror}.
  4631. @item blocksize
  4632. Specify the blocksize to use for motion search. Range 4-128 pixels,
  4633. default 8.
  4634. @item contrast
  4635. Specify the contrast threshold for blocks. Only blocks with more than
  4636. the specified contrast (difference between darkest and lightest
  4637. pixels) will be considered. Range 1-255, default 125.
  4638. @item search
  4639. Specify the search strategy. Available values are:
  4640. @table @samp
  4641. @item exhaustive, 0
  4642. Set exhaustive search
  4643. @item less, 1
  4644. Set less exhaustive search.
  4645. @end table
  4646. Default value is @samp{exhaustive}.
  4647. @item filename
  4648. If set then a detailed log of the motion search is written to the
  4649. specified file.
  4650. @item opencl
  4651. If set to 1, specify using OpenCL capabilities, only available if
  4652. FFmpeg was configured with @code{--enable-opencl}. Default value is 0.
  4653. @end table
  4654. @section detelecine
  4655. Apply an exact inverse of the telecine operation. It requires a predefined
  4656. pattern specified using the pattern option which must be the same as that passed
  4657. to the telecine filter.
  4658. This filter accepts the following options:
  4659. @table @option
  4660. @item first_field
  4661. @table @samp
  4662. @item top, t
  4663. top field first
  4664. @item bottom, b
  4665. bottom field first
  4666. The default value is @code{top}.
  4667. @end table
  4668. @item pattern
  4669. A string of numbers representing the pulldown pattern you wish to apply.
  4670. The default value is @code{23}.
  4671. @item start_frame
  4672. A number representing position of the first frame with respect to the telecine
  4673. pattern. This is to be used if the stream is cut. The default value is @code{0}.
  4674. @end table
  4675. @section dilation
  4676. Apply dilation effect to the video.
  4677. This filter replaces the pixel by the local(3x3) maximum.
  4678. It accepts the following options:
  4679. @table @option
  4680. @item threshold0
  4681. @item threshold1
  4682. @item threshold2
  4683. @item threshold3
  4684. Limit the maximum change for each plane, default is 65535.
  4685. If 0, plane will remain unchanged.
  4686. @item coordinates
  4687. Flag which specifies the pixel to refer to. Default is 255 i.e. all eight
  4688. pixels are used.
  4689. Flags to local 3x3 coordinates maps like this:
  4690. 1 2 3
  4691. 4 5
  4692. 6 7 8
  4693. @end table
  4694. @section displace
  4695. Displace pixels as indicated by second and third input stream.
  4696. It takes three input streams and outputs one stream, the first input is the
  4697. source, and second and third input are displacement maps.
  4698. The second input specifies how much to displace pixels along the
  4699. x-axis, while the third input specifies how much to displace pixels
  4700. along the y-axis.
  4701. If one of displacement map streams terminates, last frame from that
  4702. displacement map will be used.
  4703. Note that once generated, displacements maps can be reused over and over again.
  4704. A description of the accepted options follows.
  4705. @table @option
  4706. @item edge
  4707. Set displace behavior for pixels that are out of range.
  4708. Available values are:
  4709. @table @samp
  4710. @item blank
  4711. Missing pixels are replaced by black pixels.
  4712. @item smear
  4713. Adjacent pixels will spread out to replace missing pixels.
  4714. @item wrap
  4715. Out of range pixels are wrapped so they point to pixels of other side.
  4716. @end table
  4717. Default is @samp{smear}.
  4718. @end table
  4719. @subsection Examples
  4720. @itemize
  4721. @item
  4722. Add ripple effect to rgb input of video size hd720:
  4723. @example
  4724. ffmpeg -i INPUT -f lavfi -i nullsrc=s=hd720,lutrgb=128:128:128 -f lavfi -i nullsrc=s=hd720,geq='r=128+30*sin(2*PI*X/400+T):g=128+30*sin(2*PI*X/400+T):b=128+30*sin(2*PI*X/400+T)' -lavfi '[0][1][2]displace' OUTPUT
  4725. @end example
  4726. @item
  4727. Add wave effect to rgb input of video size hd720:
  4728. @example
  4729. ffmpeg -i INPUT -f lavfi -i nullsrc=hd720,geq='r=128+80*(sin(sqrt((X-W/2)*(X-W/2)+(Y-H/2)*(Y-H/2))/220*2*PI+T)):g=128+80*(sin(sqrt((X-W/2)*(X-W/2)+(Y-H/2)*(Y-H/2))/220*2*PI+T)):b=128+80*(sin(sqrt((X-W/2)*(X-W/2)+(Y-H/2)*(Y-H/2))/220*2*PI+T))' -lavfi '[1]split[x][y],[0][x][y]displace' OUTPUT
  4730. @end example
  4731. @end itemize
  4732. @section drawbox
  4733. Draw a colored box on the input image.
  4734. It accepts the following parameters:
  4735. @table @option
  4736. @item x
  4737. @item y
  4738. The expressions which specify the top left corner coordinates of the box. It defaults to 0.
  4739. @item width, w
  4740. @item height, h
  4741. The expressions which specify the width and height of the box; if 0 they are interpreted as
  4742. the input width and height. It defaults to 0.
  4743. @item color, c
  4744. Specify the color of the box to write. For the general syntax of this option,
  4745. check the "Color" section in the ffmpeg-utils manual. If the special
  4746. value @code{invert} is used, the box edge color is the same as the
  4747. video with inverted luma.
  4748. @item thickness, t
  4749. The expression which sets the thickness of the box edge. Default value is @code{3}.
  4750. See below for the list of accepted constants.
  4751. @end table
  4752. The parameters for @var{x}, @var{y}, @var{w} and @var{h} and @var{t} are expressions containing the
  4753. following constants:
  4754. @table @option
  4755. @item dar
  4756. The input display aspect ratio, it is the same as (@var{w} / @var{h}) * @var{sar}.
  4757. @item hsub
  4758. @item vsub
  4759. horizontal and vertical chroma subsample values. For example for the
  4760. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  4761. @item in_h, ih
  4762. @item in_w, iw
  4763. The input width and height.
  4764. @item sar
  4765. The input sample aspect ratio.
  4766. @item x
  4767. @item y
  4768. The x and y offset coordinates where the box is drawn.
  4769. @item w
  4770. @item h
  4771. The width and height of the drawn box.
  4772. @item t
  4773. The thickness of the drawn box.
  4774. These constants allow the @var{x}, @var{y}, @var{w}, @var{h} and @var{t} expressions to refer to
  4775. each other, so you may for example specify @code{y=x/dar} or @code{h=w/dar}.
  4776. @end table
  4777. @subsection Examples
  4778. @itemize
  4779. @item
  4780. Draw a black box around the edge of the input image:
  4781. @example
  4782. drawbox
  4783. @end example
  4784. @item
  4785. Draw a box with color red and an opacity of 50%:
  4786. @example
  4787. drawbox=10:20:200:60:red@@0.5
  4788. @end example
  4789. The previous example can be specified as:
  4790. @example
  4791. drawbox=x=10:y=20:w=200:h=60:color=red@@0.5
  4792. @end example
  4793. @item
  4794. Fill the box with pink color:
  4795. @example
  4796. drawbox=x=10:y=10:w=100:h=100:color=pink@@0.5:t=max
  4797. @end example
  4798. @item
  4799. Draw a 2-pixel red 2.40:1 mask:
  4800. @example
  4801. drawbox=x=-t:y=0.5*(ih-iw/2.4)-t:w=iw+t*2:h=iw/2.4+t*2:t=2:c=red
  4802. @end example
  4803. @end itemize
  4804. @section drawgraph, adrawgraph
  4805. Draw a graph using input video or audio metadata.
  4806. It accepts the following parameters:
  4807. @table @option
  4808. @item m1
  4809. Set 1st frame metadata key from which metadata values will be used to draw a graph.
  4810. @item fg1
  4811. Set 1st foreground color expression.
  4812. @item m2
  4813. Set 2nd frame metadata key from which metadata values will be used to draw a graph.
  4814. @item fg2
  4815. Set 2nd foreground color expression.
  4816. @item m3
  4817. Set 3rd frame metadata key from which metadata values will be used to draw a graph.
  4818. @item fg3
  4819. Set 3rd foreground color expression.
  4820. @item m4
  4821. Set 4th frame metadata key from which metadata values will be used to draw a graph.
  4822. @item fg4
  4823. Set 4th foreground color expression.
  4824. @item min
  4825. Set minimal value of metadata value.
  4826. @item max
  4827. Set maximal value of metadata value.
  4828. @item bg
  4829. Set graph background color. Default is white.
  4830. @item mode
  4831. Set graph mode.
  4832. Available values for mode is:
  4833. @table @samp
  4834. @item bar
  4835. @item dot
  4836. @item line
  4837. @end table
  4838. Default is @code{line}.
  4839. @item slide
  4840. Set slide mode.
  4841. Available values for slide is:
  4842. @table @samp
  4843. @item frame
  4844. Draw new frame when right border is reached.
  4845. @item replace
  4846. Replace old columns with new ones.
  4847. @item scroll
  4848. Scroll from right to left.
  4849. @item rscroll
  4850. Scroll from left to right.
  4851. @end table
  4852. Default is @code{frame}.
  4853. @item size
  4854. Set size of graph video. For the syntax of this option, check the
  4855. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  4856. The default value is @code{900x256}.
  4857. The foreground color expressions can use the following variables:
  4858. @table @option
  4859. @item MIN
  4860. Minimal value of metadata value.
  4861. @item MAX
  4862. Maximal value of metadata value.
  4863. @item VAL
  4864. Current metadata key value.
  4865. @end table
  4866. The color is defined as 0xAABBGGRR.
  4867. @end table
  4868. Example using metadata from @ref{signalstats} filter:
  4869. @example
  4870. signalstats,drawgraph=lavfi.signalstats.YAVG:min=0:max=255
  4871. @end example
  4872. Example using metadata from @ref{ebur128} filter:
  4873. @example
  4874. ebur128=metadata=1,adrawgraph=lavfi.r128.M:min=-120:max=5
  4875. @end example
  4876. @section drawgrid
  4877. Draw a grid on the input image.
  4878. It accepts the following parameters:
  4879. @table @option
  4880. @item x
  4881. @item y
  4882. The expressions which specify the coordinates of some point of grid intersection (meant to configure offset). Both default to 0.
  4883. @item width, w
  4884. @item height, h
  4885. The expressions which specify the width and height of the grid cell, if 0 they are interpreted as the
  4886. input width and height, respectively, minus @code{thickness}, so image gets
  4887. framed. Default to 0.
  4888. @item color, c
  4889. Specify the color of the grid. For the general syntax of this option,
  4890. check the "Color" section in the ffmpeg-utils manual. If the special
  4891. value @code{invert} is used, the grid color is the same as the
  4892. video with inverted luma.
  4893. @item thickness, t
  4894. The expression which sets the thickness of the grid line. Default value is @code{1}.
  4895. See below for the list of accepted constants.
  4896. @end table
  4897. The parameters for @var{x}, @var{y}, @var{w} and @var{h} and @var{t} are expressions containing the
  4898. following constants:
  4899. @table @option
  4900. @item dar
  4901. The input display aspect ratio, it is the same as (@var{w} / @var{h}) * @var{sar}.
  4902. @item hsub
  4903. @item vsub
  4904. horizontal and vertical chroma subsample values. For example for the
  4905. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  4906. @item in_h, ih
  4907. @item in_w, iw
  4908. The input grid cell width and height.
  4909. @item sar
  4910. The input sample aspect ratio.
  4911. @item x
  4912. @item y
  4913. The x and y coordinates of some point of grid intersection (meant to configure offset).
  4914. @item w
  4915. @item h
  4916. The width and height of the drawn cell.
  4917. @item t
  4918. The thickness of the drawn cell.
  4919. These constants allow the @var{x}, @var{y}, @var{w}, @var{h} and @var{t} expressions to refer to
  4920. each other, so you may for example specify @code{y=x/dar} or @code{h=w/dar}.
  4921. @end table
  4922. @subsection Examples
  4923. @itemize
  4924. @item
  4925. Draw a grid with cell 100x100 pixels, thickness 2 pixels, with color red and an opacity of 50%:
  4926. @example
  4927. drawgrid=width=100:height=100:thickness=2:color=red@@0.5
  4928. @end example
  4929. @item
  4930. Draw a white 3x3 grid with an opacity of 50%:
  4931. @example
  4932. drawgrid=w=iw/3:h=ih/3:t=2:c=white@@0.5
  4933. @end example
  4934. @end itemize
  4935. @anchor{drawtext}
  4936. @section drawtext
  4937. Draw a text string or text from a specified file on top of a video, using the
  4938. libfreetype library.
  4939. To enable compilation of this filter, you need to configure FFmpeg with
  4940. @code{--enable-libfreetype}.
  4941. To enable default font fallback and the @var{font} option you need to
  4942. configure FFmpeg with @code{--enable-libfontconfig}.
  4943. To enable the @var{text_shaping} option, you need to configure FFmpeg with
  4944. @code{--enable-libfribidi}.
  4945. @subsection Syntax
  4946. It accepts the following parameters:
  4947. @table @option
  4948. @item box
  4949. Used to draw a box around text using the background color.
  4950. The value must be either 1 (enable) or 0 (disable).
  4951. The default value of @var{box} is 0.
  4952. @item boxborderw
  4953. Set the width of the border to be drawn around the box using @var{boxcolor}.
  4954. The default value of @var{boxborderw} is 0.
  4955. @item boxcolor
  4956. The color to be used for drawing box around text. For the syntax of this
  4957. option, check the "Color" section in the ffmpeg-utils manual.
  4958. The default value of @var{boxcolor} is "white".
  4959. @item borderw
  4960. Set the width of the border to be drawn around the text using @var{bordercolor}.
  4961. The default value of @var{borderw} is 0.
  4962. @item bordercolor
  4963. Set the color to be used for drawing border around text. For the syntax of this
  4964. option, check the "Color" section in the ffmpeg-utils manual.
  4965. The default value of @var{bordercolor} is "black".
  4966. @item expansion
  4967. Select how the @var{text} is expanded. Can be either @code{none},
  4968. @code{strftime} (deprecated) or
  4969. @code{normal} (default). See the @ref{drawtext_expansion, Text expansion} section
  4970. below for details.
  4971. @item fix_bounds
  4972. If true, check and fix text coords to avoid clipping.
  4973. @item fontcolor
  4974. The color to be used for drawing fonts. For the syntax of this option, check
  4975. the "Color" section in the ffmpeg-utils manual.
  4976. The default value of @var{fontcolor} is "black".
  4977. @item fontcolor_expr
  4978. String which is expanded the same way as @var{text} to obtain dynamic
  4979. @var{fontcolor} value. By default this option has empty value and is not
  4980. processed. When this option is set, it overrides @var{fontcolor} option.
  4981. @item font
  4982. The font family to be used for drawing text. By default Sans.
  4983. @item fontfile
  4984. The font file to be used for drawing text. The path must be included.
  4985. This parameter is mandatory if the fontconfig support is disabled.
  4986. @item draw
  4987. This option does not exist, please see the timeline system
  4988. @item alpha
  4989. Draw the text applying alpha blending. The value can
  4990. be either a number between 0.0 and 1.0
  4991. The expression accepts the same variables @var{x, y} do.
  4992. The default value is 1.
  4993. Please see fontcolor_expr
  4994. @item fontsize
  4995. The font size to be used for drawing text.
  4996. The default value of @var{fontsize} is 16.
  4997. @item text_shaping
  4998. If set to 1, attempt to shape the text (for example, reverse the order of
  4999. right-to-left text and join Arabic characters) before drawing it.
  5000. Otherwise, just draw the text exactly as given.
  5001. By default 1 (if supported).
  5002. @item ft_load_flags
  5003. The flags to be used for loading the fonts.
  5004. The flags map the corresponding flags supported by libfreetype, and are
  5005. a combination of the following values:
  5006. @table @var
  5007. @item default
  5008. @item no_scale
  5009. @item no_hinting
  5010. @item render
  5011. @item no_bitmap
  5012. @item vertical_layout
  5013. @item force_autohint
  5014. @item crop_bitmap
  5015. @item pedantic
  5016. @item ignore_global_advance_width
  5017. @item no_recurse
  5018. @item ignore_transform
  5019. @item monochrome
  5020. @item linear_design
  5021. @item no_autohint
  5022. @end table
  5023. Default value is "default".
  5024. For more information consult the documentation for the FT_LOAD_*
  5025. libfreetype flags.
  5026. @item shadowcolor
  5027. The color to be used for drawing a shadow behind the drawn text. For the
  5028. syntax of this option, check the "Color" section in the ffmpeg-utils manual.
  5029. The default value of @var{shadowcolor} is "black".
  5030. @item shadowx
  5031. @item shadowy
  5032. The x and y offsets for the text shadow position with respect to the
  5033. position of the text. They can be either positive or negative
  5034. values. The default value for both is "0".
  5035. @item start_number
  5036. The starting frame number for the n/frame_num variable. The default value
  5037. is "0".
  5038. @item tabsize
  5039. The size in number of spaces to use for rendering the tab.
  5040. Default value is 4.
  5041. @item timecode
  5042. Set the initial timecode representation in "hh:mm:ss[:;.]ff"
  5043. format. It can be used with or without text parameter. @var{timecode_rate}
  5044. option must be specified.
  5045. @item timecode_rate, rate, r
  5046. Set the timecode frame rate (timecode only).
  5047. @item text
  5048. The text string to be drawn. The text must be a sequence of UTF-8
  5049. encoded characters.
  5050. This parameter is mandatory if no file is specified with the parameter
  5051. @var{textfile}.
  5052. @item textfile
  5053. A text file containing text to be drawn. The text must be a sequence
  5054. of UTF-8 encoded characters.
  5055. This parameter is mandatory if no text string is specified with the
  5056. parameter @var{text}.
  5057. If both @var{text} and @var{textfile} are specified, an error is thrown.
  5058. @item reload
  5059. If set to 1, the @var{textfile} will be reloaded before each frame.
  5060. Be sure to update it atomically, or it may be read partially, or even fail.
  5061. @item x
  5062. @item y
  5063. The expressions which specify the offsets where text will be drawn
  5064. within the video frame. They are relative to the top/left border of the
  5065. output image.
  5066. The default value of @var{x} and @var{y} is "0".
  5067. See below for the list of accepted constants and functions.
  5068. @end table
  5069. The parameters for @var{x} and @var{y} are expressions containing the
  5070. following constants and functions:
  5071. @table @option
  5072. @item dar
  5073. input display aspect ratio, it is the same as (@var{w} / @var{h}) * @var{sar}
  5074. @item hsub
  5075. @item vsub
  5076. horizontal and vertical chroma subsample values. For example for the
  5077. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  5078. @item line_h, lh
  5079. the height of each text line
  5080. @item main_h, h, H
  5081. the input height
  5082. @item main_w, w, W
  5083. the input width
  5084. @item max_glyph_a, ascent
  5085. the maximum distance from the baseline to the highest/upper grid
  5086. coordinate used to place a glyph outline point, for all the rendered
  5087. glyphs.
  5088. It is a positive value, due to the grid's orientation with the Y axis
  5089. upwards.
  5090. @item max_glyph_d, descent
  5091. the maximum distance from the baseline to the lowest grid coordinate
  5092. used to place a glyph outline point, for all the rendered glyphs.
  5093. This is a negative value, due to the grid's orientation, with the Y axis
  5094. upwards.
  5095. @item max_glyph_h
  5096. maximum glyph height, that is the maximum height for all the glyphs
  5097. contained in the rendered text, it is equivalent to @var{ascent} -
  5098. @var{descent}.
  5099. @item max_glyph_w
  5100. maximum glyph width, that is the maximum width for all the glyphs
  5101. contained in the rendered text
  5102. @item n
  5103. the number of input frame, starting from 0
  5104. @item rand(min, max)
  5105. return a random number included between @var{min} and @var{max}
  5106. @item sar
  5107. The input sample aspect ratio.
  5108. @item t
  5109. timestamp expressed in seconds, NAN if the input timestamp is unknown
  5110. @item text_h, th
  5111. the height of the rendered text
  5112. @item text_w, tw
  5113. the width of the rendered text
  5114. @item x
  5115. @item y
  5116. the x and y offset coordinates where the text is drawn.
  5117. These parameters allow the @var{x} and @var{y} expressions to refer
  5118. each other, so you can for example specify @code{y=x/dar}.
  5119. @end table
  5120. @anchor{drawtext_expansion}
  5121. @subsection Text expansion
  5122. If @option{expansion} is set to @code{strftime},
  5123. the filter recognizes strftime() sequences in the provided text and
  5124. expands them accordingly. Check the documentation of strftime(). This
  5125. feature is deprecated.
  5126. If @option{expansion} is set to @code{none}, the text is printed verbatim.
  5127. If @option{expansion} is set to @code{normal} (which is the default),
  5128. the following expansion mechanism is used.
  5129. The backslash character @samp{\}, followed by any character, always expands to
  5130. the second character.
  5131. Sequence of the form @code{%@{...@}} are expanded. The text between the
  5132. braces is a function name, possibly followed by arguments separated by ':'.
  5133. If the arguments contain special characters or delimiters (':' or '@}'),
  5134. they should be escaped.
  5135. Note that they probably must also be escaped as the value for the
  5136. @option{text} option in the filter argument string and as the filter
  5137. argument in the filtergraph description, and possibly also for the shell,
  5138. that makes up to four levels of escaping; using a text file avoids these
  5139. problems.
  5140. The following functions are available:
  5141. @table @command
  5142. @item expr, e
  5143. The expression evaluation result.
  5144. It must take one argument specifying the expression to be evaluated,
  5145. which accepts the same constants and functions as the @var{x} and
  5146. @var{y} values. Note that not all constants should be used, for
  5147. example the text size is not known when evaluating the expression, so
  5148. the constants @var{text_w} and @var{text_h} will have an undefined
  5149. value.
  5150. @item expr_int_format, eif
  5151. Evaluate the expression's value and output as formatted integer.
  5152. The first argument is the expression to be evaluated, just as for the @var{expr} function.
  5153. The second argument specifies the output format. Allowed values are @samp{x},
  5154. @samp{X}, @samp{d} and @samp{u}. They are treated exactly as in the
  5155. @code{printf} function.
  5156. The third parameter is optional and sets the number of positions taken by the output.
  5157. It can be used to add padding with zeros from the left.
  5158. @item gmtime
  5159. The time at which the filter is running, expressed in UTC.
  5160. It can accept an argument: a strftime() format string.
  5161. @item localtime
  5162. The time at which the filter is running, expressed in the local time zone.
  5163. It can accept an argument: a strftime() format string.
  5164. @item metadata
  5165. Frame metadata. Takes one or two arguments.
  5166. The first argument is mandatory and specifies the metadata key.
  5167. The second argument is optional and specifies a default value, used when the
  5168. metadata key is not found or empty.
  5169. @item n, frame_num
  5170. The frame number, starting from 0.
  5171. @item pict_type
  5172. A 1 character description of the current picture type.
  5173. @item pts
  5174. The timestamp of the current frame.
  5175. It can take up to three arguments.
  5176. The first argument is the format of the timestamp; it defaults to @code{flt}
  5177. for seconds as a decimal number with microsecond accuracy; @code{hms} stands
  5178. for a formatted @var{[-]HH:MM:SS.mmm} timestamp with millisecond accuracy.
  5179. @code{gmtime} stands for the timestamp of the frame formatted as UTC time;
  5180. @code{localtime} stands for the timestamp of the frame formatted as
  5181. local time zone time.
  5182. The second argument is an offset added to the timestamp.
  5183. If the format is set to @code{localtime} or @code{gmtime},
  5184. a third argument may be supplied: a strftime() format string.
  5185. By default, @var{YYYY-MM-DD HH:MM:SS} format will be used.
  5186. @end table
  5187. @subsection Examples
  5188. @itemize
  5189. @item
  5190. Draw "Test Text" with font FreeSerif, using the default values for the
  5191. optional parameters.
  5192. @example
  5193. drawtext="fontfile=/usr/share/fonts/truetype/freefont/FreeSerif.ttf: text='Test Text'"
  5194. @end example
  5195. @item
  5196. Draw 'Test Text' with font FreeSerif of size 24 at position x=100
  5197. and y=50 (counting from the top-left corner of the screen), text is
  5198. yellow with a red box around it. Both the text and the box have an
  5199. opacity of 20%.
  5200. @example
  5201. drawtext="fontfile=/usr/share/fonts/truetype/freefont/FreeSerif.ttf: text='Test Text':\
  5202. x=100: y=50: fontsize=24: fontcolor=yellow@@0.2: box=1: boxcolor=red@@0.2"
  5203. @end example
  5204. Note that the double quotes are not necessary if spaces are not used
  5205. within the parameter list.
  5206. @item
  5207. Show the text at the center of the video frame:
  5208. @example
  5209. drawtext="fontsize=30:fontfile=FreeSerif.ttf:text='hello world':x=(w-text_w)/2:y=(h-text_h)/2"
  5210. @end example
  5211. @item
  5212. Show the text at a random position, switching to a new position every 30 seconds:
  5213. @example
  5214. drawtext="fontsize=30:fontfile=FreeSerif.ttf:text='hello world':x=if(eq(mod(t\,30)\,0)\,rand(0\,(w-text_w))\,x):y=if(eq(mod(t\,30)\,0)\,rand(0\,(h-text_h))\,y)"
  5215. @end example
  5216. @item
  5217. Show a text line sliding from right to left in the last row of the video
  5218. frame. The file @file{LONG_LINE} is assumed to contain a single line
  5219. with no newlines.
  5220. @example
  5221. drawtext="fontsize=15:fontfile=FreeSerif.ttf:text=LONG_LINE:y=h-line_h:x=-50*t"
  5222. @end example
  5223. @item
  5224. Show the content of file @file{CREDITS} off the bottom of the frame and scroll up.
  5225. @example
  5226. drawtext="fontsize=20:fontfile=FreeSerif.ttf:textfile=CREDITS:y=h-20*t"
  5227. @end example
  5228. @item
  5229. Draw a single green letter "g", at the center of the input video.
  5230. The glyph baseline is placed at half screen height.
  5231. @example
  5232. drawtext="fontsize=60:fontfile=FreeSerif.ttf:fontcolor=green:text=g:x=(w-max_glyph_w)/2:y=h/2-ascent"
  5233. @end example
  5234. @item
  5235. Show text for 1 second every 3 seconds:
  5236. @example
  5237. drawtext="fontfile=FreeSerif.ttf:fontcolor=white:x=100:y=x/dar:enable=lt(mod(t\,3)\,1):text='blink'"
  5238. @end example
  5239. @item
  5240. Use fontconfig to set the font. Note that the colons need to be escaped.
  5241. @example
  5242. drawtext='fontfile=Linux Libertine O-40\:style=Semibold:text=FFmpeg'
  5243. @end example
  5244. @item
  5245. Print the date of a real-time encoding (see strftime(3)):
  5246. @example
  5247. drawtext='fontfile=FreeSans.ttf:text=%@{localtime\:%a %b %d %Y@}'
  5248. @end example
  5249. @item
  5250. Show text fading in and out (appearing/disappearing):
  5251. @example
  5252. #!/bin/sh
  5253. DS=1.0 # display start
  5254. DE=10.0 # display end
  5255. FID=1.5 # fade in duration
  5256. FOD=5 # fade out duration
  5257. ffplay -f lavfi "color,drawtext=text=TEST:fontsize=50:fontfile=FreeSerif.ttf:fontcolor_expr=ff0000%@{eif\\\\: clip(255*(1*between(t\\, $DS + $FID\\, $DE - $FOD) + ((t - $DS)/$FID)*between(t\\, $DS\\, $DS + $FID) + (-(t - $DE)/$FOD)*between(t\\, $DE - $FOD\\, $DE) )\\, 0\\, 255) \\\\: x\\\\: 2 @}"
  5258. @end example
  5259. @end itemize
  5260. For more information about libfreetype, check:
  5261. @url{http://www.freetype.org/}.
  5262. For more information about fontconfig, check:
  5263. @url{http://freedesktop.org/software/fontconfig/fontconfig-user.html}.
  5264. For more information about libfribidi, check:
  5265. @url{http://fribidi.org/}.
  5266. @section edgedetect
  5267. Detect and draw edges. The filter uses the Canny Edge Detection algorithm.
  5268. The filter accepts the following options:
  5269. @table @option
  5270. @item low
  5271. @item high
  5272. Set low and high threshold values used by the Canny thresholding
  5273. algorithm.
  5274. The high threshold selects the "strong" edge pixels, which are then
  5275. connected through 8-connectivity with the "weak" edge pixels selected
  5276. by the low threshold.
  5277. @var{low} and @var{high} threshold values must be chosen in the range
  5278. [0,1], and @var{low} should be lesser or equal to @var{high}.
  5279. Default value for @var{low} is @code{20/255}, and default value for @var{high}
  5280. is @code{50/255}.
  5281. @item mode
  5282. Define the drawing mode.
  5283. @table @samp
  5284. @item wires
  5285. Draw white/gray wires on black background.
  5286. @item colormix
  5287. Mix the colors to create a paint/cartoon effect.
  5288. @end table
  5289. Default value is @var{wires}.
  5290. @end table
  5291. @subsection Examples
  5292. @itemize
  5293. @item
  5294. Standard edge detection with custom values for the hysteresis thresholding:
  5295. @example
  5296. edgedetect=low=0.1:high=0.4
  5297. @end example
  5298. @item
  5299. Painting effect without thresholding:
  5300. @example
  5301. edgedetect=mode=colormix:high=0
  5302. @end example
  5303. @end itemize
  5304. @section eq
  5305. Set brightness, contrast, saturation and approximate gamma adjustment.
  5306. The filter accepts the following options:
  5307. @table @option
  5308. @item contrast
  5309. Set the contrast expression. The value must be a float value in range
  5310. @code{-2.0} to @code{2.0}. The default value is "1".
  5311. @item brightness
  5312. Set the brightness expression. The value must be a float value in
  5313. range @code{-1.0} to @code{1.0}. The default value is "0".
  5314. @item saturation
  5315. Set the saturation expression. The value must be a float in
  5316. range @code{0.0} to @code{3.0}. The default value is "1".
  5317. @item gamma
  5318. Set the gamma expression. The value must be a float in range
  5319. @code{0.1} to @code{10.0}. The default value is "1".
  5320. @item gamma_r
  5321. Set the gamma expression for red. The value must be a float in
  5322. range @code{0.1} to @code{10.0}. The default value is "1".
  5323. @item gamma_g
  5324. Set the gamma expression for green. The value must be a float in range
  5325. @code{0.1} to @code{10.0}. The default value is "1".
  5326. @item gamma_b
  5327. Set the gamma expression for blue. The value must be a float in range
  5328. @code{0.1} to @code{10.0}. The default value is "1".
  5329. @item gamma_weight
  5330. Set the gamma weight expression. It can be used to reduce the effect
  5331. of a high gamma value on bright image areas, e.g. keep them from
  5332. getting overamplified and just plain white. The value must be a float
  5333. in range @code{0.0} to @code{1.0}. A value of @code{0.0} turns the
  5334. gamma correction all the way down while @code{1.0} leaves it at its
  5335. full strength. Default is "1".
  5336. @item eval
  5337. Set when the expressions for brightness, contrast, saturation and
  5338. gamma expressions are evaluated.
  5339. It accepts the following values:
  5340. @table @samp
  5341. @item init
  5342. only evaluate expressions once during the filter initialization or
  5343. when a command is processed
  5344. @item frame
  5345. evaluate expressions for each incoming frame
  5346. @end table
  5347. Default value is @samp{init}.
  5348. @end table
  5349. The expressions accept the following parameters:
  5350. @table @option
  5351. @item n
  5352. frame count of the input frame starting from 0
  5353. @item pos
  5354. byte position of the corresponding packet in the input file, NAN if
  5355. unspecified
  5356. @item r
  5357. frame rate of the input video, NAN if the input frame rate is unknown
  5358. @item t
  5359. timestamp expressed in seconds, NAN if the input timestamp is unknown
  5360. @end table
  5361. @subsection Commands
  5362. The filter supports the following commands:
  5363. @table @option
  5364. @item contrast
  5365. Set the contrast expression.
  5366. @item brightness
  5367. Set the brightness expression.
  5368. @item saturation
  5369. Set the saturation expression.
  5370. @item gamma
  5371. Set the gamma expression.
  5372. @item gamma_r
  5373. Set the gamma_r expression.
  5374. @item gamma_g
  5375. Set gamma_g expression.
  5376. @item gamma_b
  5377. Set gamma_b expression.
  5378. @item gamma_weight
  5379. Set gamma_weight expression.
  5380. The command accepts the same syntax of the corresponding option.
  5381. If the specified expression is not valid, it is kept at its current
  5382. value.
  5383. @end table
  5384. @section erosion
  5385. Apply erosion effect to the video.
  5386. This filter replaces the pixel by the local(3x3) minimum.
  5387. It accepts the following options:
  5388. @table @option
  5389. @item threshold0
  5390. @item threshold1
  5391. @item threshold2
  5392. @item threshold3
  5393. Limit the maximum change for each plane, default is 65535.
  5394. If 0, plane will remain unchanged.
  5395. @item coordinates
  5396. Flag which specifies the pixel to refer to. Default is 255 i.e. all eight
  5397. pixels are used.
  5398. Flags to local 3x3 coordinates maps like this:
  5399. 1 2 3
  5400. 4 5
  5401. 6 7 8
  5402. @end table
  5403. @section extractplanes
  5404. Extract color channel components from input video stream into
  5405. separate grayscale video streams.
  5406. The filter accepts the following option:
  5407. @table @option
  5408. @item planes
  5409. Set plane(s) to extract.
  5410. Available values for planes are:
  5411. @table @samp
  5412. @item y
  5413. @item u
  5414. @item v
  5415. @item a
  5416. @item r
  5417. @item g
  5418. @item b
  5419. @end table
  5420. Choosing planes not available in the input will result in an error.
  5421. That means you cannot select @code{r}, @code{g}, @code{b} planes
  5422. with @code{y}, @code{u}, @code{v} planes at same time.
  5423. @end table
  5424. @subsection Examples
  5425. @itemize
  5426. @item
  5427. Extract luma, u and v color channel component from input video frame
  5428. into 3 grayscale outputs:
  5429. @example
  5430. ffmpeg -i video.avi -filter_complex 'extractplanes=y+u+v[y][u][v]' -map '[y]' y.avi -map '[u]' u.avi -map '[v]' v.avi
  5431. @end example
  5432. @end itemize
  5433. @section elbg
  5434. Apply a posterize effect using the ELBG (Enhanced LBG) algorithm.
  5435. For each input image, the filter will compute the optimal mapping from
  5436. the input to the output given the codebook length, that is the number
  5437. of distinct output colors.
  5438. This filter accepts the following options.
  5439. @table @option
  5440. @item codebook_length, l
  5441. Set codebook length. The value must be a positive integer, and
  5442. represents the number of distinct output colors. Default value is 256.
  5443. @item nb_steps, n
  5444. Set the maximum number of iterations to apply for computing the optimal
  5445. mapping. The higher the value the better the result and the higher the
  5446. computation time. Default value is 1.
  5447. @item seed, s
  5448. Set a random seed, must be an integer included between 0 and
  5449. UINT32_MAX. If not specified, or if explicitly set to -1, the filter
  5450. will try to use a good random seed on a best effort basis.
  5451. @item pal8
  5452. Set pal8 output pixel format. This option does not work with codebook
  5453. length greater than 256.
  5454. @end table
  5455. @section fade
  5456. Apply a fade-in/out effect to the input video.
  5457. It accepts the following parameters:
  5458. @table @option
  5459. @item type, t
  5460. The effect type can be either "in" for a fade-in, or "out" for a fade-out
  5461. effect.
  5462. Default is @code{in}.
  5463. @item start_frame, s
  5464. Specify the number of the frame to start applying the fade
  5465. effect at. Default is 0.
  5466. @item nb_frames, n
  5467. The number of frames that the fade effect lasts. At the end of the
  5468. fade-in effect, the output video will have the same intensity as the input video.
  5469. At the end of the fade-out transition, the output video will be filled with the
  5470. selected @option{color}.
  5471. Default is 25.
  5472. @item alpha
  5473. If set to 1, fade only alpha channel, if one exists on the input.
  5474. Default value is 0.
  5475. @item start_time, st
  5476. Specify the timestamp (in seconds) of the frame to start to apply the fade
  5477. effect. If both start_frame and start_time are specified, the fade will start at
  5478. whichever comes last. Default is 0.
  5479. @item duration, d
  5480. The number of seconds for which the fade effect has to last. At the end of the
  5481. fade-in effect the output video will have the same intensity as the input video,
  5482. at the end of the fade-out transition the output video will be filled with the
  5483. selected @option{color}.
  5484. If both duration and nb_frames are specified, duration is used. Default is 0
  5485. (nb_frames is used by default).
  5486. @item color, c
  5487. Specify the color of the fade. Default is "black".
  5488. @end table
  5489. @subsection Examples
  5490. @itemize
  5491. @item
  5492. Fade in the first 30 frames of video:
  5493. @example
  5494. fade=in:0:30
  5495. @end example
  5496. The command above is equivalent to:
  5497. @example
  5498. fade=t=in:s=0:n=30
  5499. @end example
  5500. @item
  5501. Fade out the last 45 frames of a 200-frame video:
  5502. @example
  5503. fade=out:155:45
  5504. fade=type=out:start_frame=155:nb_frames=45
  5505. @end example
  5506. @item
  5507. Fade in the first 25 frames and fade out the last 25 frames of a 1000-frame video:
  5508. @example
  5509. fade=in:0:25, fade=out:975:25
  5510. @end example
  5511. @item
  5512. Make the first 5 frames yellow, then fade in from frame 5-24:
  5513. @example
  5514. fade=in:5:20:color=yellow
  5515. @end example
  5516. @item
  5517. Fade in alpha over first 25 frames of video:
  5518. @example
  5519. fade=in:0:25:alpha=1
  5520. @end example
  5521. @item
  5522. Make the first 5.5 seconds black, then fade in for 0.5 seconds:
  5523. @example
  5524. fade=t=in:st=5.5:d=0.5
  5525. @end example
  5526. @end itemize
  5527. @section fftfilt
  5528. Apply arbitrary expressions to samples in frequency domain
  5529. @table @option
  5530. @item dc_Y
  5531. Adjust the dc value (gain) of the luma plane of the image. The filter
  5532. accepts an integer value in range @code{0} to @code{1000}. The default
  5533. value is set to @code{0}.
  5534. @item dc_U
  5535. Adjust the dc value (gain) of the 1st chroma plane of the image. The
  5536. filter accepts an integer value in range @code{0} to @code{1000}. The
  5537. default value is set to @code{0}.
  5538. @item dc_V
  5539. Adjust the dc value (gain) of the 2nd chroma plane of the image. The
  5540. filter accepts an integer value in range @code{0} to @code{1000}. The
  5541. default value is set to @code{0}.
  5542. @item weight_Y
  5543. Set the frequency domain weight expression for the luma plane.
  5544. @item weight_U
  5545. Set the frequency domain weight expression for the 1st chroma plane.
  5546. @item weight_V
  5547. Set the frequency domain weight expression for the 2nd chroma plane.
  5548. The filter accepts the following variables:
  5549. @item X
  5550. @item Y
  5551. The coordinates of the current sample.
  5552. @item W
  5553. @item H
  5554. The width and height of the image.
  5555. @end table
  5556. @subsection Examples
  5557. @itemize
  5558. @item
  5559. High-pass:
  5560. @example
  5561. fftfilt=dc_Y=128:weight_Y='squish(1-(Y+X)/100)'
  5562. @end example
  5563. @item
  5564. Low-pass:
  5565. @example
  5566. fftfilt=dc_Y=0:weight_Y='squish((Y+X)/100-1)'
  5567. @end example
  5568. @item
  5569. Sharpen:
  5570. @example
  5571. fftfilt=dc_Y=0:weight_Y='1+squish(1-(Y+X)/100)'
  5572. @end example
  5573. @item
  5574. Blur:
  5575. @example
  5576. fftfilt=dc_Y=0:weight_Y='exp(-4 * ((Y+X)/(W+H)))'
  5577. @end example
  5578. @end itemize
  5579. @section field
  5580. Extract a single field from an interlaced image using stride
  5581. arithmetic to avoid wasting CPU time. The output frames are marked as
  5582. non-interlaced.
  5583. The filter accepts the following options:
  5584. @table @option
  5585. @item type
  5586. Specify whether to extract the top (if the value is @code{0} or
  5587. @code{top}) or the bottom field (if the value is @code{1} or
  5588. @code{bottom}).
  5589. @end table
  5590. @section fieldhint
  5591. Create new frames by copying the top and bottom fields from surrounding frames
  5592. supplied as numbers by the hint file.
  5593. @table @option
  5594. @item hint
  5595. Set file containing hints: absolute/relative frame numbers.
  5596. There must be one line for each frame in a clip. Each line must contain two
  5597. numbers separated by the comma, optionally followed by @code{-} or @code{+}.
  5598. Numbers supplied on each line of file can not be out of [N-1,N+1] where N
  5599. is current frame number for @code{absolute} mode or out of [-1, 1] range
  5600. for @code{relative} mode. First number tells from which frame to pick up top
  5601. field and second number tells from which frame to pick up bottom field.
  5602. If optionally followed by @code{+} output frame will be marked as interlaced,
  5603. else if followed by @code{-} output frame will be marked as progressive, else
  5604. it will be marked same as input frame.
  5605. If line starts with @code{#} or @code{;} that line is skipped.
  5606. @item mode
  5607. Can be item @code{absolute} or @code{relative}. Default is @code{absolute}.
  5608. @end table
  5609. Example of first several lines of @code{hint} file for @code{relative} mode:
  5610. @example
  5611. 0,0 - # first frame
  5612. 1,0 - # second frame, use third's frame top field and second's frame bottom field
  5613. 1,0 - # third frame, use fourth's frame top field and third's frame bottom field
  5614. 1,0 -
  5615. 0,0 -
  5616. 0,0 -
  5617. 1,0 -
  5618. 1,0 -
  5619. 1,0 -
  5620. 0,0 -
  5621. 0,0 -
  5622. 1,0 -
  5623. 1,0 -
  5624. 1,0 -
  5625. 0,0 -
  5626. @end example
  5627. @section fieldmatch
  5628. Field matching filter for inverse telecine. It is meant to reconstruct the
  5629. progressive frames from a telecined stream. The filter does not drop duplicated
  5630. frames, so to achieve a complete inverse telecine @code{fieldmatch} needs to be
  5631. followed by a decimation filter such as @ref{decimate} in the filtergraph.
  5632. The separation of the field matching and the decimation is notably motivated by
  5633. the possibility of inserting a de-interlacing filter fallback between the two.
  5634. If the source has mixed telecined and real interlaced content,
  5635. @code{fieldmatch} will not be able to match fields for the interlaced parts.
  5636. But these remaining combed frames will be marked as interlaced, and thus can be
  5637. de-interlaced by a later filter such as @ref{yadif} before decimation.
  5638. In addition to the various configuration options, @code{fieldmatch} can take an
  5639. optional second stream, activated through the @option{ppsrc} option. If
  5640. enabled, the frames reconstruction will be based on the fields and frames from
  5641. this second stream. This allows the first input to be pre-processed in order to
  5642. help the various algorithms of the filter, while keeping the output lossless
  5643. (assuming the fields are matched properly). Typically, a field-aware denoiser,
  5644. or brightness/contrast adjustments can help.
  5645. Note that this filter uses the same algorithms as TIVTC/TFM (AviSynth project)
  5646. and VIVTC/VFM (VapourSynth project). The later is a light clone of TFM from
  5647. which @code{fieldmatch} is based on. While the semantic and usage are very
  5648. close, some behaviour and options names can differ.
  5649. The @ref{decimate} filter currently only works for constant frame rate input.
  5650. If your input has mixed telecined (30fps) and progressive content with a lower
  5651. framerate like 24fps use the following filterchain to produce the necessary cfr
  5652. stream: @code{dejudder,fps=30000/1001,fieldmatch,decimate}.
  5653. The filter accepts the following options:
  5654. @table @option
  5655. @item order
  5656. Specify the assumed field order of the input stream. Available values are:
  5657. @table @samp
  5658. @item auto
  5659. Auto detect parity (use FFmpeg's internal parity value).
  5660. @item bff
  5661. Assume bottom field first.
  5662. @item tff
  5663. Assume top field first.
  5664. @end table
  5665. Note that it is sometimes recommended not to trust the parity announced by the
  5666. stream.
  5667. Default value is @var{auto}.
  5668. @item mode
  5669. Set the matching mode or strategy to use. @option{pc} mode is the safest in the
  5670. sense that it won't risk creating jerkiness due to duplicate frames when
  5671. possible, but if there are bad edits or blended fields it will end up
  5672. outputting combed frames when a good match might actually exist. On the other
  5673. hand, @option{pcn_ub} mode is the most risky in terms of creating jerkiness,
  5674. but will almost always find a good frame if there is one. The other values are
  5675. all somewhere in between @option{pc} and @option{pcn_ub} in terms of risking
  5676. jerkiness and creating duplicate frames versus finding good matches in sections
  5677. with bad edits, orphaned fields, blended fields, etc.
  5678. More details about p/c/n/u/b are available in @ref{p/c/n/u/b meaning} section.
  5679. Available values are:
  5680. @table @samp
  5681. @item pc
  5682. 2-way matching (p/c)
  5683. @item pc_n
  5684. 2-way matching, and trying 3rd match if still combed (p/c + n)
  5685. @item pc_u
  5686. 2-way matching, and trying 3rd match (same order) if still combed (p/c + u)
  5687. @item pc_n_ub
  5688. 2-way matching, trying 3rd match if still combed, and trying 4th/5th matches if
  5689. still combed (p/c + n + u/b)
  5690. @item pcn
  5691. 3-way matching (p/c/n)
  5692. @item pcn_ub
  5693. 3-way matching, and trying 4th/5th matches if all 3 of the original matches are
  5694. detected as combed (p/c/n + u/b)
  5695. @end table
  5696. The parenthesis at the end indicate the matches that would be used for that
  5697. mode assuming @option{order}=@var{tff} (and @option{field} on @var{auto} or
  5698. @var{top}).
  5699. In terms of speed @option{pc} mode is by far the fastest and @option{pcn_ub} is
  5700. the slowest.
  5701. Default value is @var{pc_n}.
  5702. @item ppsrc
  5703. Mark the main input stream as a pre-processed input, and enable the secondary
  5704. input stream as the clean source to pick the fields from. See the filter
  5705. introduction for more details. It is similar to the @option{clip2} feature from
  5706. VFM/TFM.
  5707. Default value is @code{0} (disabled).
  5708. @item field
  5709. Set the field to match from. It is recommended to set this to the same value as
  5710. @option{order} unless you experience matching failures with that setting. In
  5711. certain circumstances changing the field that is used to match from can have a
  5712. large impact on matching performance. Available values are:
  5713. @table @samp
  5714. @item auto
  5715. Automatic (same value as @option{order}).
  5716. @item bottom
  5717. Match from the bottom field.
  5718. @item top
  5719. Match from the top field.
  5720. @end table
  5721. Default value is @var{auto}.
  5722. @item mchroma
  5723. Set whether or not chroma is included during the match comparisons. In most
  5724. cases it is recommended to leave this enabled. You should set this to @code{0}
  5725. only if your clip has bad chroma problems such as heavy rainbowing or other
  5726. artifacts. Setting this to @code{0} could also be used to speed things up at
  5727. the cost of some accuracy.
  5728. Default value is @code{1}.
  5729. @item y0
  5730. @item y1
  5731. These define an exclusion band which excludes the lines between @option{y0} and
  5732. @option{y1} from being included in the field matching decision. An exclusion
  5733. band can be used to ignore subtitles, a logo, or other things that may
  5734. interfere with the matching. @option{y0} sets the starting scan line and
  5735. @option{y1} sets the ending line; all lines in between @option{y0} and
  5736. @option{y1} (including @option{y0} and @option{y1}) will be ignored. Setting
  5737. @option{y0} and @option{y1} to the same value will disable the feature.
  5738. @option{y0} and @option{y1} defaults to @code{0}.
  5739. @item scthresh
  5740. Set the scene change detection threshold as a percentage of maximum change on
  5741. the luma plane. Good values are in the @code{[8.0, 14.0]} range. Scene change
  5742. detection is only relevant in case @option{combmatch}=@var{sc}. The range for
  5743. @option{scthresh} is @code{[0.0, 100.0]}.
  5744. Default value is @code{12.0}.
  5745. @item combmatch
  5746. When @option{combatch} is not @var{none}, @code{fieldmatch} will take into
  5747. account the combed scores of matches when deciding what match to use as the
  5748. final match. Available values are:
  5749. @table @samp
  5750. @item none
  5751. No final matching based on combed scores.
  5752. @item sc
  5753. Combed scores are only used when a scene change is detected.
  5754. @item full
  5755. Use combed scores all the time.
  5756. @end table
  5757. Default is @var{sc}.
  5758. @item combdbg
  5759. Force @code{fieldmatch} to calculate the combed metrics for certain matches and
  5760. print them. This setting is known as @option{micout} in TFM/VFM vocabulary.
  5761. Available values are:
  5762. @table @samp
  5763. @item none
  5764. No forced calculation.
  5765. @item pcn
  5766. Force p/c/n calculations.
  5767. @item pcnub
  5768. Force p/c/n/u/b calculations.
  5769. @end table
  5770. Default value is @var{none}.
  5771. @item cthresh
  5772. This is the area combing threshold used for combed frame detection. This
  5773. essentially controls how "strong" or "visible" combing must be to be detected.
  5774. Larger values mean combing must be more visible and smaller values mean combing
  5775. can be less visible or strong and still be detected. Valid settings are from
  5776. @code{-1} (every pixel will be detected as combed) to @code{255} (no pixel will
  5777. be detected as combed). This is basically a pixel difference value. A good
  5778. range is @code{[8, 12]}.
  5779. Default value is @code{9}.
  5780. @item chroma
  5781. Sets whether or not chroma is considered in the combed frame decision. Only
  5782. disable this if your source has chroma problems (rainbowing, etc.) that are
  5783. causing problems for the combed frame detection with chroma enabled. Actually,
  5784. using @option{chroma}=@var{0} is usually more reliable, except for the case
  5785. where there is chroma only combing in the source.
  5786. Default value is @code{0}.
  5787. @item blockx
  5788. @item blocky
  5789. Respectively set the x-axis and y-axis size of the window used during combed
  5790. frame detection. This has to do with the size of the area in which
  5791. @option{combpel} pixels are required to be detected as combed for a frame to be
  5792. declared combed. See the @option{combpel} parameter description for more info.
  5793. Possible values are any number that is a power of 2 starting at 4 and going up
  5794. to 512.
  5795. Default value is @code{16}.
  5796. @item combpel
  5797. The number of combed pixels inside any of the @option{blocky} by
  5798. @option{blockx} size blocks on the frame for the frame to be detected as
  5799. combed. While @option{cthresh} controls how "visible" the combing must be, this
  5800. setting controls "how much" combing there must be in any localized area (a
  5801. window defined by the @option{blockx} and @option{blocky} settings) on the
  5802. frame. Minimum value is @code{0} and maximum is @code{blocky x blockx} (at
  5803. which point no frames will ever be detected as combed). This setting is known
  5804. as @option{MI} in TFM/VFM vocabulary.
  5805. Default value is @code{80}.
  5806. @end table
  5807. @anchor{p/c/n/u/b meaning}
  5808. @subsection p/c/n/u/b meaning
  5809. @subsubsection p/c/n
  5810. We assume the following telecined stream:
  5811. @example
  5812. Top fields: 1 2 2 3 4
  5813. Bottom fields: 1 2 3 4 4
  5814. @end example
  5815. The numbers correspond to the progressive frame the fields relate to. Here, the
  5816. first two frames are progressive, the 3rd and 4th are combed, and so on.
  5817. When @code{fieldmatch} is configured to run a matching from bottom
  5818. (@option{field}=@var{bottom}) this is how this input stream get transformed:
  5819. @example
  5820. Input stream:
  5821. T 1 2 2 3 4
  5822. B 1 2 3 4 4 <-- matching reference
  5823. Matches: c c n n c
  5824. Output stream:
  5825. T 1 2 3 4 4
  5826. B 1 2 3 4 4
  5827. @end example
  5828. As a result of the field matching, we can see that some frames get duplicated.
  5829. To perform a complete inverse telecine, you need to rely on a decimation filter
  5830. after this operation. See for instance the @ref{decimate} filter.
  5831. The same operation now matching from top fields (@option{field}=@var{top})
  5832. looks like this:
  5833. @example
  5834. Input stream:
  5835. T 1 2 2 3 4 <-- matching reference
  5836. B 1 2 3 4 4
  5837. Matches: c c p p c
  5838. Output stream:
  5839. T 1 2 2 3 4
  5840. B 1 2 2 3 4
  5841. @end example
  5842. In these examples, we can see what @var{p}, @var{c} and @var{n} mean;
  5843. basically, they refer to the frame and field of the opposite parity:
  5844. @itemize
  5845. @item @var{p} matches the field of the opposite parity in the previous frame
  5846. @item @var{c} matches the field of the opposite parity in the current frame
  5847. @item @var{n} matches the field of the opposite parity in the next frame
  5848. @end itemize
  5849. @subsubsection u/b
  5850. The @var{u} and @var{b} matching are a bit special in the sense that they match
  5851. from the opposite parity flag. In the following examples, we assume that we are
  5852. currently matching the 2nd frame (Top:2, bottom:2). According to the match, a
  5853. 'x' is placed above and below each matched fields.
  5854. With bottom matching (@option{field}=@var{bottom}):
  5855. @example
  5856. Match: c p n b u
  5857. x x x x x
  5858. Top 1 2 2 1 2 2 1 2 2 1 2 2 1 2 2
  5859. Bottom 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3
  5860. x x x x x
  5861. Output frames:
  5862. 2 1 2 2 2
  5863. 2 2 2 1 3
  5864. @end example
  5865. With top matching (@option{field}=@var{top}):
  5866. @example
  5867. Match: c p n b u
  5868. x x x x x
  5869. Top 1 2 2 1 2 2 1 2 2 1 2 2 1 2 2
  5870. Bottom 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3
  5871. x x x x x
  5872. Output frames:
  5873. 2 2 2 1 2
  5874. 2 1 3 2 2
  5875. @end example
  5876. @subsection Examples
  5877. Simple IVTC of a top field first telecined stream:
  5878. @example
  5879. fieldmatch=order=tff:combmatch=none, decimate
  5880. @end example
  5881. Advanced IVTC, with fallback on @ref{yadif} for still combed frames:
  5882. @example
  5883. fieldmatch=order=tff:combmatch=full, yadif=deint=interlaced, decimate
  5884. @end example
  5885. @section fieldorder
  5886. Transform the field order of the input video.
  5887. It accepts the following parameters:
  5888. @table @option
  5889. @item order
  5890. The output field order. Valid values are @var{tff} for top field first or @var{bff}
  5891. for bottom field first.
  5892. @end table
  5893. The default value is @samp{tff}.
  5894. The transformation is done by shifting the picture content up or down
  5895. by one line, and filling the remaining line with appropriate picture content.
  5896. This method is consistent with most broadcast field order converters.
  5897. If the input video is not flagged as being interlaced, or it is already
  5898. flagged as being of the required output field order, then this filter does
  5899. not alter the incoming video.
  5900. It is very useful when converting to or from PAL DV material,
  5901. which is bottom field first.
  5902. For example:
  5903. @example
  5904. ffmpeg -i in.vob -vf "fieldorder=bff" out.dv
  5905. @end example
  5906. @section fifo, afifo
  5907. Buffer input images and send them when they are requested.
  5908. It is mainly useful when auto-inserted by the libavfilter
  5909. framework.
  5910. It does not take parameters.
  5911. @section find_rect
  5912. Find a rectangular object
  5913. It accepts the following options:
  5914. @table @option
  5915. @item object
  5916. Filepath of the object image, needs to be in gray8.
  5917. @item threshold
  5918. Detection threshold, default is 0.5.
  5919. @item mipmaps
  5920. Number of mipmaps, default is 3.
  5921. @item xmin, ymin, xmax, ymax
  5922. Specifies the rectangle in which to search.
  5923. @end table
  5924. @subsection Examples
  5925. @itemize
  5926. @item
  5927. Generate a representative palette of a given video using @command{ffmpeg}:
  5928. @example
  5929. ffmpeg -i file.ts -vf find_rect=newref.pgm,cover_rect=cover.jpg:mode=cover new.mkv
  5930. @end example
  5931. @end itemize
  5932. @section cover_rect
  5933. Cover a rectangular object
  5934. It accepts the following options:
  5935. @table @option
  5936. @item cover
  5937. Filepath of the optional cover image, needs to be in yuv420.
  5938. @item mode
  5939. Set covering mode.
  5940. It accepts the following values:
  5941. @table @samp
  5942. @item cover
  5943. cover it by the supplied image
  5944. @item blur
  5945. cover it by interpolating the surrounding pixels
  5946. @end table
  5947. Default value is @var{blur}.
  5948. @end table
  5949. @subsection Examples
  5950. @itemize
  5951. @item
  5952. Generate a representative palette of a given video using @command{ffmpeg}:
  5953. @example
  5954. ffmpeg -i file.ts -vf find_rect=newref.pgm,cover_rect=cover.jpg:mode=cover new.mkv
  5955. @end example
  5956. @end itemize
  5957. @anchor{format}
  5958. @section format
  5959. Convert the input video to one of the specified pixel formats.
  5960. Libavfilter will try to pick one that is suitable as input to
  5961. the next filter.
  5962. It accepts the following parameters:
  5963. @table @option
  5964. @item pix_fmts
  5965. A '|'-separated list of pixel format names, such as
  5966. "pix_fmts=yuv420p|monow|rgb24".
  5967. @end table
  5968. @subsection Examples
  5969. @itemize
  5970. @item
  5971. Convert the input video to the @var{yuv420p} format
  5972. @example
  5973. format=pix_fmts=yuv420p
  5974. @end example
  5975. Convert the input video to any of the formats in the list
  5976. @example
  5977. format=pix_fmts=yuv420p|yuv444p|yuv410p
  5978. @end example
  5979. @end itemize
  5980. @anchor{fps}
  5981. @section fps
  5982. Convert the video to specified constant frame rate by duplicating or dropping
  5983. frames as necessary.
  5984. It accepts the following parameters:
  5985. @table @option
  5986. @item fps
  5987. The desired output frame rate. The default is @code{25}.
  5988. @item round
  5989. Rounding method.
  5990. Possible values are:
  5991. @table @option
  5992. @item zero
  5993. zero round towards 0
  5994. @item inf
  5995. round away from 0
  5996. @item down
  5997. round towards -infinity
  5998. @item up
  5999. round towards +infinity
  6000. @item near
  6001. round to nearest
  6002. @end table
  6003. The default is @code{near}.
  6004. @item start_time
  6005. Assume the first PTS should be the given value, in seconds. This allows for
  6006. padding/trimming at the start of stream. By default, no assumption is made
  6007. about the first frame's expected PTS, so no padding or trimming is done.
  6008. For example, this could be set to 0 to pad the beginning with duplicates of
  6009. the first frame if a video stream starts after the audio stream or to trim any
  6010. frames with a negative PTS.
  6011. @end table
  6012. Alternatively, the options can be specified as a flat string:
  6013. @var{fps}[:@var{round}].
  6014. See also the @ref{setpts} filter.
  6015. @subsection Examples
  6016. @itemize
  6017. @item
  6018. A typical usage in order to set the fps to 25:
  6019. @example
  6020. fps=fps=25
  6021. @end example
  6022. @item
  6023. Sets the fps to 24, using abbreviation and rounding method to round to nearest:
  6024. @example
  6025. fps=fps=film:round=near
  6026. @end example
  6027. @end itemize
  6028. @section framepack
  6029. Pack two different video streams into a stereoscopic video, setting proper
  6030. metadata on supported codecs. The two views should have the same size and
  6031. framerate and processing will stop when the shorter video ends. Please note
  6032. that you may conveniently adjust view properties with the @ref{scale} and
  6033. @ref{fps} filters.
  6034. It accepts the following parameters:
  6035. @table @option
  6036. @item format
  6037. The desired packing format. Supported values are:
  6038. @table @option
  6039. @item sbs
  6040. The views are next to each other (default).
  6041. @item tab
  6042. The views are on top of each other.
  6043. @item lines
  6044. The views are packed by line.
  6045. @item columns
  6046. The views are packed by column.
  6047. @item frameseq
  6048. The views are temporally interleaved.
  6049. @end table
  6050. @end table
  6051. Some examples:
  6052. @example
  6053. # Convert left and right views into a frame-sequential video
  6054. ffmpeg -i LEFT -i RIGHT -filter_complex framepack=frameseq OUTPUT
  6055. # Convert views into a side-by-side video with the same output resolution as the input
  6056. ffmpeg -i LEFT -i RIGHT -filter_complex [0:v]scale=w=iw/2[left],[1:v]scale=w=iw/2[right],[left][right]framepack=sbs OUTPUT
  6057. @end example
  6058. @section framerate
  6059. Change the frame rate by interpolating new video output frames from the source
  6060. frames.
  6061. This filter is not designed to function correctly with interlaced media. If
  6062. you wish to change the frame rate of interlaced media then you are required
  6063. to deinterlace before this filter and re-interlace after this filter.
  6064. A description of the accepted options follows.
  6065. @table @option
  6066. @item fps
  6067. Specify the output frames per second. This option can also be specified
  6068. as a value alone. The default is @code{50}.
  6069. @item interp_start
  6070. Specify the start of a range where the output frame will be created as a
  6071. linear interpolation of two frames. The range is [@code{0}-@code{255}],
  6072. the default is @code{15}.
  6073. @item interp_end
  6074. Specify the end of a range where the output frame will be created as a
  6075. linear interpolation of two frames. The range is [@code{0}-@code{255}],
  6076. the default is @code{240}.
  6077. @item scene
  6078. Specify the level at which a scene change is detected as a value between
  6079. 0 and 100 to indicate a new scene; a low value reflects a low
  6080. probability for the current frame to introduce a new scene, while a higher
  6081. value means the current frame is more likely to be one.
  6082. The default is @code{7}.
  6083. @item flags
  6084. Specify flags influencing the filter process.
  6085. Available value for @var{flags} is:
  6086. @table @option
  6087. @item scene_change_detect, scd
  6088. Enable scene change detection using the value of the option @var{scene}.
  6089. This flag is enabled by default.
  6090. @end table
  6091. @end table
  6092. @section framestep
  6093. Select one frame every N-th frame.
  6094. This filter accepts the following option:
  6095. @table @option
  6096. @item step
  6097. Select frame after every @code{step} frames.
  6098. Allowed values are positive integers higher than 0. Default value is @code{1}.
  6099. @end table
  6100. @anchor{frei0r}
  6101. @section frei0r
  6102. Apply a frei0r effect to the input video.
  6103. To enable the compilation of this filter, you need to install the frei0r
  6104. header and configure FFmpeg with @code{--enable-frei0r}.
  6105. It accepts the following parameters:
  6106. @table @option
  6107. @item filter_name
  6108. The name of the frei0r effect to load. If the environment variable
  6109. @env{FREI0R_PATH} is defined, the frei0r effect is searched for in each of the
  6110. directories specified by the colon-separated list in @env{FREIOR_PATH}.
  6111. Otherwise, the standard frei0r paths are searched, in this order:
  6112. @file{HOME/.frei0r-1/lib/}, @file{/usr/local/lib/frei0r-1/},
  6113. @file{/usr/lib/frei0r-1/}.
  6114. @item filter_params
  6115. A '|'-separated list of parameters to pass to the frei0r effect.
  6116. @end table
  6117. A frei0r effect parameter can be a boolean (its value is either
  6118. "y" or "n"), a double, a color (specified as
  6119. @var{R}/@var{G}/@var{B}, where @var{R}, @var{G}, and @var{B} are floating point
  6120. numbers between 0.0 and 1.0, inclusive) or by a color description specified in the "Color"
  6121. section in the ffmpeg-utils manual), a position (specified as @var{X}/@var{Y}, where
  6122. @var{X} and @var{Y} are floating point numbers) and/or a string.
  6123. The number and types of parameters depend on the loaded effect. If an
  6124. effect parameter is not specified, the default value is set.
  6125. @subsection Examples
  6126. @itemize
  6127. @item
  6128. Apply the distort0r effect, setting the first two double parameters:
  6129. @example
  6130. frei0r=filter_name=distort0r:filter_params=0.5|0.01
  6131. @end example
  6132. @item
  6133. Apply the colordistance effect, taking a color as the first parameter:
  6134. @example
  6135. frei0r=colordistance:0.2/0.3/0.4
  6136. frei0r=colordistance:violet
  6137. frei0r=colordistance:0x112233
  6138. @end example
  6139. @item
  6140. Apply the perspective effect, specifying the top left and top right image
  6141. positions:
  6142. @example
  6143. frei0r=perspective:0.2/0.2|0.8/0.2
  6144. @end example
  6145. @end itemize
  6146. For more information, see
  6147. @url{http://frei0r.dyne.org}
  6148. @section fspp
  6149. Apply fast and simple postprocessing. It is a faster version of @ref{spp}.
  6150. It splits (I)DCT into horizontal/vertical passes. Unlike the simple post-
  6151. processing filter, one of them is performed once per block, not per pixel.
  6152. This allows for much higher speed.
  6153. The filter accepts the following options:
  6154. @table @option
  6155. @item quality
  6156. Set quality. This option defines the number of levels for averaging. It accepts
  6157. an integer in the range 4-5. Default value is @code{4}.
  6158. @item qp
  6159. Force a constant quantization parameter. It accepts an integer in range 0-63.
  6160. If not set, the filter will use the QP from the video stream (if available).
  6161. @item strength
  6162. Set filter strength. It accepts an integer in range -15 to 32. Lower values mean
  6163. more details but also more artifacts, while higher values make the image smoother
  6164. but also blurrier. Default value is @code{0} − PSNR optimal.
  6165. @item use_bframe_qp
  6166. Enable the use of the QP from the B-Frames if set to @code{1}. Using this
  6167. option may cause flicker since the B-Frames have often larger QP. Default is
  6168. @code{0} (not enabled).
  6169. @end table
  6170. @section geq
  6171. The filter accepts the following options:
  6172. @table @option
  6173. @item lum_expr, lum
  6174. Set the luminance expression.
  6175. @item cb_expr, cb
  6176. Set the chrominance blue expression.
  6177. @item cr_expr, cr
  6178. Set the chrominance red expression.
  6179. @item alpha_expr, a
  6180. Set the alpha expression.
  6181. @item red_expr, r
  6182. Set the red expression.
  6183. @item green_expr, g
  6184. Set the green expression.
  6185. @item blue_expr, b
  6186. Set the blue expression.
  6187. @end table
  6188. The colorspace is selected according to the specified options. If one
  6189. of the @option{lum_expr}, @option{cb_expr}, or @option{cr_expr}
  6190. options is specified, the filter will automatically select a YCbCr
  6191. colorspace. If one of the @option{red_expr}, @option{green_expr}, or
  6192. @option{blue_expr} options is specified, it will select an RGB
  6193. colorspace.
  6194. If one of the chrominance expression is not defined, it falls back on the other
  6195. one. If no alpha expression is specified it will evaluate to opaque value.
  6196. If none of chrominance expressions are specified, they will evaluate
  6197. to the luminance expression.
  6198. The expressions can use the following variables and functions:
  6199. @table @option
  6200. @item N
  6201. The sequential number of the filtered frame, starting from @code{0}.
  6202. @item X
  6203. @item Y
  6204. The coordinates of the current sample.
  6205. @item W
  6206. @item H
  6207. The width and height of the image.
  6208. @item SW
  6209. @item SH
  6210. Width and height scale depending on the currently filtered plane. It is the
  6211. ratio between the corresponding luma plane number of pixels and the current
  6212. plane ones. E.g. for YUV4:2:0 the values are @code{1,1} for the luma plane, and
  6213. @code{0.5,0.5} for chroma planes.
  6214. @item T
  6215. Time of the current frame, expressed in seconds.
  6216. @item p(x, y)
  6217. Return the value of the pixel at location (@var{x},@var{y}) of the current
  6218. plane.
  6219. @item lum(x, y)
  6220. Return the value of the pixel at location (@var{x},@var{y}) of the luminance
  6221. plane.
  6222. @item cb(x, y)
  6223. Return the value of the pixel at location (@var{x},@var{y}) of the
  6224. blue-difference chroma plane. Return 0 if there is no such plane.
  6225. @item cr(x, y)
  6226. Return the value of the pixel at location (@var{x},@var{y}) of the
  6227. red-difference chroma plane. Return 0 if there is no such plane.
  6228. @item r(x, y)
  6229. @item g(x, y)
  6230. @item b(x, y)
  6231. Return the value of the pixel at location (@var{x},@var{y}) of the
  6232. red/green/blue component. Return 0 if there is no such component.
  6233. @item alpha(x, y)
  6234. Return the value of the pixel at location (@var{x},@var{y}) of the alpha
  6235. plane. Return 0 if there is no such plane.
  6236. @end table
  6237. For functions, if @var{x} and @var{y} are outside the area, the value will be
  6238. automatically clipped to the closer edge.
  6239. @subsection Examples
  6240. @itemize
  6241. @item
  6242. Flip the image horizontally:
  6243. @example
  6244. geq=p(W-X\,Y)
  6245. @end example
  6246. @item
  6247. Generate a bidimensional sine wave, with angle @code{PI/3} and a
  6248. wavelength of 100 pixels:
  6249. @example
  6250. geq=128 + 100*sin(2*(PI/100)*(cos(PI/3)*(X-50*T) + sin(PI/3)*Y)):128:128
  6251. @end example
  6252. @item
  6253. Generate a fancy enigmatic moving light:
  6254. @example
  6255. nullsrc=s=256x256,geq=random(1)/hypot(X-cos(N*0.07)*W/2-W/2\,Y-sin(N*0.09)*H/2-H/2)^2*1000000*sin(N*0.02):128:128
  6256. @end example
  6257. @item
  6258. Generate a quick emboss effect:
  6259. @example
  6260. format=gray,geq=lum_expr='(p(X,Y)+(256-p(X-4,Y-4)))/2'
  6261. @end example
  6262. @item
  6263. Modify RGB components depending on pixel position:
  6264. @example
  6265. geq=r='X/W*r(X,Y)':g='(1-X/W)*g(X,Y)':b='(H-Y)/H*b(X,Y)'
  6266. @end example
  6267. @item
  6268. Create a radial gradient that is the same size as the input (also see
  6269. the @ref{vignette} filter):
  6270. @example
  6271. geq=lum=255*gauss((X/W-0.5)*3)*gauss((Y/H-0.5)*3)/gauss(0)/gauss(0),format=gray
  6272. @end example
  6273. @end itemize
  6274. @section gradfun
  6275. Fix the banding artifacts that are sometimes introduced into nearly flat
  6276. regions by truncation to 8bit color depth.
  6277. Interpolate the gradients that should go where the bands are, and
  6278. dither them.
  6279. It is designed for playback only. Do not use it prior to
  6280. lossy compression, because compression tends to lose the dither and
  6281. bring back the bands.
  6282. It accepts the following parameters:
  6283. @table @option
  6284. @item strength
  6285. The maximum amount by which the filter will change any one pixel. This is also
  6286. the threshold for detecting nearly flat regions. Acceptable values range from
  6287. .51 to 64; the default value is 1.2. Out-of-range values will be clipped to the
  6288. valid range.
  6289. @item radius
  6290. The neighborhood to fit the gradient to. A larger radius makes for smoother
  6291. gradients, but also prevents the filter from modifying the pixels near detailed
  6292. regions. Acceptable values are 8-32; the default value is 16. Out-of-range
  6293. values will be clipped to the valid range.
  6294. @end table
  6295. Alternatively, the options can be specified as a flat string:
  6296. @var{strength}[:@var{radius}]
  6297. @subsection Examples
  6298. @itemize
  6299. @item
  6300. Apply the filter with a @code{3.5} strength and radius of @code{8}:
  6301. @example
  6302. gradfun=3.5:8
  6303. @end example
  6304. @item
  6305. Specify radius, omitting the strength (which will fall-back to the default
  6306. value):
  6307. @example
  6308. gradfun=radius=8
  6309. @end example
  6310. @end itemize
  6311. @anchor{haldclut}
  6312. @section haldclut
  6313. Apply a Hald CLUT to a video stream.
  6314. First input is the video stream to process, and second one is the Hald CLUT.
  6315. The Hald CLUT input can be a simple picture or a complete video stream.
  6316. The filter accepts the following options:
  6317. @table @option
  6318. @item shortest
  6319. Force termination when the shortest input terminates. Default is @code{0}.
  6320. @item repeatlast
  6321. Continue applying the last CLUT after the end of the stream. A value of
  6322. @code{0} disable the filter after the last frame of the CLUT is reached.
  6323. Default is @code{1}.
  6324. @end table
  6325. @code{haldclut} also has the same interpolation options as @ref{lut3d} (both
  6326. filters share the same internals).
  6327. More information about the Hald CLUT can be found on Eskil Steenberg's website
  6328. (Hald CLUT author) at @url{http://www.quelsolaar.com/technology/clut.html}.
  6329. @subsection Workflow examples
  6330. @subsubsection Hald CLUT video stream
  6331. Generate an identity Hald CLUT stream altered with various effects:
  6332. @example
  6333. ffmpeg -f lavfi -i @ref{haldclutsrc}=8 -vf "hue=H=2*PI*t:s=sin(2*PI*t)+1, curves=cross_process" -t 10 -c:v ffv1 clut.nut
  6334. @end example
  6335. Note: make sure you use a lossless codec.
  6336. Then use it with @code{haldclut} to apply it on some random stream:
  6337. @example
  6338. ffmpeg -f lavfi -i mandelbrot -i clut.nut -filter_complex '[0][1] haldclut' -t 20 mandelclut.mkv
  6339. @end example
  6340. The Hald CLUT will be applied to the 10 first seconds (duration of
  6341. @file{clut.nut}), then the latest picture of that CLUT stream will be applied
  6342. to the remaining frames of the @code{mandelbrot} stream.
  6343. @subsubsection Hald CLUT with preview
  6344. A Hald CLUT is supposed to be a squared image of @code{Level*Level*Level} by
  6345. @code{Level*Level*Level} pixels. For a given Hald CLUT, FFmpeg will select the
  6346. biggest possible square starting at the top left of the picture. The remaining
  6347. padding pixels (bottom or right) will be ignored. This area can be used to add
  6348. a preview of the Hald CLUT.
  6349. Typically, the following generated Hald CLUT will be supported by the
  6350. @code{haldclut} filter:
  6351. @example
  6352. ffmpeg -f lavfi -i @ref{haldclutsrc}=8 -vf "
  6353. pad=iw+320 [padded_clut];
  6354. smptebars=s=320x256, split [a][b];
  6355. [padded_clut][a] overlay=W-320:h, curves=color_negative [main];
  6356. [main][b] overlay=W-320" -frames:v 1 clut.png
  6357. @end example
  6358. It contains the original and a preview of the effect of the CLUT: SMPTE color
  6359. bars are displayed on the right-top, and below the same color bars processed by
  6360. the color changes.
  6361. Then, the effect of this Hald CLUT can be visualized with:
  6362. @example
  6363. ffplay input.mkv -vf "movie=clut.png, [in] haldclut"
  6364. @end example
  6365. @section hdcd
  6366. Decodes high definition audio cd data. 16-Bit PCM stream containing hdcd flags
  6367. is converted to 20-bit PCM stream.
  6368. @section hflip
  6369. Flip the input video horizontally.
  6370. For example, to horizontally flip the input video with @command{ffmpeg}:
  6371. @example
  6372. ffmpeg -i in.avi -vf "hflip" out.avi
  6373. @end example
  6374. @section histeq
  6375. This filter applies a global color histogram equalization on a
  6376. per-frame basis.
  6377. It can be used to correct video that has a compressed range of pixel
  6378. intensities. The filter redistributes the pixel intensities to
  6379. equalize their distribution across the intensity range. It may be
  6380. viewed as an "automatically adjusting contrast filter". This filter is
  6381. useful only for correcting degraded or poorly captured source
  6382. video.
  6383. The filter accepts the following options:
  6384. @table @option
  6385. @item strength
  6386. Determine the amount of equalization to be applied. As the strength
  6387. is reduced, the distribution of pixel intensities more-and-more
  6388. approaches that of the input frame. The value must be a float number
  6389. in the range [0,1] and defaults to 0.200.
  6390. @item intensity
  6391. Set the maximum intensity that can generated and scale the output
  6392. values appropriately. The strength should be set as desired and then
  6393. the intensity can be limited if needed to avoid washing-out. The value
  6394. must be a float number in the range [0,1] and defaults to 0.210.
  6395. @item antibanding
  6396. Set the antibanding level. If enabled the filter will randomly vary
  6397. the luminance of output pixels by a small amount to avoid banding of
  6398. the histogram. Possible values are @code{none}, @code{weak} or
  6399. @code{strong}. It defaults to @code{none}.
  6400. @end table
  6401. @section histogram
  6402. Compute and draw a color distribution histogram for the input video.
  6403. The computed histogram is a representation of the color component
  6404. distribution in an image.
  6405. Standard histogram displays the color components distribution in an image.
  6406. Displays color graph for each color component. Shows distribution of
  6407. the Y, U, V, A or R, G, B components, depending on input format, in the
  6408. current frame. Below each graph a color component scale meter is shown.
  6409. The filter accepts the following options:
  6410. @table @option
  6411. @item level_height
  6412. Set height of level. Default value is @code{200}.
  6413. Allowed range is [50, 2048].
  6414. @item scale_height
  6415. Set height of color scale. Default value is @code{12}.
  6416. Allowed range is [0, 40].
  6417. @item display_mode
  6418. Set display mode.
  6419. It accepts the following values:
  6420. @table @samp
  6421. @item parade
  6422. Per color component graphs are placed below each other.
  6423. @item overlay
  6424. Presents information identical to that in the @code{parade}, except
  6425. that the graphs representing color components are superimposed directly
  6426. over one another.
  6427. @end table
  6428. Default is @code{parade}.
  6429. @item levels_mode
  6430. Set mode. Can be either @code{linear}, or @code{logarithmic}.
  6431. Default is @code{linear}.
  6432. @item components
  6433. Set what color components to display.
  6434. Default is @code{7}.
  6435. @end table
  6436. @subsection Examples
  6437. @itemize
  6438. @item
  6439. Calculate and draw histogram:
  6440. @example
  6441. ffplay -i input -vf histogram
  6442. @end example
  6443. @end itemize
  6444. @anchor{hqdn3d}
  6445. @section hqdn3d
  6446. This is a high precision/quality 3d denoise filter. It aims to reduce
  6447. image noise, producing smooth images and making still images really
  6448. still. It should enhance compressibility.
  6449. It accepts the following optional parameters:
  6450. @table @option
  6451. @item luma_spatial
  6452. A non-negative floating point number which specifies spatial luma strength.
  6453. It defaults to 4.0.
  6454. @item chroma_spatial
  6455. A non-negative floating point number which specifies spatial chroma strength.
  6456. It defaults to 3.0*@var{luma_spatial}/4.0.
  6457. @item luma_tmp
  6458. A floating point number which specifies luma temporal strength. It defaults to
  6459. 6.0*@var{luma_spatial}/4.0.
  6460. @item chroma_tmp
  6461. A floating point number which specifies chroma temporal strength. It defaults to
  6462. @var{luma_tmp}*@var{chroma_spatial}/@var{luma_spatial}.
  6463. @end table
  6464. @anchor{hwupload_cuda}
  6465. @section hwupload_cuda
  6466. Upload system memory frames to a CUDA device.
  6467. It accepts the following optional parameters:
  6468. @table @option
  6469. @item device
  6470. The number of the CUDA device to use
  6471. @end table
  6472. @section hqx
  6473. Apply a high-quality magnification filter designed for pixel art. This filter
  6474. was originally created by Maxim Stepin.
  6475. It accepts the following option:
  6476. @table @option
  6477. @item n
  6478. Set the scaling dimension: @code{2} for @code{hq2x}, @code{3} for
  6479. @code{hq3x} and @code{4} for @code{hq4x}.
  6480. Default is @code{3}.
  6481. @end table
  6482. @section hstack
  6483. Stack input videos horizontally.
  6484. All streams must be of same pixel format and of same height.
  6485. Note that this filter is faster than using @ref{overlay} and @ref{pad} filter
  6486. to create same output.
  6487. The filter accept the following option:
  6488. @table @option
  6489. @item inputs
  6490. Set number of input streams. Default is 2.
  6491. @item shortest
  6492. If set to 1, force the output to terminate when the shortest input
  6493. terminates. Default value is 0.
  6494. @end table
  6495. @section hue
  6496. Modify the hue and/or the saturation of the input.
  6497. It accepts the following parameters:
  6498. @table @option
  6499. @item h
  6500. Specify the hue angle as a number of degrees. It accepts an expression,
  6501. and defaults to "0".
  6502. @item s
  6503. Specify the saturation in the [-10,10] range. It accepts an expression and
  6504. defaults to "1".
  6505. @item H
  6506. Specify the hue angle as a number of radians. It accepts an
  6507. expression, and defaults to "0".
  6508. @item b
  6509. Specify the brightness in the [-10,10] range. It accepts an expression and
  6510. defaults to "0".
  6511. @end table
  6512. @option{h} and @option{H} are mutually exclusive, and can't be
  6513. specified at the same time.
  6514. The @option{b}, @option{h}, @option{H} and @option{s} option values are
  6515. expressions containing the following constants:
  6516. @table @option
  6517. @item n
  6518. frame count of the input frame starting from 0
  6519. @item pts
  6520. presentation timestamp of the input frame expressed in time base units
  6521. @item r
  6522. frame rate of the input video, NAN if the input frame rate is unknown
  6523. @item t
  6524. timestamp expressed in seconds, NAN if the input timestamp is unknown
  6525. @item tb
  6526. time base of the input video
  6527. @end table
  6528. @subsection Examples
  6529. @itemize
  6530. @item
  6531. Set the hue to 90 degrees and the saturation to 1.0:
  6532. @example
  6533. hue=h=90:s=1
  6534. @end example
  6535. @item
  6536. Same command but expressing the hue in radians:
  6537. @example
  6538. hue=H=PI/2:s=1
  6539. @end example
  6540. @item
  6541. Rotate hue and make the saturation swing between 0
  6542. and 2 over a period of 1 second:
  6543. @example
  6544. hue="H=2*PI*t: s=sin(2*PI*t)+1"
  6545. @end example
  6546. @item
  6547. Apply a 3 seconds saturation fade-in effect starting at 0:
  6548. @example
  6549. hue="s=min(t/3\,1)"
  6550. @end example
  6551. The general fade-in expression can be written as:
  6552. @example
  6553. hue="s=min(0\, max((t-START)/DURATION\, 1))"
  6554. @end example
  6555. @item
  6556. Apply a 3 seconds saturation fade-out effect starting at 5 seconds:
  6557. @example
  6558. hue="s=max(0\, min(1\, (8-t)/3))"
  6559. @end example
  6560. The general fade-out expression can be written as:
  6561. @example
  6562. hue="s=max(0\, min(1\, (START+DURATION-t)/DURATION))"
  6563. @end example
  6564. @end itemize
  6565. @subsection Commands
  6566. This filter supports the following commands:
  6567. @table @option
  6568. @item b
  6569. @item s
  6570. @item h
  6571. @item H
  6572. Modify the hue and/or the saturation and/or brightness of the input video.
  6573. The command accepts the same syntax of the corresponding option.
  6574. If the specified expression is not valid, it is kept at its current
  6575. value.
  6576. @end table
  6577. @section idet
  6578. Detect video interlacing type.
  6579. This filter tries to detect if the input frames as interlaced, progressive,
  6580. top or bottom field first. It will also try and detect fields that are
  6581. repeated between adjacent frames (a sign of telecine).
  6582. Single frame detection considers only immediately adjacent frames when classifying each frame.
  6583. Multiple frame detection incorporates the classification history of previous frames.
  6584. The filter will log these metadata values:
  6585. @table @option
  6586. @item single.current_frame
  6587. Detected type of current frame using single-frame detection. One of:
  6588. ``tff'' (top field first), ``bff'' (bottom field first),
  6589. ``progressive'', or ``undetermined''
  6590. @item single.tff
  6591. Cumulative number of frames detected as top field first using single-frame detection.
  6592. @item multiple.tff
  6593. Cumulative number of frames detected as top field first using multiple-frame detection.
  6594. @item single.bff
  6595. Cumulative number of frames detected as bottom field first using single-frame detection.
  6596. @item multiple.current_frame
  6597. Detected type of current frame using multiple-frame detection. One of:
  6598. ``tff'' (top field first), ``bff'' (bottom field first),
  6599. ``progressive'', or ``undetermined''
  6600. @item multiple.bff
  6601. Cumulative number of frames detected as bottom field first using multiple-frame detection.
  6602. @item single.progressive
  6603. Cumulative number of frames detected as progressive using single-frame detection.
  6604. @item multiple.progressive
  6605. Cumulative number of frames detected as progressive using multiple-frame detection.
  6606. @item single.undetermined
  6607. Cumulative number of frames that could not be classified using single-frame detection.
  6608. @item multiple.undetermined
  6609. Cumulative number of frames that could not be classified using multiple-frame detection.
  6610. @item repeated.current_frame
  6611. Which field in the current frame is repeated from the last. One of ``neither'', ``top'', or ``bottom''.
  6612. @item repeated.neither
  6613. Cumulative number of frames with no repeated field.
  6614. @item repeated.top
  6615. Cumulative number of frames with the top field repeated from the previous frame's top field.
  6616. @item repeated.bottom
  6617. Cumulative number of frames with the bottom field repeated from the previous frame's bottom field.
  6618. @end table
  6619. The filter accepts the following options:
  6620. @table @option
  6621. @item intl_thres
  6622. Set interlacing threshold.
  6623. @item prog_thres
  6624. Set progressive threshold.
  6625. @item rep_thres
  6626. Threshold for repeated field detection.
  6627. @item half_life
  6628. Number of frames after which a given frame's contribution to the
  6629. statistics is halved (i.e., it contributes only 0.5 to it's
  6630. classification). The default of 0 means that all frames seen are given
  6631. full weight of 1.0 forever.
  6632. @item analyze_interlaced_flag
  6633. When this is not 0 then idet will use the specified number of frames to determine
  6634. if the interlaced flag is accurate, it will not count undetermined frames.
  6635. If the flag is found to be accurate it will be used without any further
  6636. computations, if it is found to be inaccurate it will be cleared without any
  6637. further computations. This allows inserting the idet filter as a low computational
  6638. method to clean up the interlaced flag
  6639. @end table
  6640. @section il
  6641. Deinterleave or interleave fields.
  6642. This filter allows one to process interlaced images fields without
  6643. deinterlacing them. Deinterleaving splits the input frame into 2
  6644. fields (so called half pictures). Odd lines are moved to the top
  6645. half of the output image, even lines to the bottom half.
  6646. You can process (filter) them independently and then re-interleave them.
  6647. The filter accepts the following options:
  6648. @table @option
  6649. @item luma_mode, l
  6650. @item chroma_mode, c
  6651. @item alpha_mode, a
  6652. Available values for @var{luma_mode}, @var{chroma_mode} and
  6653. @var{alpha_mode} are:
  6654. @table @samp
  6655. @item none
  6656. Do nothing.
  6657. @item deinterleave, d
  6658. Deinterleave fields, placing one above the other.
  6659. @item interleave, i
  6660. Interleave fields. Reverse the effect of deinterleaving.
  6661. @end table
  6662. Default value is @code{none}.
  6663. @item luma_swap, ls
  6664. @item chroma_swap, cs
  6665. @item alpha_swap, as
  6666. Swap luma/chroma/alpha fields. Exchange even & odd lines. Default value is @code{0}.
  6667. @end table
  6668. @section inflate
  6669. Apply inflate effect to the video.
  6670. This filter replaces the pixel by the local(3x3) average by taking into account
  6671. only values higher than the pixel.
  6672. It accepts the following options:
  6673. @table @option
  6674. @item threshold0
  6675. @item threshold1
  6676. @item threshold2
  6677. @item threshold3
  6678. Limit the maximum change for each plane, default is 65535.
  6679. If 0, plane will remain unchanged.
  6680. @end table
  6681. @section interlace
  6682. Simple interlacing filter from progressive contents. This interleaves upper (or
  6683. lower) lines from odd frames with lower (or upper) lines from even frames,
  6684. halving the frame rate and preserving image height.
  6685. @example
  6686. Original Original New Frame
  6687. Frame 'j' Frame 'j+1' (tff)
  6688. ========== =========== ==================
  6689. Line 0 --------------------> Frame 'j' Line 0
  6690. Line 1 Line 1 ----> Frame 'j+1' Line 1
  6691. Line 2 ---------------------> Frame 'j' Line 2
  6692. Line 3 Line 3 ----> Frame 'j+1' Line 3
  6693. ... ... ...
  6694. New Frame + 1 will be generated by Frame 'j+2' and Frame 'j+3' and so on
  6695. @end example
  6696. It accepts the following optional parameters:
  6697. @table @option
  6698. @item scan
  6699. This determines whether the interlaced frame is taken from the even
  6700. (tff - default) or odd (bff) lines of the progressive frame.
  6701. @item lowpass
  6702. Enable (default) or disable the vertical lowpass filter to avoid twitter
  6703. interlacing and reduce moire patterns.
  6704. @end table
  6705. @section kerndeint
  6706. Deinterlace input video by applying Donald Graft's adaptive kernel
  6707. deinterling. Work on interlaced parts of a video to produce
  6708. progressive frames.
  6709. The description of the accepted parameters follows.
  6710. @table @option
  6711. @item thresh
  6712. Set the threshold which affects the filter's tolerance when
  6713. determining if a pixel line must be processed. It must be an integer
  6714. in the range [0,255] and defaults to 10. A value of 0 will result in
  6715. applying the process on every pixels.
  6716. @item map
  6717. Paint pixels exceeding the threshold value to white if set to 1.
  6718. Default is 0.
  6719. @item order
  6720. Set the fields order. Swap fields if set to 1, leave fields alone if
  6721. 0. Default is 0.
  6722. @item sharp
  6723. Enable additional sharpening if set to 1. Default is 0.
  6724. @item twoway
  6725. Enable twoway sharpening if set to 1. Default is 0.
  6726. @end table
  6727. @subsection Examples
  6728. @itemize
  6729. @item
  6730. Apply default values:
  6731. @example
  6732. kerndeint=thresh=10:map=0:order=0:sharp=0:twoway=0
  6733. @end example
  6734. @item
  6735. Enable additional sharpening:
  6736. @example
  6737. kerndeint=sharp=1
  6738. @end example
  6739. @item
  6740. Paint processed pixels in white:
  6741. @example
  6742. kerndeint=map=1
  6743. @end example
  6744. @end itemize
  6745. @section lenscorrection
  6746. Correct radial lens distortion
  6747. This filter can be used to correct for radial distortion as can result from the use
  6748. of wide angle lenses, and thereby re-rectify the image. To find the right parameters
  6749. one can use tools available for example as part of opencv or simply trial-and-error.
  6750. To use opencv use the calibration sample (under samples/cpp) from the opencv sources
  6751. and extract the k1 and k2 coefficients from the resulting matrix.
  6752. Note that effectively the same filter is available in the open-source tools Krita and
  6753. Digikam from the KDE project.
  6754. In contrast to the @ref{vignette} filter, which can also be used to compensate lens errors,
  6755. this filter corrects the distortion of the image, whereas @ref{vignette} corrects the
  6756. brightness distribution, so you may want to use both filters together in certain
  6757. cases, though you will have to take care of ordering, i.e. whether vignetting should
  6758. be applied before or after lens correction.
  6759. @subsection Options
  6760. The filter accepts the following options:
  6761. @table @option
  6762. @item cx
  6763. Relative x-coordinate of the focal point of the image, and thereby the center of the
  6764. distortion. This value has a range [0,1] and is expressed as fractions of the image
  6765. width.
  6766. @item cy
  6767. Relative y-coordinate of the focal point of the image, and thereby the center of the
  6768. distortion. This value has a range [0,1] and is expressed as fractions of the image
  6769. height.
  6770. @item k1
  6771. Coefficient of the quadratic correction term. 0.5 means no correction.
  6772. @item k2
  6773. Coefficient of the double quadratic correction term. 0.5 means no correction.
  6774. @end table
  6775. The formula that generates the correction is:
  6776. @var{r_src} = @var{r_tgt} * (1 + @var{k1} * (@var{r_tgt} / @var{r_0})^2 + @var{k2} * (@var{r_tgt} / @var{r_0})^4)
  6777. where @var{r_0} is halve of the image diagonal and @var{r_src} and @var{r_tgt} are the
  6778. distances from the focal point in the source and target images, respectively.
  6779. @section loop, aloop
  6780. Loop video frames or audio samples.
  6781. Those filters accepts the following options:
  6782. @table @option
  6783. @item loop
  6784. Set the number of loops.
  6785. @item size
  6786. Set maximal size in number of frames for @code{loop} filter or maximal number
  6787. of samples in case of @code{aloop} filter.
  6788. @item start
  6789. Set first frame of loop for @code{loop} filter or first sample of loop in case
  6790. of @code{aloop} filter.
  6791. @end table
  6792. @anchor{lut3d}
  6793. @section lut3d
  6794. Apply a 3D LUT to an input video.
  6795. The filter accepts the following options:
  6796. @table @option
  6797. @item file
  6798. Set the 3D LUT file name.
  6799. Currently supported formats:
  6800. @table @samp
  6801. @item 3dl
  6802. AfterEffects
  6803. @item cube
  6804. Iridas
  6805. @item dat
  6806. DaVinci
  6807. @item m3d
  6808. Pandora
  6809. @end table
  6810. @item interp
  6811. Select interpolation mode.
  6812. Available values are:
  6813. @table @samp
  6814. @item nearest
  6815. Use values from the nearest defined point.
  6816. @item trilinear
  6817. Interpolate values using the 8 points defining a cube.
  6818. @item tetrahedral
  6819. Interpolate values using a tetrahedron.
  6820. @end table
  6821. @end table
  6822. @section lut, lutrgb, lutyuv
  6823. Compute a look-up table for binding each pixel component input value
  6824. to an output value, and apply it to the input video.
  6825. @var{lutyuv} applies a lookup table to a YUV input video, @var{lutrgb}
  6826. to an RGB input video.
  6827. These filters accept the following parameters:
  6828. @table @option
  6829. @item c0
  6830. set first pixel component expression
  6831. @item c1
  6832. set second pixel component expression
  6833. @item c2
  6834. set third pixel component expression
  6835. @item c3
  6836. set fourth pixel component expression, corresponds to the alpha component
  6837. @item r
  6838. set red component expression
  6839. @item g
  6840. set green component expression
  6841. @item b
  6842. set blue component expression
  6843. @item a
  6844. alpha component expression
  6845. @item y
  6846. set Y/luminance component expression
  6847. @item u
  6848. set U/Cb component expression
  6849. @item v
  6850. set V/Cr component expression
  6851. @end table
  6852. Each of them specifies the expression to use for computing the lookup table for
  6853. the corresponding pixel component values.
  6854. The exact component associated to each of the @var{c*} options depends on the
  6855. format in input.
  6856. The @var{lut} filter requires either YUV or RGB pixel formats in input,
  6857. @var{lutrgb} requires RGB pixel formats in input, and @var{lutyuv} requires YUV.
  6858. The expressions can contain the following constants and functions:
  6859. @table @option
  6860. @item w
  6861. @item h
  6862. The input width and height.
  6863. @item val
  6864. The input value for the pixel component.
  6865. @item clipval
  6866. The input value, clipped to the @var{minval}-@var{maxval} range.
  6867. @item maxval
  6868. The maximum value for the pixel component.
  6869. @item minval
  6870. The minimum value for the pixel component.
  6871. @item negval
  6872. The negated value for the pixel component value, clipped to the
  6873. @var{minval}-@var{maxval} range; it corresponds to the expression
  6874. "maxval-clipval+minval".
  6875. @item clip(val)
  6876. The computed value in @var{val}, clipped to the
  6877. @var{minval}-@var{maxval} range.
  6878. @item gammaval(gamma)
  6879. The computed gamma correction value of the pixel component value,
  6880. clipped to the @var{minval}-@var{maxval} range. It corresponds to the
  6881. expression
  6882. "pow((clipval-minval)/(maxval-minval)\,@var{gamma})*(maxval-minval)+minval"
  6883. @end table
  6884. All expressions default to "val".
  6885. @subsection Examples
  6886. @itemize
  6887. @item
  6888. Negate input video:
  6889. @example
  6890. lutrgb="r=maxval+minval-val:g=maxval+minval-val:b=maxval+minval-val"
  6891. lutyuv="y=maxval+minval-val:u=maxval+minval-val:v=maxval+minval-val"
  6892. @end example
  6893. The above is the same as:
  6894. @example
  6895. lutrgb="r=negval:g=negval:b=negval"
  6896. lutyuv="y=negval:u=negval:v=negval"
  6897. @end example
  6898. @item
  6899. Negate luminance:
  6900. @example
  6901. lutyuv=y=negval
  6902. @end example
  6903. @item
  6904. Remove chroma components, turning the video into a graytone image:
  6905. @example
  6906. lutyuv="u=128:v=128"
  6907. @end example
  6908. @item
  6909. Apply a luma burning effect:
  6910. @example
  6911. lutyuv="y=2*val"
  6912. @end example
  6913. @item
  6914. Remove green and blue components:
  6915. @example
  6916. lutrgb="g=0:b=0"
  6917. @end example
  6918. @item
  6919. Set a constant alpha channel value on input:
  6920. @example
  6921. format=rgba,lutrgb=a="maxval-minval/2"
  6922. @end example
  6923. @item
  6924. Correct luminance gamma by a factor of 0.5:
  6925. @example
  6926. lutyuv=y=gammaval(0.5)
  6927. @end example
  6928. @item
  6929. Discard least significant bits of luma:
  6930. @example
  6931. lutyuv=y='bitand(val, 128+64+32)'
  6932. @end example
  6933. @end itemize
  6934. @section maskedmerge
  6935. Merge the first input stream with the second input stream using per pixel
  6936. weights in the third input stream.
  6937. A value of 0 in the third stream pixel component means that pixel component
  6938. from first stream is returned unchanged, while maximum value (eg. 255 for
  6939. 8-bit videos) means that pixel component from second stream is returned
  6940. unchanged. Intermediate values define the amount of merging between both
  6941. input stream's pixel components.
  6942. This filter accepts the following options:
  6943. @table @option
  6944. @item planes
  6945. Set which planes will be processed as bitmap, unprocessed planes will be
  6946. copied from first stream.
  6947. By default value 0xf, all planes will be processed.
  6948. @end table
  6949. @section mcdeint
  6950. Apply motion-compensation deinterlacing.
  6951. It needs one field per frame as input and must thus be used together
  6952. with yadif=1/3 or equivalent.
  6953. This filter accepts the following options:
  6954. @table @option
  6955. @item mode
  6956. Set the deinterlacing mode.
  6957. It accepts one of the following values:
  6958. @table @samp
  6959. @item fast
  6960. @item medium
  6961. @item slow
  6962. use iterative motion estimation
  6963. @item extra_slow
  6964. like @samp{slow}, but use multiple reference frames.
  6965. @end table
  6966. Default value is @samp{fast}.
  6967. @item parity
  6968. Set the picture field parity assumed for the input video. It must be
  6969. one of the following values:
  6970. @table @samp
  6971. @item 0, tff
  6972. assume top field first
  6973. @item 1, bff
  6974. assume bottom field first
  6975. @end table
  6976. Default value is @samp{bff}.
  6977. @item qp
  6978. Set per-block quantization parameter (QP) used by the internal
  6979. encoder.
  6980. Higher values should result in a smoother motion vector field but less
  6981. optimal individual vectors. Default value is 1.
  6982. @end table
  6983. @section mergeplanes
  6984. Merge color channel components from several video streams.
  6985. The filter accepts up to 4 input streams, and merge selected input
  6986. planes to the output video.
  6987. This filter accepts the following options:
  6988. @table @option
  6989. @item mapping
  6990. Set input to output plane mapping. Default is @code{0}.
  6991. The mappings is specified as a bitmap. It should be specified as a
  6992. hexadecimal number in the form 0xAa[Bb[Cc[Dd]]]. 'Aa' describes the
  6993. mapping for the first plane of the output stream. 'A' sets the number of
  6994. the input stream to use (from 0 to 3), and 'a' the plane number of the
  6995. corresponding input to use (from 0 to 3). The rest of the mappings is
  6996. similar, 'Bb' describes the mapping for the output stream second
  6997. plane, 'Cc' describes the mapping for the output stream third plane and
  6998. 'Dd' describes the mapping for the output stream fourth plane.
  6999. @item format
  7000. Set output pixel format. Default is @code{yuva444p}.
  7001. @end table
  7002. @subsection Examples
  7003. @itemize
  7004. @item
  7005. Merge three gray video streams of same width and height into single video stream:
  7006. @example
  7007. [a0][a1][a2]mergeplanes=0x001020:yuv444p
  7008. @end example
  7009. @item
  7010. Merge 1st yuv444p stream and 2nd gray video stream into yuva444p video stream:
  7011. @example
  7012. [a0][a1]mergeplanes=0x00010210:yuva444p
  7013. @end example
  7014. @item
  7015. Swap Y and A plane in yuva444p stream:
  7016. @example
  7017. format=yuva444p,mergeplanes=0x03010200:yuva444p
  7018. @end example
  7019. @item
  7020. Swap U and V plane in yuv420p stream:
  7021. @example
  7022. format=yuv420p,mergeplanes=0x000201:yuv420p
  7023. @end example
  7024. @item
  7025. Cast a rgb24 clip to yuv444p:
  7026. @example
  7027. format=rgb24,mergeplanes=0x000102:yuv444p
  7028. @end example
  7029. @end itemize
  7030. @section metadata, ametadata
  7031. Manipulate frame metadata.
  7032. This filter accepts the following options:
  7033. @table @option
  7034. @item mode
  7035. Set mode of operation of the filter.
  7036. Can be one of the following:
  7037. @table @samp
  7038. @item select
  7039. If both @code{value} and @code{key} is set, select frames
  7040. which have such metadata. If only @code{key} is set, select
  7041. every frame that has such key in metadata.
  7042. @item add
  7043. Add new metadata @code{key} and @code{value}. If key is already available
  7044. do nothing.
  7045. @item modify
  7046. Modify value of already present key.
  7047. @item delete
  7048. If @code{value} is set, delete only keys that have such value.
  7049. Otherwise, delete key.
  7050. @item print
  7051. Print key and its value if metadata was found. If @code{key} is not set print all
  7052. metadata values available in frame.
  7053. @end table
  7054. @item key
  7055. Set key used with all modes. Must be set for all modes except @code{print}.
  7056. @item value
  7057. Set metadata value which will be used. This option is mandatory for
  7058. @code{modify} and @code{add} mode.
  7059. @item function
  7060. Which function to use when comparing metadata value and @code{value}.
  7061. Can be one of following:
  7062. @table @samp
  7063. @item same_str
  7064. Values are interpreted as strings, returns true if metadata value is same as @code{value}.
  7065. @item starts_with
  7066. Values are interpreted as strings, returns true if metadata value starts with
  7067. the @code{value} option string.
  7068. @item less
  7069. Values are interpreted as floats, returns true if metadata value is less than @code{value}.
  7070. @item equal
  7071. Values are interpreted as floats, returns true if @code{value} is equal with metadata value.
  7072. @item greater
  7073. Values are interpreted as floats, returns true if metadata value is greater than @code{value}.
  7074. @item expr
  7075. Values are interpreted as floats, returns true if expression from option @code{expr}
  7076. evaluates to true.
  7077. @end table
  7078. @item expr
  7079. Set expression which is used when @code{function} is set to @code{expr}.
  7080. The expression is evaluated through the eval API and can contain the following
  7081. constants:
  7082. @table @option
  7083. @item VALUE1
  7084. Float representation of @code{value} from metadata key.
  7085. @item VALUE2
  7086. Float representation of @code{value} as supplied by user in @code{value} option.
  7087. @end table
  7088. @item file
  7089. If specified in @code{print} mode, output is written to the named file. When
  7090. filename equals "-" data is written to standard output.
  7091. If @code{file} option is not set, output is written to the log with AV_LOG_INFO
  7092. loglevel.
  7093. @end table
  7094. @subsection Examples
  7095. @itemize
  7096. @item
  7097. Print all metadata values for frames with key @code{lavfi.singnalstats.YDIF} with values
  7098. between 0 and 1.
  7099. @example
  7100. @end example
  7101. signalstats,metadata=print:key=lavfi.signalstats.YDIF:value=0:function=expr:expr='between(VALUE1,0,1)'
  7102. @end itemize
  7103. @section mpdecimate
  7104. Drop frames that do not differ greatly from the previous frame in
  7105. order to reduce frame rate.
  7106. The main use of this filter is for very-low-bitrate encoding
  7107. (e.g. streaming over dialup modem), but it could in theory be used for
  7108. fixing movies that were inverse-telecined incorrectly.
  7109. A description of the accepted options follows.
  7110. @table @option
  7111. @item max
  7112. Set the maximum number of consecutive frames which can be dropped (if
  7113. positive), or the minimum interval between dropped frames (if
  7114. negative). If the value is 0, the frame is dropped unregarding the
  7115. number of previous sequentially dropped frames.
  7116. Default value is 0.
  7117. @item hi
  7118. @item lo
  7119. @item frac
  7120. Set the dropping threshold values.
  7121. Values for @option{hi} and @option{lo} are for 8x8 pixel blocks and
  7122. represent actual pixel value differences, so a threshold of 64
  7123. corresponds to 1 unit of difference for each pixel, or the same spread
  7124. out differently over the block.
  7125. A frame is a candidate for dropping if no 8x8 blocks differ by more
  7126. than a threshold of @option{hi}, and if no more than @option{frac} blocks (1
  7127. meaning the whole image) differ by more than a threshold of @option{lo}.
  7128. Default value for @option{hi} is 64*12, default value for @option{lo} is
  7129. 64*5, and default value for @option{frac} is 0.33.
  7130. @end table
  7131. @section negate
  7132. Negate input video.
  7133. It accepts an integer in input; if non-zero it negates the
  7134. alpha component (if available). The default value in input is 0.
  7135. @section nnedi
  7136. Deinterlace video using neural network edge directed interpolation.
  7137. This filter accepts the following options:
  7138. @table @option
  7139. @item weights
  7140. Mandatory option, without binary file filter can not work.
  7141. Currently file can be found here:
  7142. https://github.com/dubhater/vapoursynth-nnedi3/blob/master/src/nnedi3_weights.bin
  7143. @item deint
  7144. Set which frames to deinterlace, by default it is @code{all}.
  7145. Can be @code{all} or @code{interlaced}.
  7146. @item field
  7147. Set mode of operation.
  7148. Can be one of the following:
  7149. @table @samp
  7150. @item af
  7151. Use frame flags, both fields.
  7152. @item a
  7153. Use frame flags, single field.
  7154. @item t
  7155. Use top field only.
  7156. @item b
  7157. Use bottom field only.
  7158. @item tf
  7159. Use both fields, top first.
  7160. @item bf
  7161. Use both fields, bottom first.
  7162. @end table
  7163. @item planes
  7164. Set which planes to process, by default filter process all frames.
  7165. @item nsize
  7166. Set size of local neighborhood around each pixel, used by the predictor neural
  7167. network.
  7168. Can be one of the following:
  7169. @table @samp
  7170. @item s8x6
  7171. @item s16x6
  7172. @item s32x6
  7173. @item s48x6
  7174. @item s8x4
  7175. @item s16x4
  7176. @item s32x4
  7177. @end table
  7178. @item nns
  7179. Set the number of neurons in predicctor neural network.
  7180. Can be one of the following:
  7181. @table @samp
  7182. @item n16
  7183. @item n32
  7184. @item n64
  7185. @item n128
  7186. @item n256
  7187. @end table
  7188. @item qual
  7189. Controls the number of different neural network predictions that are blended
  7190. together to compute the final output value. Can be @code{fast}, default or
  7191. @code{slow}.
  7192. @item etype
  7193. Set which set of weights to use in the predictor.
  7194. Can be one of the following:
  7195. @table @samp
  7196. @item a
  7197. weights trained to minimize absolute error
  7198. @item s
  7199. weights trained to minimize squared error
  7200. @end table
  7201. @item pscrn
  7202. Controls whether or not the prescreener neural network is used to decide
  7203. which pixels should be processed by the predictor neural network and which
  7204. can be handled by simple cubic interpolation.
  7205. The prescreener is trained to know whether cubic interpolation will be
  7206. sufficient for a pixel or whether it should be predicted by the predictor nn.
  7207. The computational complexity of the prescreener nn is much less than that of
  7208. the predictor nn. Since most pixels can be handled by cubic interpolation,
  7209. using the prescreener generally results in much faster processing.
  7210. The prescreener is pretty accurate, so the difference between using it and not
  7211. using it is almost always unnoticeable.
  7212. Can be one of the following:
  7213. @table @samp
  7214. @item none
  7215. @item original
  7216. @item new
  7217. @end table
  7218. Default is @code{new}.
  7219. @item fapprox
  7220. Set various debugging flags.
  7221. @end table
  7222. @section noformat
  7223. Force libavfilter not to use any of the specified pixel formats for the
  7224. input to the next filter.
  7225. It accepts the following parameters:
  7226. @table @option
  7227. @item pix_fmts
  7228. A '|'-separated list of pixel format names, such as
  7229. apix_fmts=yuv420p|monow|rgb24".
  7230. @end table
  7231. @subsection Examples
  7232. @itemize
  7233. @item
  7234. Force libavfilter to use a format different from @var{yuv420p} for the
  7235. input to the vflip filter:
  7236. @example
  7237. noformat=pix_fmts=yuv420p,vflip
  7238. @end example
  7239. @item
  7240. Convert the input video to any of the formats not contained in the list:
  7241. @example
  7242. noformat=yuv420p|yuv444p|yuv410p
  7243. @end example
  7244. @end itemize
  7245. @section noise
  7246. Add noise on video input frame.
  7247. The filter accepts the following options:
  7248. @table @option
  7249. @item all_seed
  7250. @item c0_seed
  7251. @item c1_seed
  7252. @item c2_seed
  7253. @item c3_seed
  7254. Set noise seed for specific pixel component or all pixel components in case
  7255. of @var{all_seed}. Default value is @code{123457}.
  7256. @item all_strength, alls
  7257. @item c0_strength, c0s
  7258. @item c1_strength, c1s
  7259. @item c2_strength, c2s
  7260. @item c3_strength, c3s
  7261. Set noise strength for specific pixel component or all pixel components in case
  7262. @var{all_strength}. Default value is @code{0}. Allowed range is [0, 100].
  7263. @item all_flags, allf
  7264. @item c0_flags, c0f
  7265. @item c1_flags, c1f
  7266. @item c2_flags, c2f
  7267. @item c3_flags, c3f
  7268. Set pixel component flags or set flags for all components if @var{all_flags}.
  7269. Available values for component flags are:
  7270. @table @samp
  7271. @item a
  7272. averaged temporal noise (smoother)
  7273. @item p
  7274. mix random noise with a (semi)regular pattern
  7275. @item t
  7276. temporal noise (noise pattern changes between frames)
  7277. @item u
  7278. uniform noise (gaussian otherwise)
  7279. @end table
  7280. @end table
  7281. @subsection Examples
  7282. Add temporal and uniform noise to input video:
  7283. @example
  7284. noise=alls=20:allf=t+u
  7285. @end example
  7286. @section null
  7287. Pass the video source unchanged to the output.
  7288. @section ocr
  7289. Optical Character Recognition
  7290. This filter uses Tesseract for optical character recognition.
  7291. It accepts the following options:
  7292. @table @option
  7293. @item datapath
  7294. Set datapath to tesseract data. Default is to use whatever was
  7295. set at installation.
  7296. @item language
  7297. Set language, default is "eng".
  7298. @item whitelist
  7299. Set character whitelist.
  7300. @item blacklist
  7301. Set character blacklist.
  7302. @end table
  7303. The filter exports recognized text as the frame metadata @code{lavfi.ocr.text}.
  7304. @section ocv
  7305. Apply a video transform using libopencv.
  7306. To enable this filter, install the libopencv library and headers and
  7307. configure FFmpeg with @code{--enable-libopencv}.
  7308. It accepts the following parameters:
  7309. @table @option
  7310. @item filter_name
  7311. The name of the libopencv filter to apply.
  7312. @item filter_params
  7313. The parameters to pass to the libopencv filter. If not specified, the default
  7314. values are assumed.
  7315. @end table
  7316. Refer to the official libopencv documentation for more precise
  7317. information:
  7318. @url{http://docs.opencv.org/master/modules/imgproc/doc/filtering.html}
  7319. Several libopencv filters are supported; see the following subsections.
  7320. @anchor{dilate}
  7321. @subsection dilate
  7322. Dilate an image by using a specific structuring element.
  7323. It corresponds to the libopencv function @code{cvDilate}.
  7324. It accepts the parameters: @var{struct_el}|@var{nb_iterations}.
  7325. @var{struct_el} represents a structuring element, and has the syntax:
  7326. @var{cols}x@var{rows}+@var{anchor_x}x@var{anchor_y}/@var{shape}
  7327. @var{cols} and @var{rows} represent the number of columns and rows of
  7328. the structuring element, @var{anchor_x} and @var{anchor_y} the anchor
  7329. point, and @var{shape} the shape for the structuring element. @var{shape}
  7330. must be "rect", "cross", "ellipse", or "custom".
  7331. If the value for @var{shape} is "custom", it must be followed by a
  7332. string of the form "=@var{filename}". The file with name
  7333. @var{filename} is assumed to represent a binary image, with each
  7334. printable character corresponding to a bright pixel. When a custom
  7335. @var{shape} is used, @var{cols} and @var{rows} are ignored, the number
  7336. or columns and rows of the read file are assumed instead.
  7337. The default value for @var{struct_el} is "3x3+0x0/rect".
  7338. @var{nb_iterations} specifies the number of times the transform is
  7339. applied to the image, and defaults to 1.
  7340. Some examples:
  7341. @example
  7342. # Use the default values
  7343. ocv=dilate
  7344. # Dilate using a structuring element with a 5x5 cross, iterating two times
  7345. ocv=filter_name=dilate:filter_params=5x5+2x2/cross|2
  7346. # Read the shape from the file diamond.shape, iterating two times.
  7347. # The file diamond.shape may contain a pattern of characters like this
  7348. # *
  7349. # ***
  7350. # *****
  7351. # ***
  7352. # *
  7353. # The specified columns and rows are ignored
  7354. # but the anchor point coordinates are not
  7355. ocv=dilate:0x0+2x2/custom=diamond.shape|2
  7356. @end example
  7357. @subsection erode
  7358. Erode an image by using a specific structuring element.
  7359. It corresponds to the libopencv function @code{cvErode}.
  7360. It accepts the parameters: @var{struct_el}:@var{nb_iterations},
  7361. with the same syntax and semantics as the @ref{dilate} filter.
  7362. @subsection smooth
  7363. Smooth the input video.
  7364. The filter takes the following parameters:
  7365. @var{type}|@var{param1}|@var{param2}|@var{param3}|@var{param4}.
  7366. @var{type} is the type of smooth filter to apply, and must be one of
  7367. the following values: "blur", "blur_no_scale", "median", "gaussian",
  7368. or "bilateral". The default value is "gaussian".
  7369. The meaning of @var{param1}, @var{param2}, @var{param3}, and @var{param4}
  7370. depend on the smooth type. @var{param1} and
  7371. @var{param2} accept integer positive values or 0. @var{param3} and
  7372. @var{param4} accept floating point values.
  7373. The default value for @var{param1} is 3. The default value for the
  7374. other parameters is 0.
  7375. These parameters correspond to the parameters assigned to the
  7376. libopencv function @code{cvSmooth}.
  7377. @anchor{overlay}
  7378. @section overlay
  7379. Overlay one video on top of another.
  7380. It takes two inputs and has one output. The first input is the "main"
  7381. video on which the second input is overlaid.
  7382. It accepts the following parameters:
  7383. A description of the accepted options follows.
  7384. @table @option
  7385. @item x
  7386. @item y
  7387. Set the expression for the x and y coordinates of the overlaid video
  7388. on the main video. Default value is "0" for both expressions. In case
  7389. the expression is invalid, it is set to a huge value (meaning that the
  7390. overlay will not be displayed within the output visible area).
  7391. @item eof_action
  7392. The action to take when EOF is encountered on the secondary input; it accepts
  7393. one of the following values:
  7394. @table @option
  7395. @item repeat
  7396. Repeat the last frame (the default).
  7397. @item endall
  7398. End both streams.
  7399. @item pass
  7400. Pass the main input through.
  7401. @end table
  7402. @item eval
  7403. Set when the expressions for @option{x}, and @option{y} are evaluated.
  7404. It accepts the following values:
  7405. @table @samp
  7406. @item init
  7407. only evaluate expressions once during the filter initialization or
  7408. when a command is processed
  7409. @item frame
  7410. evaluate expressions for each incoming frame
  7411. @end table
  7412. Default value is @samp{frame}.
  7413. @item shortest
  7414. If set to 1, force the output to terminate when the shortest input
  7415. terminates. Default value is 0.
  7416. @item format
  7417. Set the format for the output video.
  7418. It accepts the following values:
  7419. @table @samp
  7420. @item yuv420
  7421. force YUV420 output
  7422. @item yuv422
  7423. force YUV422 output
  7424. @item yuv444
  7425. force YUV444 output
  7426. @item rgb
  7427. force RGB output
  7428. @end table
  7429. Default value is @samp{yuv420}.
  7430. @item rgb @emph{(deprecated)}
  7431. If set to 1, force the filter to accept inputs in the RGB
  7432. color space. Default value is 0. This option is deprecated, use
  7433. @option{format} instead.
  7434. @item repeatlast
  7435. If set to 1, force the filter to draw the last overlay frame over the
  7436. main input until the end of the stream. A value of 0 disables this
  7437. behavior. Default value is 1.
  7438. @end table
  7439. The @option{x}, and @option{y} expressions can contain the following
  7440. parameters.
  7441. @table @option
  7442. @item main_w, W
  7443. @item main_h, H
  7444. The main input width and height.
  7445. @item overlay_w, w
  7446. @item overlay_h, h
  7447. The overlay input width and height.
  7448. @item x
  7449. @item y
  7450. The computed values for @var{x} and @var{y}. They are evaluated for
  7451. each new frame.
  7452. @item hsub
  7453. @item vsub
  7454. horizontal and vertical chroma subsample values of the output
  7455. format. For example for the pixel format "yuv422p" @var{hsub} is 2 and
  7456. @var{vsub} is 1.
  7457. @item n
  7458. the number of input frame, starting from 0
  7459. @item pos
  7460. the position in the file of the input frame, NAN if unknown
  7461. @item t
  7462. The timestamp, expressed in seconds. It's NAN if the input timestamp is unknown.
  7463. @end table
  7464. Note that the @var{n}, @var{pos}, @var{t} variables are available only
  7465. when evaluation is done @emph{per frame}, and will evaluate to NAN
  7466. when @option{eval} is set to @samp{init}.
  7467. Be aware that frames are taken from each input video in timestamp
  7468. order, hence, if their initial timestamps differ, it is a good idea
  7469. to pass the two inputs through a @var{setpts=PTS-STARTPTS} filter to
  7470. have them begin in the same zero timestamp, as the example for
  7471. the @var{movie} filter does.
  7472. You can chain together more overlays but you should test the
  7473. efficiency of such approach.
  7474. @subsection Commands
  7475. This filter supports the following commands:
  7476. @table @option
  7477. @item x
  7478. @item y
  7479. Modify the x and y of the overlay input.
  7480. The command accepts the same syntax of the corresponding option.
  7481. If the specified expression is not valid, it is kept at its current
  7482. value.
  7483. @end table
  7484. @subsection Examples
  7485. @itemize
  7486. @item
  7487. Draw the overlay at 10 pixels from the bottom right corner of the main
  7488. video:
  7489. @example
  7490. overlay=main_w-overlay_w-10:main_h-overlay_h-10
  7491. @end example
  7492. Using named options the example above becomes:
  7493. @example
  7494. overlay=x=main_w-overlay_w-10:y=main_h-overlay_h-10
  7495. @end example
  7496. @item
  7497. Insert a transparent PNG logo in the bottom left corner of the input,
  7498. using the @command{ffmpeg} tool with the @code{-filter_complex} option:
  7499. @example
  7500. ffmpeg -i input -i logo -filter_complex 'overlay=10:main_h-overlay_h-10' output
  7501. @end example
  7502. @item
  7503. Insert 2 different transparent PNG logos (second logo on bottom
  7504. right corner) using the @command{ffmpeg} tool:
  7505. @example
  7506. ffmpeg -i input -i logo1 -i logo2 -filter_complex 'overlay=x=10:y=H-h-10,overlay=x=W-w-10:y=H-h-10' output
  7507. @end example
  7508. @item
  7509. Add a transparent color layer on top of the main video; @code{WxH}
  7510. must specify the size of the main input to the overlay filter:
  7511. @example
  7512. color=color=red@@.3:size=WxH [over]; [in][over] overlay [out]
  7513. @end example
  7514. @item
  7515. Play an original video and a filtered version (here with the deshake
  7516. filter) side by side using the @command{ffplay} tool:
  7517. @example
  7518. ffplay input.avi -vf 'split[a][b]; [a]pad=iw*2:ih[src]; [b]deshake[filt]; [src][filt]overlay=w'
  7519. @end example
  7520. The above command is the same as:
  7521. @example
  7522. ffplay input.avi -vf 'split[b], pad=iw*2[src], [b]deshake, [src]overlay=w'
  7523. @end example
  7524. @item
  7525. Make a sliding overlay appearing from the left to the right top part of the
  7526. screen starting since time 2:
  7527. @example
  7528. overlay=x='if(gte(t,2), -w+(t-2)*20, NAN)':y=0
  7529. @end example
  7530. @item
  7531. Compose output by putting two input videos side to side:
  7532. @example
  7533. ffmpeg -i left.avi -i right.avi -filter_complex "
  7534. nullsrc=size=200x100 [background];
  7535. [0:v] setpts=PTS-STARTPTS, scale=100x100 [left];
  7536. [1:v] setpts=PTS-STARTPTS, scale=100x100 [right];
  7537. [background][left] overlay=shortest=1 [background+left];
  7538. [background+left][right] overlay=shortest=1:x=100 [left+right]
  7539. "
  7540. @end example
  7541. @item
  7542. Mask 10-20 seconds of a video by applying the delogo filter to a section
  7543. @example
  7544. ffmpeg -i test.avi -codec:v:0 wmv2 -ar 11025 -b:v 9000k
  7545. -vf '[in]split[split_main][split_delogo];[split_delogo]trim=start=360:end=371,delogo=0:0:640:480[delogoed];[split_main][delogoed]overlay=eof_action=pass[out]'
  7546. masked.avi
  7547. @end example
  7548. @item
  7549. Chain several overlays in cascade:
  7550. @example
  7551. nullsrc=s=200x200 [bg];
  7552. testsrc=s=100x100, split=4 [in0][in1][in2][in3];
  7553. [in0] lutrgb=r=0, [bg] overlay=0:0 [mid0];
  7554. [in1] lutrgb=g=0, [mid0] overlay=100:0 [mid1];
  7555. [in2] lutrgb=b=0, [mid1] overlay=0:100 [mid2];
  7556. [in3] null, [mid2] overlay=100:100 [out0]
  7557. @end example
  7558. @end itemize
  7559. @section owdenoise
  7560. Apply Overcomplete Wavelet denoiser.
  7561. The filter accepts the following options:
  7562. @table @option
  7563. @item depth
  7564. Set depth.
  7565. Larger depth values will denoise lower frequency components more, but
  7566. slow down filtering.
  7567. Must be an int in the range 8-16, default is @code{8}.
  7568. @item luma_strength, ls
  7569. Set luma strength.
  7570. Must be a double value in the range 0-1000, default is @code{1.0}.
  7571. @item chroma_strength, cs
  7572. Set chroma strength.
  7573. Must be a double value in the range 0-1000, default is @code{1.0}.
  7574. @end table
  7575. @anchor{pad}
  7576. @section pad
  7577. Add paddings to the input image, and place the original input at the
  7578. provided @var{x}, @var{y} coordinates.
  7579. It accepts the following parameters:
  7580. @table @option
  7581. @item width, w
  7582. @item height, h
  7583. Specify an expression for the size of the output image with the
  7584. paddings added. If the value for @var{width} or @var{height} is 0, the
  7585. corresponding input size is used for the output.
  7586. The @var{width} expression can reference the value set by the
  7587. @var{height} expression, and vice versa.
  7588. The default value of @var{width} and @var{height} is 0.
  7589. @item x
  7590. @item y
  7591. Specify the offsets to place the input image at within the padded area,
  7592. with respect to the top/left border of the output image.
  7593. The @var{x} expression can reference the value set by the @var{y}
  7594. expression, and vice versa.
  7595. The default value of @var{x} and @var{y} is 0.
  7596. @item color
  7597. Specify the color of the padded area. For the syntax of this option,
  7598. check the "Color" section in the ffmpeg-utils manual.
  7599. The default value of @var{color} is "black".
  7600. @end table
  7601. The value for the @var{width}, @var{height}, @var{x}, and @var{y}
  7602. options are expressions containing the following constants:
  7603. @table @option
  7604. @item in_w
  7605. @item in_h
  7606. The input video width and height.
  7607. @item iw
  7608. @item ih
  7609. These are the same as @var{in_w} and @var{in_h}.
  7610. @item out_w
  7611. @item out_h
  7612. The output width and height (the size of the padded area), as
  7613. specified by the @var{width} and @var{height} expressions.
  7614. @item ow
  7615. @item oh
  7616. These are the same as @var{out_w} and @var{out_h}.
  7617. @item x
  7618. @item y
  7619. The x and y offsets as specified by the @var{x} and @var{y}
  7620. expressions, or NAN if not yet specified.
  7621. @item a
  7622. same as @var{iw} / @var{ih}
  7623. @item sar
  7624. input sample aspect ratio
  7625. @item dar
  7626. input display aspect ratio, it is the same as (@var{iw} / @var{ih}) * @var{sar}
  7627. @item hsub
  7628. @item vsub
  7629. The horizontal and vertical chroma subsample values. For example for the
  7630. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  7631. @end table
  7632. @subsection Examples
  7633. @itemize
  7634. @item
  7635. Add paddings with the color "violet" to the input video. The output video
  7636. size is 640x480, and the top-left corner of the input video is placed at
  7637. column 0, row 40
  7638. @example
  7639. pad=640:480:0:40:violet
  7640. @end example
  7641. The example above is equivalent to the following command:
  7642. @example
  7643. pad=width=640:height=480:x=0:y=40:color=violet
  7644. @end example
  7645. @item
  7646. Pad the input to get an output with dimensions increased by 3/2,
  7647. and put the input video at the center of the padded area:
  7648. @example
  7649. pad="3/2*iw:3/2*ih:(ow-iw)/2:(oh-ih)/2"
  7650. @end example
  7651. @item
  7652. Pad the input to get a squared output with size equal to the maximum
  7653. value between the input width and height, and put the input video at
  7654. the center of the padded area:
  7655. @example
  7656. pad="max(iw\,ih):ow:(ow-iw)/2:(oh-ih)/2"
  7657. @end example
  7658. @item
  7659. Pad the input to get a final w/h ratio of 16:9:
  7660. @example
  7661. pad="ih*16/9:ih:(ow-iw)/2:(oh-ih)/2"
  7662. @end example
  7663. @item
  7664. In case of anamorphic video, in order to set the output display aspect
  7665. correctly, it is necessary to use @var{sar} in the expression,
  7666. according to the relation:
  7667. @example
  7668. (ih * X / ih) * sar = output_dar
  7669. X = output_dar / sar
  7670. @end example
  7671. Thus the previous example needs to be modified to:
  7672. @example
  7673. pad="ih*16/9/sar:ih:(ow-iw)/2:(oh-ih)/2"
  7674. @end example
  7675. @item
  7676. Double the output size and put the input video in the bottom-right
  7677. corner of the output padded area:
  7678. @example
  7679. pad="2*iw:2*ih:ow-iw:oh-ih"
  7680. @end example
  7681. @end itemize
  7682. @anchor{palettegen}
  7683. @section palettegen
  7684. Generate one palette for a whole video stream.
  7685. It accepts the following options:
  7686. @table @option
  7687. @item max_colors
  7688. Set the maximum number of colors to quantize in the palette.
  7689. Note: the palette will still contain 256 colors; the unused palette entries
  7690. will be black.
  7691. @item reserve_transparent
  7692. Create a palette of 255 colors maximum and reserve the last one for
  7693. transparency. Reserving the transparency color is useful for GIF optimization.
  7694. If not set, the maximum of colors in the palette will be 256. You probably want
  7695. to disable this option for a standalone image.
  7696. Set by default.
  7697. @item stats_mode
  7698. Set statistics mode.
  7699. It accepts the following values:
  7700. @table @samp
  7701. @item full
  7702. Compute full frame histograms.
  7703. @item diff
  7704. Compute histograms only for the part that differs from previous frame. This
  7705. might be relevant to give more importance to the moving part of your input if
  7706. the background is static.
  7707. @end table
  7708. Default value is @var{full}.
  7709. @end table
  7710. The filter also exports the frame metadata @code{lavfi.color_quant_ratio}
  7711. (@code{nb_color_in / nb_color_out}) which you can use to evaluate the degree of
  7712. color quantization of the palette. This information is also visible at
  7713. @var{info} logging level.
  7714. @subsection Examples
  7715. @itemize
  7716. @item
  7717. Generate a representative palette of a given video using @command{ffmpeg}:
  7718. @example
  7719. ffmpeg -i input.mkv -vf palettegen palette.png
  7720. @end example
  7721. @end itemize
  7722. @section paletteuse
  7723. Use a palette to downsample an input video stream.
  7724. The filter takes two inputs: one video stream and a palette. The palette must
  7725. be a 256 pixels image.
  7726. It accepts the following options:
  7727. @table @option
  7728. @item dither
  7729. Select dithering mode. Available algorithms are:
  7730. @table @samp
  7731. @item bayer
  7732. Ordered 8x8 bayer dithering (deterministic)
  7733. @item heckbert
  7734. Dithering as defined by Paul Heckbert in 1982 (simple error diffusion).
  7735. Note: this dithering is sometimes considered "wrong" and is included as a
  7736. reference.
  7737. @item floyd_steinberg
  7738. Floyd and Steingberg dithering (error diffusion)
  7739. @item sierra2
  7740. Frankie Sierra dithering v2 (error diffusion)
  7741. @item sierra2_4a
  7742. Frankie Sierra dithering v2 "Lite" (error diffusion)
  7743. @end table
  7744. Default is @var{sierra2_4a}.
  7745. @item bayer_scale
  7746. When @var{bayer} dithering is selected, this option defines the scale of the
  7747. pattern (how much the crosshatch pattern is visible). A low value means more
  7748. visible pattern for less banding, and higher value means less visible pattern
  7749. at the cost of more banding.
  7750. The option must be an integer value in the range [0,5]. Default is @var{2}.
  7751. @item diff_mode
  7752. If set, define the zone to process
  7753. @table @samp
  7754. @item rectangle
  7755. Only the changing rectangle will be reprocessed. This is similar to GIF
  7756. cropping/offsetting compression mechanism. This option can be useful for speed
  7757. if only a part of the image is changing, and has use cases such as limiting the
  7758. scope of the error diffusal @option{dither} to the rectangle that bounds the
  7759. moving scene (it leads to more deterministic output if the scene doesn't change
  7760. much, and as a result less moving noise and better GIF compression).
  7761. @end table
  7762. Default is @var{none}.
  7763. @end table
  7764. @subsection Examples
  7765. @itemize
  7766. @item
  7767. Use a palette (generated for example with @ref{palettegen}) to encode a GIF
  7768. using @command{ffmpeg}:
  7769. @example
  7770. ffmpeg -i input.mkv -i palette.png -lavfi paletteuse output.gif
  7771. @end example
  7772. @end itemize
  7773. @section perspective
  7774. Correct perspective of video not recorded perpendicular to the screen.
  7775. A description of the accepted parameters follows.
  7776. @table @option
  7777. @item x0
  7778. @item y0
  7779. @item x1
  7780. @item y1
  7781. @item x2
  7782. @item y2
  7783. @item x3
  7784. @item y3
  7785. Set coordinates expression for top left, top right, bottom left and bottom right corners.
  7786. Default values are @code{0:0:W:0:0:H:W:H} with which perspective will remain unchanged.
  7787. If the @code{sense} option is set to @code{source}, then the specified points will be sent
  7788. to the corners of the destination. If the @code{sense} option is set to @code{destination},
  7789. then the corners of the source will be sent to the specified coordinates.
  7790. The expressions can use the following variables:
  7791. @table @option
  7792. @item W
  7793. @item H
  7794. the width and height of video frame.
  7795. @end table
  7796. @item interpolation
  7797. Set interpolation for perspective correction.
  7798. It accepts the following values:
  7799. @table @samp
  7800. @item linear
  7801. @item cubic
  7802. @end table
  7803. Default value is @samp{linear}.
  7804. @item sense
  7805. Set interpretation of coordinate options.
  7806. It accepts the following values:
  7807. @table @samp
  7808. @item 0, source
  7809. Send point in the source specified by the given coordinates to
  7810. the corners of the destination.
  7811. @item 1, destination
  7812. Send the corners of the source to the point in the destination specified
  7813. by the given coordinates.
  7814. Default value is @samp{source}.
  7815. @end table
  7816. @end table
  7817. @section phase
  7818. Delay interlaced video by one field time so that the field order changes.
  7819. The intended use is to fix PAL movies that have been captured with the
  7820. opposite field order to the film-to-video transfer.
  7821. A description of the accepted parameters follows.
  7822. @table @option
  7823. @item mode
  7824. Set phase mode.
  7825. It accepts the following values:
  7826. @table @samp
  7827. @item t
  7828. Capture field order top-first, transfer bottom-first.
  7829. Filter will delay the bottom field.
  7830. @item b
  7831. Capture field order bottom-first, transfer top-first.
  7832. Filter will delay the top field.
  7833. @item p
  7834. Capture and transfer with the same field order. This mode only exists
  7835. for the documentation of the other options to refer to, but if you
  7836. actually select it, the filter will faithfully do nothing.
  7837. @item a
  7838. Capture field order determined automatically by field flags, transfer
  7839. opposite.
  7840. Filter selects among @samp{t} and @samp{b} modes on a frame by frame
  7841. basis using field flags. If no field information is available,
  7842. then this works just like @samp{u}.
  7843. @item u
  7844. Capture unknown or varying, transfer opposite.
  7845. Filter selects among @samp{t} and @samp{b} on a frame by frame basis by
  7846. analyzing the images and selecting the alternative that produces best
  7847. match between the fields.
  7848. @item T
  7849. Capture top-first, transfer unknown or varying.
  7850. Filter selects among @samp{t} and @samp{p} using image analysis.
  7851. @item B
  7852. Capture bottom-first, transfer unknown or varying.
  7853. Filter selects among @samp{b} and @samp{p} using image analysis.
  7854. @item A
  7855. Capture determined by field flags, transfer unknown or varying.
  7856. Filter selects among @samp{t}, @samp{b} and @samp{p} using field flags and
  7857. image analysis. If no field information is available, then this works just
  7858. like @samp{U}. This is the default mode.
  7859. @item U
  7860. Both capture and transfer unknown or varying.
  7861. Filter selects among @samp{t}, @samp{b} and @samp{p} using image analysis only.
  7862. @end table
  7863. @end table
  7864. @section pixdesctest
  7865. Pixel format descriptor test filter, mainly useful for internal
  7866. testing. The output video should be equal to the input video.
  7867. For example:
  7868. @example
  7869. format=monow, pixdesctest
  7870. @end example
  7871. can be used to test the monowhite pixel format descriptor definition.
  7872. @section pp
  7873. Enable the specified chain of postprocessing subfilters using libpostproc. This
  7874. library should be automatically selected with a GPL build (@code{--enable-gpl}).
  7875. Subfilters must be separated by '/' and can be disabled by prepending a '-'.
  7876. Each subfilter and some options have a short and a long name that can be used
  7877. interchangeably, i.e. dr/dering are the same.
  7878. The filters accept the following options:
  7879. @table @option
  7880. @item subfilters
  7881. Set postprocessing subfilters string.
  7882. @end table
  7883. All subfilters share common options to determine their scope:
  7884. @table @option
  7885. @item a/autoq
  7886. Honor the quality commands for this subfilter.
  7887. @item c/chrom
  7888. Do chrominance filtering, too (default).
  7889. @item y/nochrom
  7890. Do luminance filtering only (no chrominance).
  7891. @item n/noluma
  7892. Do chrominance filtering only (no luminance).
  7893. @end table
  7894. These options can be appended after the subfilter name, separated by a '|'.
  7895. Available subfilters are:
  7896. @table @option
  7897. @item hb/hdeblock[|difference[|flatness]]
  7898. Horizontal deblocking filter
  7899. @table @option
  7900. @item difference
  7901. Difference factor where higher values mean more deblocking (default: @code{32}).
  7902. @item flatness
  7903. Flatness threshold where lower values mean more deblocking (default: @code{39}).
  7904. @end table
  7905. @item vb/vdeblock[|difference[|flatness]]
  7906. Vertical deblocking filter
  7907. @table @option
  7908. @item difference
  7909. Difference factor where higher values mean more deblocking (default: @code{32}).
  7910. @item flatness
  7911. Flatness threshold where lower values mean more deblocking (default: @code{39}).
  7912. @end table
  7913. @item ha/hadeblock[|difference[|flatness]]
  7914. Accurate horizontal deblocking filter
  7915. @table @option
  7916. @item difference
  7917. Difference factor where higher values mean more deblocking (default: @code{32}).
  7918. @item flatness
  7919. Flatness threshold where lower values mean more deblocking (default: @code{39}).
  7920. @end table
  7921. @item va/vadeblock[|difference[|flatness]]
  7922. Accurate vertical deblocking filter
  7923. @table @option
  7924. @item difference
  7925. Difference factor where higher values mean more deblocking (default: @code{32}).
  7926. @item flatness
  7927. Flatness threshold where lower values mean more deblocking (default: @code{39}).
  7928. @end table
  7929. @end table
  7930. The horizontal and vertical deblocking filters share the difference and
  7931. flatness values so you cannot set different horizontal and vertical
  7932. thresholds.
  7933. @table @option
  7934. @item h1/x1hdeblock
  7935. Experimental horizontal deblocking filter
  7936. @item v1/x1vdeblock
  7937. Experimental vertical deblocking filter
  7938. @item dr/dering
  7939. Deringing filter
  7940. @item tn/tmpnoise[|threshold1[|threshold2[|threshold3]]], temporal noise reducer
  7941. @table @option
  7942. @item threshold1
  7943. larger -> stronger filtering
  7944. @item threshold2
  7945. larger -> stronger filtering
  7946. @item threshold3
  7947. larger -> stronger filtering
  7948. @end table
  7949. @item al/autolevels[:f/fullyrange], automatic brightness / contrast correction
  7950. @table @option
  7951. @item f/fullyrange
  7952. Stretch luminance to @code{0-255}.
  7953. @end table
  7954. @item lb/linblenddeint
  7955. Linear blend deinterlacing filter that deinterlaces the given block by
  7956. filtering all lines with a @code{(1 2 1)} filter.
  7957. @item li/linipoldeint
  7958. Linear interpolating deinterlacing filter that deinterlaces the given block by
  7959. linearly interpolating every second line.
  7960. @item ci/cubicipoldeint
  7961. Cubic interpolating deinterlacing filter deinterlaces the given block by
  7962. cubically interpolating every second line.
  7963. @item md/mediandeint
  7964. Median deinterlacing filter that deinterlaces the given block by applying a
  7965. median filter to every second line.
  7966. @item fd/ffmpegdeint
  7967. FFmpeg deinterlacing filter that deinterlaces the given block by filtering every
  7968. second line with a @code{(-1 4 2 4 -1)} filter.
  7969. @item l5/lowpass5
  7970. Vertically applied FIR lowpass deinterlacing filter that deinterlaces the given
  7971. block by filtering all lines with a @code{(-1 2 6 2 -1)} filter.
  7972. @item fq/forceQuant[|quantizer]
  7973. Overrides the quantizer table from the input with the constant quantizer you
  7974. specify.
  7975. @table @option
  7976. @item quantizer
  7977. Quantizer to use
  7978. @end table
  7979. @item de/default
  7980. Default pp filter combination (@code{hb|a,vb|a,dr|a})
  7981. @item fa/fast
  7982. Fast pp filter combination (@code{h1|a,v1|a,dr|a})
  7983. @item ac
  7984. High quality pp filter combination (@code{ha|a|128|7,va|a,dr|a})
  7985. @end table
  7986. @subsection Examples
  7987. @itemize
  7988. @item
  7989. Apply horizontal and vertical deblocking, deringing and automatic
  7990. brightness/contrast:
  7991. @example
  7992. pp=hb/vb/dr/al
  7993. @end example
  7994. @item
  7995. Apply default filters without brightness/contrast correction:
  7996. @example
  7997. pp=de/-al
  7998. @end example
  7999. @item
  8000. Apply default filters and temporal denoiser:
  8001. @example
  8002. pp=default/tmpnoise|1|2|3
  8003. @end example
  8004. @item
  8005. Apply deblocking on luminance only, and switch vertical deblocking on or off
  8006. automatically depending on available CPU time:
  8007. @example
  8008. pp=hb|y/vb|a
  8009. @end example
  8010. @end itemize
  8011. @section pp7
  8012. Apply Postprocessing filter 7. It is variant of the @ref{spp} filter,
  8013. similar to spp = 6 with 7 point DCT, where only the center sample is
  8014. used after IDCT.
  8015. The filter accepts the following options:
  8016. @table @option
  8017. @item qp
  8018. Force a constant quantization parameter. It accepts an integer in range
  8019. 0 to 63. If not set, the filter will use the QP from the video stream
  8020. (if available).
  8021. @item mode
  8022. Set thresholding mode. Available modes are:
  8023. @table @samp
  8024. @item hard
  8025. Set hard thresholding.
  8026. @item soft
  8027. Set soft thresholding (better de-ringing effect, but likely blurrier).
  8028. @item medium
  8029. Set medium thresholding (good results, default).
  8030. @end table
  8031. @end table
  8032. @section psnr
  8033. Obtain the average, maximum and minimum PSNR (Peak Signal to Noise
  8034. Ratio) between two input videos.
  8035. This filter takes in input two input videos, the first input is
  8036. considered the "main" source and is passed unchanged to the
  8037. output. The second input is used as a "reference" video for computing
  8038. the PSNR.
  8039. Both video inputs must have the same resolution and pixel format for
  8040. this filter to work correctly. Also it assumes that both inputs
  8041. have the same number of frames, which are compared one by one.
  8042. The obtained average PSNR is printed through the logging system.
  8043. The filter stores the accumulated MSE (mean squared error) of each
  8044. frame, and at the end of the processing it is averaged across all frames
  8045. equally, and the following formula is applied to obtain the PSNR:
  8046. @example
  8047. PSNR = 10*log10(MAX^2/MSE)
  8048. @end example
  8049. Where MAX is the average of the maximum values of each component of the
  8050. image.
  8051. The description of the accepted parameters follows.
  8052. @table @option
  8053. @item stats_file, f
  8054. If specified the filter will use the named file to save the PSNR of
  8055. each individual frame. When filename equals "-" the data is sent to
  8056. standard output.
  8057. @end table
  8058. The file printed if @var{stats_file} is selected, contains a sequence of
  8059. key/value pairs of the form @var{key}:@var{value} for each compared
  8060. couple of frames.
  8061. A description of each shown parameter follows:
  8062. @table @option
  8063. @item n
  8064. sequential number of the input frame, starting from 1
  8065. @item mse_avg
  8066. Mean Square Error pixel-by-pixel average difference of the compared
  8067. frames, averaged over all the image components.
  8068. @item mse_y, mse_u, mse_v, mse_r, mse_g, mse_g, mse_a
  8069. Mean Square Error pixel-by-pixel average difference of the compared
  8070. frames for the component specified by the suffix.
  8071. @item psnr_y, psnr_u, psnr_v, psnr_r, psnr_g, psnr_b, psnr_a
  8072. Peak Signal to Noise ratio of the compared frames for the component
  8073. specified by the suffix.
  8074. @end table
  8075. For example:
  8076. @example
  8077. movie=ref_movie.mpg, setpts=PTS-STARTPTS [main];
  8078. [main][ref] psnr="stats_file=stats.log" [out]
  8079. @end example
  8080. On this example the input file being processed is compared with the
  8081. reference file @file{ref_movie.mpg}. The PSNR of each individual frame
  8082. is stored in @file{stats.log}.
  8083. @anchor{pullup}
  8084. @section pullup
  8085. Pulldown reversal (inverse telecine) filter, capable of handling mixed
  8086. hard-telecine, 24000/1001 fps progressive, and 30000/1001 fps progressive
  8087. content.
  8088. The pullup filter is designed to take advantage of future context in making
  8089. its decisions. This filter is stateless in the sense that it does not lock
  8090. onto a pattern to follow, but it instead looks forward to the following
  8091. fields in order to identify matches and rebuild progressive frames.
  8092. To produce content with an even framerate, insert the fps filter after
  8093. pullup, use @code{fps=24000/1001} if the input frame rate is 29.97fps,
  8094. @code{fps=24} for 30fps and the (rare) telecined 25fps input.
  8095. The filter accepts the following options:
  8096. @table @option
  8097. @item jl
  8098. @item jr
  8099. @item jt
  8100. @item jb
  8101. These options set the amount of "junk" to ignore at the left, right, top, and
  8102. bottom of the image, respectively. Left and right are in units of 8 pixels,
  8103. while top and bottom are in units of 2 lines.
  8104. The default is 8 pixels on each side.
  8105. @item sb
  8106. Set the strict breaks. Setting this option to 1 will reduce the chances of
  8107. filter generating an occasional mismatched frame, but it may also cause an
  8108. excessive number of frames to be dropped during high motion sequences.
  8109. Conversely, setting it to -1 will make filter match fields more easily.
  8110. This may help processing of video where there is slight blurring between
  8111. the fields, but may also cause there to be interlaced frames in the output.
  8112. Default value is @code{0}.
  8113. @item mp
  8114. Set the metric plane to use. It accepts the following values:
  8115. @table @samp
  8116. @item l
  8117. Use luma plane.
  8118. @item u
  8119. Use chroma blue plane.
  8120. @item v
  8121. Use chroma red plane.
  8122. @end table
  8123. This option may be set to use chroma plane instead of the default luma plane
  8124. for doing filter's computations. This may improve accuracy on very clean
  8125. source material, but more likely will decrease accuracy, especially if there
  8126. is chroma noise (rainbow effect) or any grayscale video.
  8127. The main purpose of setting @option{mp} to a chroma plane is to reduce CPU
  8128. load and make pullup usable in realtime on slow machines.
  8129. @end table
  8130. For best results (without duplicated frames in the output file) it is
  8131. necessary to change the output frame rate. For example, to inverse
  8132. telecine NTSC input:
  8133. @example
  8134. ffmpeg -i input -vf pullup -r 24000/1001 ...
  8135. @end example
  8136. @section qp
  8137. Change video quantization parameters (QP).
  8138. The filter accepts the following option:
  8139. @table @option
  8140. @item qp
  8141. Set expression for quantization parameter.
  8142. @end table
  8143. The expression is evaluated through the eval API and can contain, among others,
  8144. the following constants:
  8145. @table @var
  8146. @item known
  8147. 1 if index is not 129, 0 otherwise.
  8148. @item qp
  8149. Sequentional index starting from -129 to 128.
  8150. @end table
  8151. @subsection Examples
  8152. @itemize
  8153. @item
  8154. Some equation like:
  8155. @example
  8156. qp=2+2*sin(PI*qp)
  8157. @end example
  8158. @end itemize
  8159. @section random
  8160. Flush video frames from internal cache of frames into a random order.
  8161. No frame is discarded.
  8162. Inspired by @ref{frei0r} nervous filter.
  8163. @table @option
  8164. @item frames
  8165. Set size in number of frames of internal cache, in range from @code{2} to
  8166. @code{512}. Default is @code{30}.
  8167. @item seed
  8168. Set seed for random number generator, must be an integer included between
  8169. @code{0} and @code{UINT32_MAX}. If not specified, or if explicitly set to
  8170. less than @code{0}, the filter will try to use a good random seed on a
  8171. best effort basis.
  8172. @end table
  8173. @section readvitc
  8174. Read vertical interval timecode (VITC) information from the top lines of a
  8175. video frame.
  8176. The filter adds frame metadata key @code{lavfi.readvitc.tc_str} with the
  8177. timecode value, if a valid timecode has been detected. Further metadata key
  8178. @code{lavfi.readvitc.found} is set to 0/1 depending on whether
  8179. timecode data has been found or not.
  8180. This filter accepts the following options:
  8181. @table @option
  8182. @item scan_max
  8183. Set the maximum number of lines to scan for VITC data. If the value is set to
  8184. @code{-1} the full video frame is scanned. Default is @code{45}.
  8185. @item thr_b
  8186. Set the luma threshold for black. Accepts float numbers in the range [0.0,1.0],
  8187. default value is @code{0.2}. The value must be equal or less than @code{thr_w}.
  8188. @item thr_w
  8189. Set the luma threshold for white. Accepts float numbers in the range [0.0,1.0],
  8190. default value is @code{0.6}. The value must be equal or greater than @code{thr_b}.
  8191. @end table
  8192. @subsection Examples
  8193. @itemize
  8194. @item
  8195. Detect and draw VITC data onto the video frame; if no valid VITC is detected,
  8196. draw @code{--:--:--:--} as a placeholder:
  8197. @example
  8198. ffmpeg -i input.avi -filter:v 'readvitc,drawtext=fontfile=FreeMono.ttf:text=%@{metadata\\:lavfi.readvitc.tc_str\\:--\\\\\\:--\\\\\\:--\\\\\\:--@}:x=(w-tw)/2:y=400-ascent'
  8199. @end example
  8200. @end itemize
  8201. @section remap
  8202. Remap pixels using 2nd: Xmap and 3rd: Ymap input video stream.
  8203. Destination pixel at position (X, Y) will be picked from source (x, y) position
  8204. where x = Xmap(X, Y) and y = Ymap(X, Y). If mapping values are out of range, zero
  8205. value for pixel will be used for destination pixel.
  8206. Xmap and Ymap input video streams must be of same dimensions. Output video stream
  8207. will have Xmap/Ymap video stream dimensions.
  8208. Xmap and Ymap input video streams are 16bit depth, single channel.
  8209. @section removegrain
  8210. The removegrain filter is a spatial denoiser for progressive video.
  8211. @table @option
  8212. @item m0
  8213. Set mode for the first plane.
  8214. @item m1
  8215. Set mode for the second plane.
  8216. @item m2
  8217. Set mode for the third plane.
  8218. @item m3
  8219. Set mode for the fourth plane.
  8220. @end table
  8221. Range of mode is from 0 to 24. Description of each mode follows:
  8222. @table @var
  8223. @item 0
  8224. Leave input plane unchanged. Default.
  8225. @item 1
  8226. Clips the pixel with the minimum and maximum of the 8 neighbour pixels.
  8227. @item 2
  8228. Clips the pixel with the second minimum and maximum of the 8 neighbour pixels.
  8229. @item 3
  8230. Clips the pixel with the third minimum and maximum of the 8 neighbour pixels.
  8231. @item 4
  8232. Clips the pixel with the fourth minimum and maximum of the 8 neighbour pixels.
  8233. This is equivalent to a median filter.
  8234. @item 5
  8235. Line-sensitive clipping giving the minimal change.
  8236. @item 6
  8237. Line-sensitive clipping, intermediate.
  8238. @item 7
  8239. Line-sensitive clipping, intermediate.
  8240. @item 8
  8241. Line-sensitive clipping, intermediate.
  8242. @item 9
  8243. Line-sensitive clipping on a line where the neighbours pixels are the closest.
  8244. @item 10
  8245. Replaces the target pixel with the closest neighbour.
  8246. @item 11
  8247. [1 2 1] horizontal and vertical kernel blur.
  8248. @item 12
  8249. Same as mode 11.
  8250. @item 13
  8251. Bob mode, interpolates top field from the line where the neighbours
  8252. pixels are the closest.
  8253. @item 14
  8254. Bob mode, interpolates bottom field from the line where the neighbours
  8255. pixels are the closest.
  8256. @item 15
  8257. Bob mode, interpolates top field. Same as 13 but with a more complicated
  8258. interpolation formula.
  8259. @item 16
  8260. Bob mode, interpolates bottom field. Same as 14 but with a more complicated
  8261. interpolation formula.
  8262. @item 17
  8263. Clips the pixel with the minimum and maximum of respectively the maximum and
  8264. minimum of each pair of opposite neighbour pixels.
  8265. @item 18
  8266. Line-sensitive clipping using opposite neighbours whose greatest distance from
  8267. the current pixel is minimal.
  8268. @item 19
  8269. Replaces the pixel with the average of its 8 neighbours.
  8270. @item 20
  8271. Averages the 9 pixels ([1 1 1] horizontal and vertical blur).
  8272. @item 21
  8273. Clips pixels using the averages of opposite neighbour.
  8274. @item 22
  8275. Same as mode 21 but simpler and faster.
  8276. @item 23
  8277. Small edge and halo removal, but reputed useless.
  8278. @item 24
  8279. Similar as 23.
  8280. @end table
  8281. @section removelogo
  8282. Suppress a TV station logo, using an image file to determine which
  8283. pixels comprise the logo. It works by filling in the pixels that
  8284. comprise the logo with neighboring pixels.
  8285. The filter accepts the following options:
  8286. @table @option
  8287. @item filename, f
  8288. Set the filter bitmap file, which can be any image format supported by
  8289. libavformat. The width and height of the image file must match those of the
  8290. video stream being processed.
  8291. @end table
  8292. Pixels in the provided bitmap image with a value of zero are not
  8293. considered part of the logo, non-zero pixels are considered part of
  8294. the logo. If you use white (255) for the logo and black (0) for the
  8295. rest, you will be safe. For making the filter bitmap, it is
  8296. recommended to take a screen capture of a black frame with the logo
  8297. visible, and then using a threshold filter followed by the erode
  8298. filter once or twice.
  8299. If needed, little splotches can be fixed manually. Remember that if
  8300. logo pixels are not covered, the filter quality will be much
  8301. reduced. Marking too many pixels as part of the logo does not hurt as
  8302. much, but it will increase the amount of blurring needed to cover over
  8303. the image and will destroy more information than necessary, and extra
  8304. pixels will slow things down on a large logo.
  8305. @section repeatfields
  8306. This filter uses the repeat_field flag from the Video ES headers and hard repeats
  8307. fields based on its value.
  8308. @section reverse, areverse
  8309. Reverse a clip.
  8310. Warning: This filter requires memory to buffer the entire clip, so trimming
  8311. is suggested.
  8312. @subsection Examples
  8313. @itemize
  8314. @item
  8315. Take the first 5 seconds of a clip, and reverse it.
  8316. @example
  8317. trim=end=5,reverse
  8318. @end example
  8319. @end itemize
  8320. @section rotate
  8321. Rotate video by an arbitrary angle expressed in radians.
  8322. The filter accepts the following options:
  8323. A description of the optional parameters follows.
  8324. @table @option
  8325. @item angle, a
  8326. Set an expression for the angle by which to rotate the input video
  8327. clockwise, expressed as a number of radians. A negative value will
  8328. result in a counter-clockwise rotation. By default it is set to "0".
  8329. This expression is evaluated for each frame.
  8330. @item out_w, ow
  8331. Set the output width expression, default value is "iw".
  8332. This expression is evaluated just once during configuration.
  8333. @item out_h, oh
  8334. Set the output height expression, default value is "ih".
  8335. This expression is evaluated just once during configuration.
  8336. @item bilinear
  8337. Enable bilinear interpolation if set to 1, a value of 0 disables
  8338. it. Default value is 1.
  8339. @item fillcolor, c
  8340. Set the color used to fill the output area not covered by the rotated
  8341. image. For the general syntax of this option, check the "Color" section in the
  8342. ffmpeg-utils manual. If the special value "none" is selected then no
  8343. background is printed (useful for example if the background is never shown).
  8344. Default value is "black".
  8345. @end table
  8346. The expressions for the angle and the output size can contain the
  8347. following constants and functions:
  8348. @table @option
  8349. @item n
  8350. sequential number of the input frame, starting from 0. It is always NAN
  8351. before the first frame is filtered.
  8352. @item t
  8353. time in seconds of the input frame, it is set to 0 when the filter is
  8354. configured. It is always NAN before the first frame is filtered.
  8355. @item hsub
  8356. @item vsub
  8357. horizontal and vertical chroma subsample values. For example for the
  8358. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  8359. @item in_w, iw
  8360. @item in_h, ih
  8361. the input video width and height
  8362. @item out_w, ow
  8363. @item out_h, oh
  8364. the output width and height, that is the size of the padded area as
  8365. specified by the @var{width} and @var{height} expressions
  8366. @item rotw(a)
  8367. @item roth(a)
  8368. the minimal width/height required for completely containing the input
  8369. video rotated by @var{a} radians.
  8370. These are only available when computing the @option{out_w} and
  8371. @option{out_h} expressions.
  8372. @end table
  8373. @subsection Examples
  8374. @itemize
  8375. @item
  8376. Rotate the input by PI/6 radians clockwise:
  8377. @example
  8378. rotate=PI/6
  8379. @end example
  8380. @item
  8381. Rotate the input by PI/6 radians counter-clockwise:
  8382. @example
  8383. rotate=-PI/6
  8384. @end example
  8385. @item
  8386. Rotate the input by 45 degrees clockwise:
  8387. @example
  8388. rotate=45*PI/180
  8389. @end example
  8390. @item
  8391. Apply a constant rotation with period T, starting from an angle of PI/3:
  8392. @example
  8393. rotate=PI/3+2*PI*t/T
  8394. @end example
  8395. @item
  8396. Make the input video rotation oscillating with a period of T
  8397. seconds and an amplitude of A radians:
  8398. @example
  8399. rotate=A*sin(2*PI/T*t)
  8400. @end example
  8401. @item
  8402. Rotate the video, output size is chosen so that the whole rotating
  8403. input video is always completely contained in the output:
  8404. @example
  8405. rotate='2*PI*t:ow=hypot(iw,ih):oh=ow'
  8406. @end example
  8407. @item
  8408. Rotate the video, reduce the output size so that no background is ever
  8409. shown:
  8410. @example
  8411. rotate=2*PI*t:ow='min(iw,ih)/sqrt(2)':oh=ow:c=none
  8412. @end example
  8413. @end itemize
  8414. @subsection Commands
  8415. The filter supports the following commands:
  8416. @table @option
  8417. @item a, angle
  8418. Set the angle expression.
  8419. The command accepts the same syntax of the corresponding option.
  8420. If the specified expression is not valid, it is kept at its current
  8421. value.
  8422. @end table
  8423. @section sab
  8424. Apply Shape Adaptive Blur.
  8425. The filter accepts the following options:
  8426. @table @option
  8427. @item luma_radius, lr
  8428. Set luma blur filter strength, must be a value in range 0.1-4.0, default
  8429. value is 1.0. A greater value will result in a more blurred image, and
  8430. in slower processing.
  8431. @item luma_pre_filter_radius, lpfr
  8432. Set luma pre-filter radius, must be a value in the 0.1-2.0 range, default
  8433. value is 1.0.
  8434. @item luma_strength, ls
  8435. Set luma maximum difference between pixels to still be considered, must
  8436. be a value in the 0.1-100.0 range, default value is 1.0.
  8437. @item chroma_radius, cr
  8438. Set chroma blur filter strength, must be a value in range 0.1-4.0. A
  8439. greater value will result in a more blurred image, and in slower
  8440. processing.
  8441. @item chroma_pre_filter_radius, cpfr
  8442. Set chroma pre-filter radius, must be a value in the 0.1-2.0 range.
  8443. @item chroma_strength, cs
  8444. Set chroma maximum difference between pixels to still be considered,
  8445. must be a value in the 0.1-100.0 range.
  8446. @end table
  8447. Each chroma option value, if not explicitly specified, is set to the
  8448. corresponding luma option value.
  8449. @anchor{scale}
  8450. @section scale
  8451. Scale (resize) the input video, using the libswscale library.
  8452. The scale filter forces the output display aspect ratio to be the same
  8453. of the input, by changing the output sample aspect ratio.
  8454. If the input image format is different from the format requested by
  8455. the next filter, the scale filter will convert the input to the
  8456. requested format.
  8457. @subsection Options
  8458. The filter accepts the following options, or any of the options
  8459. supported by the libswscale scaler.
  8460. See @ref{scaler_options,,the ffmpeg-scaler manual,ffmpeg-scaler} for
  8461. the complete list of scaler options.
  8462. @table @option
  8463. @item width, w
  8464. @item height, h
  8465. Set the output video dimension expression. Default value is the input
  8466. dimension.
  8467. If the value is 0, the input width is used for the output.
  8468. If one of the values is -1, the scale filter will use a value that
  8469. maintains the aspect ratio of the input image, calculated from the
  8470. other specified dimension. If both of them are -1, the input size is
  8471. used
  8472. If one of the values is -n with n > 1, the scale filter will also use a value
  8473. that maintains the aspect ratio of the input image, calculated from the other
  8474. specified dimension. After that it will, however, make sure that the calculated
  8475. dimension is divisible by n and adjust the value if necessary.
  8476. See below for the list of accepted constants for use in the dimension
  8477. expression.
  8478. @item eval
  8479. Specify when to evaluate @var{width} and @var{height} expression. It accepts the following values:
  8480. @table @samp
  8481. @item init
  8482. Only evaluate expressions once during the filter initialization or when a command is processed.
  8483. @item frame
  8484. Evaluate expressions for each incoming frame.
  8485. @end table
  8486. Default value is @samp{init}.
  8487. @item interl
  8488. Set the interlacing mode. It accepts the following values:
  8489. @table @samp
  8490. @item 1
  8491. Force interlaced aware scaling.
  8492. @item 0
  8493. Do not apply interlaced scaling.
  8494. @item -1
  8495. Select interlaced aware scaling depending on whether the source frames
  8496. are flagged as interlaced or not.
  8497. @end table
  8498. Default value is @samp{0}.
  8499. @item flags
  8500. Set libswscale scaling flags. See
  8501. @ref{sws_flags,,the ffmpeg-scaler manual,ffmpeg-scaler} for the
  8502. complete list of values. If not explicitly specified the filter applies
  8503. the default flags.
  8504. @item param0, param1
  8505. Set libswscale input parameters for scaling algorithms that need them. See
  8506. @ref{sws_params,,the ffmpeg-scaler manual,ffmpeg-scaler} for the
  8507. complete documentation. If not explicitly specified the filter applies
  8508. empty parameters.
  8509. @item size, s
  8510. Set the video size. For the syntax of this option, check the
  8511. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  8512. @item in_color_matrix
  8513. @item out_color_matrix
  8514. Set in/output YCbCr color space type.
  8515. This allows the autodetected value to be overridden as well as allows forcing
  8516. a specific value used for the output and encoder.
  8517. If not specified, the color space type depends on the pixel format.
  8518. Possible values:
  8519. @table @samp
  8520. @item auto
  8521. Choose automatically.
  8522. @item bt709
  8523. Format conforming to International Telecommunication Union (ITU)
  8524. Recommendation BT.709.
  8525. @item fcc
  8526. Set color space conforming to the United States Federal Communications
  8527. Commission (FCC) Code of Federal Regulations (CFR) Title 47 (2003) 73.682 (a).
  8528. @item bt601
  8529. Set color space conforming to:
  8530. @itemize
  8531. @item
  8532. ITU Radiocommunication Sector (ITU-R) Recommendation BT.601
  8533. @item
  8534. ITU-R Rec. BT.470-6 (1998) Systems B, B1, and G
  8535. @item
  8536. Society of Motion Picture and Television Engineers (SMPTE) ST 170:2004
  8537. @end itemize
  8538. @item smpte240m
  8539. Set color space conforming to SMPTE ST 240:1999.
  8540. @end table
  8541. @item in_range
  8542. @item out_range
  8543. Set in/output YCbCr sample range.
  8544. This allows the autodetected value to be overridden as well as allows forcing
  8545. a specific value used for the output and encoder. If not specified, the
  8546. range depends on the pixel format. Possible values:
  8547. @table @samp
  8548. @item auto
  8549. Choose automatically.
  8550. @item jpeg/full/pc
  8551. Set full range (0-255 in case of 8-bit luma).
  8552. @item mpeg/tv
  8553. Set "MPEG" range (16-235 in case of 8-bit luma).
  8554. @end table
  8555. @item force_original_aspect_ratio
  8556. Enable decreasing or increasing output video width or height if necessary to
  8557. keep the original aspect ratio. Possible values:
  8558. @table @samp
  8559. @item disable
  8560. Scale the video as specified and disable this feature.
  8561. @item decrease
  8562. The output video dimensions will automatically be decreased if needed.
  8563. @item increase
  8564. The output video dimensions will automatically be increased if needed.
  8565. @end table
  8566. One useful instance of this option is that when you know a specific device's
  8567. maximum allowed resolution, you can use this to limit the output video to
  8568. that, while retaining the aspect ratio. For example, device A allows
  8569. 1280x720 playback, and your video is 1920x800. Using this option (set it to
  8570. decrease) and specifying 1280x720 to the command line makes the output
  8571. 1280x533.
  8572. Please note that this is a different thing than specifying -1 for @option{w}
  8573. or @option{h}, you still need to specify the output resolution for this option
  8574. to work.
  8575. @end table
  8576. The values of the @option{w} and @option{h} options are expressions
  8577. containing the following constants:
  8578. @table @var
  8579. @item in_w
  8580. @item in_h
  8581. The input width and height
  8582. @item iw
  8583. @item ih
  8584. These are the same as @var{in_w} and @var{in_h}.
  8585. @item out_w
  8586. @item out_h
  8587. The output (scaled) width and height
  8588. @item ow
  8589. @item oh
  8590. These are the same as @var{out_w} and @var{out_h}
  8591. @item a
  8592. The same as @var{iw} / @var{ih}
  8593. @item sar
  8594. input sample aspect ratio
  8595. @item dar
  8596. The input display aspect ratio. Calculated from @code{(iw / ih) * sar}.
  8597. @item hsub
  8598. @item vsub
  8599. horizontal and vertical input chroma subsample values. For example for the
  8600. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  8601. @item ohsub
  8602. @item ovsub
  8603. horizontal and vertical output chroma subsample values. For example for the
  8604. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  8605. @end table
  8606. @subsection Examples
  8607. @itemize
  8608. @item
  8609. Scale the input video to a size of 200x100
  8610. @example
  8611. scale=w=200:h=100
  8612. @end example
  8613. This is equivalent to:
  8614. @example
  8615. scale=200:100
  8616. @end example
  8617. or:
  8618. @example
  8619. scale=200x100
  8620. @end example
  8621. @item
  8622. Specify a size abbreviation for the output size:
  8623. @example
  8624. scale=qcif
  8625. @end example
  8626. which can also be written as:
  8627. @example
  8628. scale=size=qcif
  8629. @end example
  8630. @item
  8631. Scale the input to 2x:
  8632. @example
  8633. scale=w=2*iw:h=2*ih
  8634. @end example
  8635. @item
  8636. The above is the same as:
  8637. @example
  8638. scale=2*in_w:2*in_h
  8639. @end example
  8640. @item
  8641. Scale the input to 2x with forced interlaced scaling:
  8642. @example
  8643. scale=2*iw:2*ih:interl=1
  8644. @end example
  8645. @item
  8646. Scale the input to half size:
  8647. @example
  8648. scale=w=iw/2:h=ih/2
  8649. @end example
  8650. @item
  8651. Increase the width, and set the height to the same size:
  8652. @example
  8653. scale=3/2*iw:ow
  8654. @end example
  8655. @item
  8656. Seek Greek harmony:
  8657. @example
  8658. scale=iw:1/PHI*iw
  8659. scale=ih*PHI:ih
  8660. @end example
  8661. @item
  8662. Increase the height, and set the width to 3/2 of the height:
  8663. @example
  8664. scale=w=3/2*oh:h=3/5*ih
  8665. @end example
  8666. @item
  8667. Increase the size, making the size a multiple of the chroma
  8668. subsample values:
  8669. @example
  8670. scale="trunc(3/2*iw/hsub)*hsub:trunc(3/2*ih/vsub)*vsub"
  8671. @end example
  8672. @item
  8673. Increase the width to a maximum of 500 pixels,
  8674. keeping the same aspect ratio as the input:
  8675. @example
  8676. scale=w='min(500\, iw*3/2):h=-1'
  8677. @end example
  8678. @end itemize
  8679. @subsection Commands
  8680. This filter supports the following commands:
  8681. @table @option
  8682. @item width, w
  8683. @item height, h
  8684. Set the output video dimension expression.
  8685. The command accepts the same syntax of the corresponding option.
  8686. If the specified expression is not valid, it is kept at its current
  8687. value.
  8688. @end table
  8689. @section scale2ref
  8690. Scale (resize) the input video, based on a reference video.
  8691. See the scale filter for available options, scale2ref supports the same but
  8692. uses the reference video instead of the main input as basis.
  8693. @subsection Examples
  8694. @itemize
  8695. @item
  8696. Scale a subtitle stream to match the main video in size before overlaying
  8697. @example
  8698. 'scale2ref[b][a];[a][b]overlay'
  8699. @end example
  8700. @end itemize
  8701. @anchor{selectivecolor}
  8702. @section selectivecolor
  8703. Adjust cyan, magenta, yellow and black (CMYK) to certain ranges of colors (such
  8704. as "reds", "yellows", "greens", "cyans", ...). The adjustment range is defined
  8705. by the "purity" of the color (that is, how saturated it already is).
  8706. This filter is similar to the Adobe Photoshop Selective Color tool.
  8707. The filter accepts the following options:
  8708. @table @option
  8709. @item correction_method
  8710. Select color correction method.
  8711. Available values are:
  8712. @table @samp
  8713. @item absolute
  8714. Specified adjustments are applied "as-is" (added/subtracted to original pixel
  8715. component value).
  8716. @item relative
  8717. Specified adjustments are relative to the original component value.
  8718. @end table
  8719. Default is @code{absolute}.
  8720. @item reds
  8721. Adjustments for red pixels (pixels where the red component is the maximum)
  8722. @item yellows
  8723. Adjustments for yellow pixels (pixels where the blue component is the minimum)
  8724. @item greens
  8725. Adjustments for green pixels (pixels where the green component is the maximum)
  8726. @item cyans
  8727. Adjustments for cyan pixels (pixels where the red component is the minimum)
  8728. @item blues
  8729. Adjustments for blue pixels (pixels where the blue component is the maximum)
  8730. @item magentas
  8731. Adjustments for magenta pixels (pixels where the green component is the minimum)
  8732. @item whites
  8733. Adjustments for white pixels (pixels where all components are greater than 128)
  8734. @item neutrals
  8735. Adjustments for all pixels except pure black and pure white
  8736. @item blacks
  8737. Adjustments for black pixels (pixels where all components are lesser than 128)
  8738. @item psfile
  8739. Specify a Photoshop selective color file (@code{.asv}) to import the settings from.
  8740. @end table
  8741. All the adjustment settings (@option{reds}, @option{yellows}, ...) accept up to
  8742. 4 space separated floating point adjustment values in the [-1,1] range,
  8743. respectively to adjust the amount of cyan, magenta, yellow and black for the
  8744. pixels of its range.
  8745. @subsection Examples
  8746. @itemize
  8747. @item
  8748. Increase cyan by 50% and reduce yellow by 33% in every green areas, and
  8749. increase magenta by 27% in blue areas:
  8750. @example
  8751. selectivecolor=greens=.5 0 -.33 0:blues=0 .27
  8752. @end example
  8753. @item
  8754. Use a Photoshop selective color preset:
  8755. @example
  8756. selectivecolor=psfile=MySelectiveColorPresets/Misty.asv
  8757. @end example
  8758. @end itemize
  8759. @section separatefields
  8760. The @code{separatefields} takes a frame-based video input and splits
  8761. each frame into its components fields, producing a new half height clip
  8762. with twice the frame rate and twice the frame count.
  8763. This filter use field-dominance information in frame to decide which
  8764. of each pair of fields to place first in the output.
  8765. If it gets it wrong use @ref{setfield} filter before @code{separatefields} filter.
  8766. @section setdar, setsar
  8767. The @code{setdar} filter sets the Display Aspect Ratio for the filter
  8768. output video.
  8769. This is done by changing the specified Sample (aka Pixel) Aspect
  8770. Ratio, according to the following equation:
  8771. @example
  8772. @var{DAR} = @var{HORIZONTAL_RESOLUTION} / @var{VERTICAL_RESOLUTION} * @var{SAR}
  8773. @end example
  8774. Keep in mind that the @code{setdar} filter does not modify the pixel
  8775. dimensions of the video frame. Also, the display aspect ratio set by
  8776. this filter may be changed by later filters in the filterchain,
  8777. e.g. in case of scaling or if another "setdar" or a "setsar" filter is
  8778. applied.
  8779. The @code{setsar} filter sets the Sample (aka Pixel) Aspect Ratio for
  8780. the filter output video.
  8781. Note that as a consequence of the application of this filter, the
  8782. output display aspect ratio will change according to the equation
  8783. above.
  8784. Keep in mind that the sample aspect ratio set by the @code{setsar}
  8785. filter may be changed by later filters in the filterchain, e.g. if
  8786. another "setsar" or a "setdar" filter is applied.
  8787. It accepts the following parameters:
  8788. @table @option
  8789. @item r, ratio, dar (@code{setdar} only), sar (@code{setsar} only)
  8790. Set the aspect ratio used by the filter.
  8791. The parameter can be a floating point number string, an expression, or
  8792. a string of the form @var{num}:@var{den}, where @var{num} and
  8793. @var{den} are the numerator and denominator of the aspect ratio. If
  8794. the parameter is not specified, it is assumed the value "0".
  8795. In case the form "@var{num}:@var{den}" is used, the @code{:} character
  8796. should be escaped.
  8797. @item max
  8798. Set the maximum integer value to use for expressing numerator and
  8799. denominator when reducing the expressed aspect ratio to a rational.
  8800. Default value is @code{100}.
  8801. @end table
  8802. The parameter @var{sar} is an expression containing
  8803. the following constants:
  8804. @table @option
  8805. @item E, PI, PHI
  8806. These are approximated values for the mathematical constants e
  8807. (Euler's number), pi (Greek pi), and phi (the golden ratio).
  8808. @item w, h
  8809. The input width and height.
  8810. @item a
  8811. These are the same as @var{w} / @var{h}.
  8812. @item sar
  8813. The input sample aspect ratio.
  8814. @item dar
  8815. The input display aspect ratio. It is the same as
  8816. (@var{w} / @var{h}) * @var{sar}.
  8817. @item hsub, vsub
  8818. Horizontal and vertical chroma subsample values. For example, for the
  8819. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  8820. @end table
  8821. @subsection Examples
  8822. @itemize
  8823. @item
  8824. To change the display aspect ratio to 16:9, specify one of the following:
  8825. @example
  8826. setdar=dar=1.77777
  8827. setdar=dar=16/9
  8828. setdar=dar=1.77777
  8829. @end example
  8830. @item
  8831. To change the sample aspect ratio to 10:11, specify:
  8832. @example
  8833. setsar=sar=10/11
  8834. @end example
  8835. @item
  8836. To set a display aspect ratio of 16:9, and specify a maximum integer value of
  8837. 1000 in the aspect ratio reduction, use the command:
  8838. @example
  8839. setdar=ratio=16/9:max=1000
  8840. @end example
  8841. @end itemize
  8842. @anchor{setfield}
  8843. @section setfield
  8844. Force field for the output video frame.
  8845. The @code{setfield} filter marks the interlace type field for the
  8846. output frames. It does not change the input frame, but only sets the
  8847. corresponding property, which affects how the frame is treated by
  8848. following filters (e.g. @code{fieldorder} or @code{yadif}).
  8849. The filter accepts the following options:
  8850. @table @option
  8851. @item mode
  8852. Available values are:
  8853. @table @samp
  8854. @item auto
  8855. Keep the same field property.
  8856. @item bff
  8857. Mark the frame as bottom-field-first.
  8858. @item tff
  8859. Mark the frame as top-field-first.
  8860. @item prog
  8861. Mark the frame as progressive.
  8862. @end table
  8863. @end table
  8864. @section showinfo
  8865. Show a line containing various information for each input video frame.
  8866. The input video is not modified.
  8867. The shown line contains a sequence of key/value pairs of the form
  8868. @var{key}:@var{value}.
  8869. The following values are shown in the output:
  8870. @table @option
  8871. @item n
  8872. The (sequential) number of the input frame, starting from 0.
  8873. @item pts
  8874. The Presentation TimeStamp of the input frame, expressed as a number of
  8875. time base units. The time base unit depends on the filter input pad.
  8876. @item pts_time
  8877. The Presentation TimeStamp of the input frame, expressed as a number of
  8878. seconds.
  8879. @item pos
  8880. The position of the frame in the input stream, or -1 if this information is
  8881. unavailable and/or meaningless (for example in case of synthetic video).
  8882. @item fmt
  8883. The pixel format name.
  8884. @item sar
  8885. The sample aspect ratio of the input frame, expressed in the form
  8886. @var{num}/@var{den}.
  8887. @item s
  8888. The size of the input frame. For the syntax of this option, check the
  8889. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  8890. @item i
  8891. The type of interlaced mode ("P" for "progressive", "T" for top field first, "B"
  8892. for bottom field first).
  8893. @item iskey
  8894. This is 1 if the frame is a key frame, 0 otherwise.
  8895. @item type
  8896. The picture type of the input frame ("I" for an I-frame, "P" for a
  8897. P-frame, "B" for a B-frame, or "?" for an unknown type).
  8898. Also refer to the documentation of the @code{AVPictureType} enum and of
  8899. the @code{av_get_picture_type_char} function defined in
  8900. @file{libavutil/avutil.h}.
  8901. @item checksum
  8902. The Adler-32 checksum (printed in hexadecimal) of all the planes of the input frame.
  8903. @item plane_checksum
  8904. The Adler-32 checksum (printed in hexadecimal) of each plane of the input frame,
  8905. expressed in the form "[@var{c0} @var{c1} @var{c2} @var{c3}]".
  8906. @end table
  8907. @section showpalette
  8908. Displays the 256 colors palette of each frame. This filter is only relevant for
  8909. @var{pal8} pixel format frames.
  8910. It accepts the following option:
  8911. @table @option
  8912. @item s
  8913. Set the size of the box used to represent one palette color entry. Default is
  8914. @code{30} (for a @code{30x30} pixel box).
  8915. @end table
  8916. @section shuffleframes
  8917. Reorder and/or duplicate video frames.
  8918. It accepts the following parameters:
  8919. @table @option
  8920. @item mapping
  8921. Set the destination indexes of input frames.
  8922. This is space or '|' separated list of indexes that maps input frames to output
  8923. frames. Number of indexes also sets maximal value that each index may have.
  8924. @end table
  8925. The first frame has the index 0. The default is to keep the input unchanged.
  8926. Swap second and third frame of every three frames of the input:
  8927. @example
  8928. ffmpeg -i INPUT -vf "shuffleframes=0 2 1" OUTPUT
  8929. @end example
  8930. @section shuffleplanes
  8931. Reorder and/or duplicate video planes.
  8932. It accepts the following parameters:
  8933. @table @option
  8934. @item map0
  8935. The index of the input plane to be used as the first output plane.
  8936. @item map1
  8937. The index of the input plane to be used as the second output plane.
  8938. @item map2
  8939. The index of the input plane to be used as the third output plane.
  8940. @item map3
  8941. The index of the input plane to be used as the fourth output plane.
  8942. @end table
  8943. The first plane has the index 0. The default is to keep the input unchanged.
  8944. Swap the second and third planes of the input:
  8945. @example
  8946. ffmpeg -i INPUT -vf shuffleplanes=0:2:1:3 OUTPUT
  8947. @end example
  8948. @anchor{signalstats}
  8949. @section signalstats
  8950. Evaluate various visual metrics that assist in determining issues associated
  8951. with the digitization of analog video media.
  8952. By default the filter will log these metadata values:
  8953. @table @option
  8954. @item YMIN
  8955. Display the minimal Y value contained within the input frame. Expressed in
  8956. range of [0-255].
  8957. @item YLOW
  8958. Display the Y value at the 10% percentile within the input frame. Expressed in
  8959. range of [0-255].
  8960. @item YAVG
  8961. Display the average Y value within the input frame. Expressed in range of
  8962. [0-255].
  8963. @item YHIGH
  8964. Display the Y value at the 90% percentile within the input frame. Expressed in
  8965. range of [0-255].
  8966. @item YMAX
  8967. Display the maximum Y value contained within the input frame. Expressed in
  8968. range of [0-255].
  8969. @item UMIN
  8970. Display the minimal U value contained within the input frame. Expressed in
  8971. range of [0-255].
  8972. @item ULOW
  8973. Display the U value at the 10% percentile within the input frame. Expressed in
  8974. range of [0-255].
  8975. @item UAVG
  8976. Display the average U value within the input frame. Expressed in range of
  8977. [0-255].
  8978. @item UHIGH
  8979. Display the U value at the 90% percentile within the input frame. Expressed in
  8980. range of [0-255].
  8981. @item UMAX
  8982. Display the maximum U value contained within the input frame. Expressed in
  8983. range of [0-255].
  8984. @item VMIN
  8985. Display the minimal V value contained within the input frame. Expressed in
  8986. range of [0-255].
  8987. @item VLOW
  8988. Display the V value at the 10% percentile within the input frame. Expressed in
  8989. range of [0-255].
  8990. @item VAVG
  8991. Display the average V value within the input frame. Expressed in range of
  8992. [0-255].
  8993. @item VHIGH
  8994. Display the V value at the 90% percentile within the input frame. Expressed in
  8995. range of [0-255].
  8996. @item VMAX
  8997. Display the maximum V value contained within the input frame. Expressed in
  8998. range of [0-255].
  8999. @item SATMIN
  9000. Display the minimal saturation value contained within the input frame.
  9001. Expressed in range of [0-~181.02].
  9002. @item SATLOW
  9003. Display the saturation value at the 10% percentile within the input frame.
  9004. Expressed in range of [0-~181.02].
  9005. @item SATAVG
  9006. Display the average saturation value within the input frame. Expressed in range
  9007. of [0-~181.02].
  9008. @item SATHIGH
  9009. Display the saturation value at the 90% percentile within the input frame.
  9010. Expressed in range of [0-~181.02].
  9011. @item SATMAX
  9012. Display the maximum saturation value contained within the input frame.
  9013. Expressed in range of [0-~181.02].
  9014. @item HUEMED
  9015. Display the median value for hue within the input frame. Expressed in range of
  9016. [0-360].
  9017. @item HUEAVG
  9018. Display the average value for hue within the input frame. Expressed in range of
  9019. [0-360].
  9020. @item YDIF
  9021. Display the average of sample value difference between all values of the Y
  9022. plane in the current frame and corresponding values of the previous input frame.
  9023. Expressed in range of [0-255].
  9024. @item UDIF
  9025. Display the average of sample value difference between all values of the U
  9026. plane in the current frame and corresponding values of the previous input frame.
  9027. Expressed in range of [0-255].
  9028. @item VDIF
  9029. Display the average of sample value difference between all values of the V
  9030. plane in the current frame and corresponding values of the previous input frame.
  9031. Expressed in range of [0-255].
  9032. @end table
  9033. The filter accepts the following options:
  9034. @table @option
  9035. @item stat
  9036. @item out
  9037. @option{stat} specify an additional form of image analysis.
  9038. @option{out} output video with the specified type of pixel highlighted.
  9039. Both options accept the following values:
  9040. @table @samp
  9041. @item tout
  9042. Identify @var{temporal outliers} pixels. A @var{temporal outlier} is a pixel
  9043. unlike the neighboring pixels of the same field. Examples of temporal outliers
  9044. include the results of video dropouts, head clogs, or tape tracking issues.
  9045. @item vrep
  9046. Identify @var{vertical line repetition}. Vertical line repetition includes
  9047. similar rows of pixels within a frame. In born-digital video vertical line
  9048. repetition is common, but this pattern is uncommon in video digitized from an
  9049. analog source. When it occurs in video that results from the digitization of an
  9050. analog source it can indicate concealment from a dropout compensator.
  9051. @item brng
  9052. Identify pixels that fall outside of legal broadcast range.
  9053. @end table
  9054. @item color, c
  9055. Set the highlight color for the @option{out} option. The default color is
  9056. yellow.
  9057. @end table
  9058. @subsection Examples
  9059. @itemize
  9060. @item
  9061. Output data of various video metrics:
  9062. @example
  9063. ffprobe -f lavfi movie=example.mov,signalstats="stat=tout+vrep+brng" -show_frames
  9064. @end example
  9065. @item
  9066. Output specific data about the minimum and maximum values of the Y plane per frame:
  9067. @example
  9068. ffprobe -f lavfi movie=example.mov,signalstats -show_entries frame_tags=lavfi.signalstats.YMAX,lavfi.signalstats.YMIN
  9069. @end example
  9070. @item
  9071. Playback video while highlighting pixels that are outside of broadcast range in red.
  9072. @example
  9073. ffplay example.mov -vf signalstats="out=brng:color=red"
  9074. @end example
  9075. @item
  9076. Playback video with signalstats metadata drawn over the frame.
  9077. @example
  9078. ffplay example.mov -vf signalstats=stat=brng+vrep+tout,drawtext=fontfile=FreeSerif.ttf:textfile=signalstat_drawtext.txt
  9079. @end example
  9080. The contents of signalstat_drawtext.txt used in the command are:
  9081. @example
  9082. time %@{pts:hms@}
  9083. Y (%@{metadata:lavfi.signalstats.YMIN@}-%@{metadata:lavfi.signalstats.YMAX@})
  9084. U (%@{metadata:lavfi.signalstats.UMIN@}-%@{metadata:lavfi.signalstats.UMAX@})
  9085. V (%@{metadata:lavfi.signalstats.VMIN@}-%@{metadata:lavfi.signalstats.VMAX@})
  9086. saturation maximum: %@{metadata:lavfi.signalstats.SATMAX@}
  9087. @end example
  9088. @end itemize
  9089. @anchor{smartblur}
  9090. @section smartblur
  9091. Blur the input video without impacting the outlines.
  9092. It accepts the following options:
  9093. @table @option
  9094. @item luma_radius, lr
  9095. Set the luma radius. The option value must be a float number in
  9096. the range [0.1,5.0] that specifies the variance of the gaussian filter
  9097. used to blur the image (slower if larger). Default value is 1.0.
  9098. @item luma_strength, ls
  9099. Set the luma strength. The option value must be a float number
  9100. in the range [-1.0,1.0] that configures the blurring. A value included
  9101. in [0.0,1.0] will blur the image whereas a value included in
  9102. [-1.0,0.0] will sharpen the image. Default value is 1.0.
  9103. @item luma_threshold, lt
  9104. Set the luma threshold used as a coefficient to determine
  9105. whether a pixel should be blurred or not. The option value must be an
  9106. integer in the range [-30,30]. A value of 0 will filter all the image,
  9107. a value included in [0,30] will filter flat areas and a value included
  9108. in [-30,0] will filter edges. Default value is 0.
  9109. @item chroma_radius, cr
  9110. Set the chroma radius. The option value must be a float number in
  9111. the range [0.1,5.0] that specifies the variance of the gaussian filter
  9112. used to blur the image (slower if larger). Default value is 1.0.
  9113. @item chroma_strength, cs
  9114. Set the chroma strength. The option value must be a float number
  9115. in the range [-1.0,1.0] that configures the blurring. A value included
  9116. in [0.0,1.0] will blur the image whereas a value included in
  9117. [-1.0,0.0] will sharpen the image. Default value is 1.0.
  9118. @item chroma_threshold, ct
  9119. Set the chroma threshold used as a coefficient to determine
  9120. whether a pixel should be blurred or not. The option value must be an
  9121. integer in the range [-30,30]. A value of 0 will filter all the image,
  9122. a value included in [0,30] will filter flat areas and a value included
  9123. in [-30,0] will filter edges. Default value is 0.
  9124. @end table
  9125. If a chroma option is not explicitly set, the corresponding luma value
  9126. is set.
  9127. @section ssim
  9128. Obtain the SSIM (Structural SImilarity Metric) between two input videos.
  9129. This filter takes in input two input videos, the first input is
  9130. considered the "main" source and is passed unchanged to the
  9131. output. The second input is used as a "reference" video for computing
  9132. the SSIM.
  9133. Both video inputs must have the same resolution and pixel format for
  9134. this filter to work correctly. Also it assumes that both inputs
  9135. have the same number of frames, which are compared one by one.
  9136. The filter stores the calculated SSIM of each frame.
  9137. The description of the accepted parameters follows.
  9138. @table @option
  9139. @item stats_file, f
  9140. If specified the filter will use the named file to save the SSIM of
  9141. each individual frame. When filename equals "-" the data is sent to
  9142. standard output.
  9143. @end table
  9144. The file printed if @var{stats_file} is selected, contains a sequence of
  9145. key/value pairs of the form @var{key}:@var{value} for each compared
  9146. couple of frames.
  9147. A description of each shown parameter follows:
  9148. @table @option
  9149. @item n
  9150. sequential number of the input frame, starting from 1
  9151. @item Y, U, V, R, G, B
  9152. SSIM of the compared frames for the component specified by the suffix.
  9153. @item All
  9154. SSIM of the compared frames for the whole frame.
  9155. @item dB
  9156. Same as above but in dB representation.
  9157. @end table
  9158. For example:
  9159. @example
  9160. movie=ref_movie.mpg, setpts=PTS-STARTPTS [main];
  9161. [main][ref] ssim="stats_file=stats.log" [out]
  9162. @end example
  9163. On this example the input file being processed is compared with the
  9164. reference file @file{ref_movie.mpg}. The SSIM of each individual frame
  9165. is stored in @file{stats.log}.
  9166. Another example with both psnr and ssim at same time:
  9167. @example
  9168. ffmpeg -i main.mpg -i ref.mpg -lavfi "ssim;[0:v][1:v]psnr" -f null -
  9169. @end example
  9170. @section stereo3d
  9171. Convert between different stereoscopic image formats.
  9172. The filters accept the following options:
  9173. @table @option
  9174. @item in
  9175. Set stereoscopic image format of input.
  9176. Available values for input image formats are:
  9177. @table @samp
  9178. @item sbsl
  9179. side by side parallel (left eye left, right eye right)
  9180. @item sbsr
  9181. side by side crosseye (right eye left, left eye right)
  9182. @item sbs2l
  9183. side by side parallel with half width resolution
  9184. (left eye left, right eye right)
  9185. @item sbs2r
  9186. side by side crosseye with half width resolution
  9187. (right eye left, left eye right)
  9188. @item abl
  9189. above-below (left eye above, right eye below)
  9190. @item abr
  9191. above-below (right eye above, left eye below)
  9192. @item ab2l
  9193. above-below with half height resolution
  9194. (left eye above, right eye below)
  9195. @item ab2r
  9196. above-below with half height resolution
  9197. (right eye above, left eye below)
  9198. @item al
  9199. alternating frames (left eye first, right eye second)
  9200. @item ar
  9201. alternating frames (right eye first, left eye second)
  9202. @item irl
  9203. interleaved rows (left eye has top row, right eye starts on next row)
  9204. @item irr
  9205. interleaved rows (right eye has top row, left eye starts on next row)
  9206. @item icl
  9207. interleaved columns, left eye first
  9208. @item icr
  9209. interleaved columns, right eye first
  9210. Default value is @samp{sbsl}.
  9211. @end table
  9212. @item out
  9213. Set stereoscopic image format of output.
  9214. @table @samp
  9215. @item sbsl
  9216. side by side parallel (left eye left, right eye right)
  9217. @item sbsr
  9218. side by side crosseye (right eye left, left eye right)
  9219. @item sbs2l
  9220. side by side parallel with half width resolution
  9221. (left eye left, right eye right)
  9222. @item sbs2r
  9223. side by side crosseye with half width resolution
  9224. (right eye left, left eye right)
  9225. @item abl
  9226. above-below (left eye above, right eye below)
  9227. @item abr
  9228. above-below (right eye above, left eye below)
  9229. @item ab2l
  9230. above-below with half height resolution
  9231. (left eye above, right eye below)
  9232. @item ab2r
  9233. above-below with half height resolution
  9234. (right eye above, left eye below)
  9235. @item al
  9236. alternating frames (left eye first, right eye second)
  9237. @item ar
  9238. alternating frames (right eye first, left eye second)
  9239. @item irl
  9240. interleaved rows (left eye has top row, right eye starts on next row)
  9241. @item irr
  9242. interleaved rows (right eye has top row, left eye starts on next row)
  9243. @item arbg
  9244. anaglyph red/blue gray
  9245. (red filter on left eye, blue filter on right eye)
  9246. @item argg
  9247. anaglyph red/green gray
  9248. (red filter on left eye, green filter on right eye)
  9249. @item arcg
  9250. anaglyph red/cyan gray
  9251. (red filter on left eye, cyan filter on right eye)
  9252. @item arch
  9253. anaglyph red/cyan half colored
  9254. (red filter on left eye, cyan filter on right eye)
  9255. @item arcc
  9256. anaglyph red/cyan color
  9257. (red filter on left eye, cyan filter on right eye)
  9258. @item arcd
  9259. anaglyph red/cyan color optimized with the least squares projection of dubois
  9260. (red filter on left eye, cyan filter on right eye)
  9261. @item agmg
  9262. anaglyph green/magenta gray
  9263. (green filter on left eye, magenta filter on right eye)
  9264. @item agmh
  9265. anaglyph green/magenta half colored
  9266. (green filter on left eye, magenta filter on right eye)
  9267. @item agmc
  9268. anaglyph green/magenta colored
  9269. (green filter on left eye, magenta filter on right eye)
  9270. @item agmd
  9271. anaglyph green/magenta color optimized with the least squares projection of dubois
  9272. (green filter on left eye, magenta filter on right eye)
  9273. @item aybg
  9274. anaglyph yellow/blue gray
  9275. (yellow filter on left eye, blue filter on right eye)
  9276. @item aybh
  9277. anaglyph yellow/blue half colored
  9278. (yellow filter on left eye, blue filter on right eye)
  9279. @item aybc
  9280. anaglyph yellow/blue colored
  9281. (yellow filter on left eye, blue filter on right eye)
  9282. @item aybd
  9283. anaglyph yellow/blue color optimized with the least squares projection of dubois
  9284. (yellow filter on left eye, blue filter on right eye)
  9285. @item ml
  9286. mono output (left eye only)
  9287. @item mr
  9288. mono output (right eye only)
  9289. @item chl
  9290. checkerboard, left eye first
  9291. @item chr
  9292. checkerboard, right eye first
  9293. @item icl
  9294. interleaved columns, left eye first
  9295. @item icr
  9296. interleaved columns, right eye first
  9297. @end table
  9298. Default value is @samp{arcd}.
  9299. @end table
  9300. @subsection Examples
  9301. @itemize
  9302. @item
  9303. Convert input video from side by side parallel to anaglyph yellow/blue dubois:
  9304. @example
  9305. stereo3d=sbsl:aybd
  9306. @end example
  9307. @item
  9308. Convert input video from above below (left eye above, right eye below) to side by side crosseye.
  9309. @example
  9310. stereo3d=abl:sbsr
  9311. @end example
  9312. @end itemize
  9313. @section streamselect, astreamselect
  9314. Select video or audio streams.
  9315. The filter accepts the following options:
  9316. @table @option
  9317. @item inputs
  9318. Set number of inputs. Default is 2.
  9319. @item map
  9320. Set input indexes to remap to outputs.
  9321. @end table
  9322. @subsection Commands
  9323. The @code{streamselect} and @code{astreamselect} filter supports the following
  9324. commands:
  9325. @table @option
  9326. @item map
  9327. Set input indexes to remap to outputs.
  9328. @end table
  9329. @subsection Examples
  9330. @itemize
  9331. @item
  9332. Select first 5 seconds 1st stream and rest of time 2nd stream:
  9333. @example
  9334. sendcmd='5.0 streamselect map 1',streamselect=inputs=2:map=0
  9335. @end example
  9336. @item
  9337. Same as above, but for audio:
  9338. @example
  9339. asendcmd='5.0 astreamselect map 1',astreamselect=inputs=2:map=0
  9340. @end example
  9341. @end itemize
  9342. @anchor{spp}
  9343. @section spp
  9344. Apply a simple postprocessing filter that compresses and decompresses the image
  9345. at several (or - in the case of @option{quality} level @code{6} - all) shifts
  9346. and average the results.
  9347. The filter accepts the following options:
  9348. @table @option
  9349. @item quality
  9350. Set quality. This option defines the number of levels for averaging. It accepts
  9351. an integer in the range 0-6. If set to @code{0}, the filter will have no
  9352. effect. A value of @code{6} means the higher quality. For each increment of
  9353. that value the speed drops by a factor of approximately 2. Default value is
  9354. @code{3}.
  9355. @item qp
  9356. Force a constant quantization parameter. If not set, the filter will use the QP
  9357. from the video stream (if available).
  9358. @item mode
  9359. Set thresholding mode. Available modes are:
  9360. @table @samp
  9361. @item hard
  9362. Set hard thresholding (default).
  9363. @item soft
  9364. Set soft thresholding (better de-ringing effect, but likely blurrier).
  9365. @end table
  9366. @item use_bframe_qp
  9367. Enable the use of the QP from the B-Frames if set to @code{1}. Using this
  9368. option may cause flicker since the B-Frames have often larger QP. Default is
  9369. @code{0} (not enabled).
  9370. @end table
  9371. @anchor{subtitles}
  9372. @section subtitles
  9373. Draw subtitles on top of input video using the libass library.
  9374. To enable compilation of this filter you need to configure FFmpeg with
  9375. @code{--enable-libass}. This filter also requires a build with libavcodec and
  9376. libavformat to convert the passed subtitles file to ASS (Advanced Substation
  9377. Alpha) subtitles format.
  9378. The filter accepts the following options:
  9379. @table @option
  9380. @item filename, f
  9381. Set the filename of the subtitle file to read. It must be specified.
  9382. @item original_size
  9383. Specify the size of the original video, the video for which the ASS file
  9384. was composed. For the syntax of this option, check the
  9385. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  9386. Due to a misdesign in ASS aspect ratio arithmetic, this is necessary to
  9387. correctly scale the fonts if the aspect ratio has been changed.
  9388. @item fontsdir
  9389. Set a directory path containing fonts that can be used by the filter.
  9390. These fonts will be used in addition to whatever the font provider uses.
  9391. @item charenc
  9392. Set subtitles input character encoding. @code{subtitles} filter only. Only
  9393. useful if not UTF-8.
  9394. @item stream_index, si
  9395. Set subtitles stream index. @code{subtitles} filter only.
  9396. @item force_style
  9397. Override default style or script info parameters of the subtitles. It accepts a
  9398. string containing ASS style format @code{KEY=VALUE} couples separated by ",".
  9399. @end table
  9400. If the first key is not specified, it is assumed that the first value
  9401. specifies the @option{filename}.
  9402. For example, to render the file @file{sub.srt} on top of the input
  9403. video, use the command:
  9404. @example
  9405. subtitles=sub.srt
  9406. @end example
  9407. which is equivalent to:
  9408. @example
  9409. subtitles=filename=sub.srt
  9410. @end example
  9411. To render the default subtitles stream from file @file{video.mkv}, use:
  9412. @example
  9413. subtitles=video.mkv
  9414. @end example
  9415. To render the second subtitles stream from that file, use:
  9416. @example
  9417. subtitles=video.mkv:si=1
  9418. @end example
  9419. To make the subtitles stream from @file{sub.srt} appear in transparent green
  9420. @code{DejaVu Serif}, use:
  9421. @example
  9422. subtitles=sub.srt:force_style='FontName=DejaVu Serif,PrimaryColour=&HAA00FF00'
  9423. @end example
  9424. @section super2xsai
  9425. Scale the input by 2x and smooth using the Super2xSaI (Scale and
  9426. Interpolate) pixel art scaling algorithm.
  9427. Useful for enlarging pixel art images without reducing sharpness.
  9428. @section swaprect
  9429. Swap two rectangular objects in video.
  9430. This filter accepts the following options:
  9431. @table @option
  9432. @item w
  9433. Set object width.
  9434. @item h
  9435. Set object height.
  9436. @item x1
  9437. Set 1st rect x coordinate.
  9438. @item y1
  9439. Set 1st rect y coordinate.
  9440. @item x2
  9441. Set 2nd rect x coordinate.
  9442. @item y2
  9443. Set 2nd rect y coordinate.
  9444. All expressions are evaluated once for each frame.
  9445. @end table
  9446. The all options are expressions containing the following constants:
  9447. @table @option
  9448. @item w
  9449. @item h
  9450. The input width and height.
  9451. @item a
  9452. same as @var{w} / @var{h}
  9453. @item sar
  9454. input sample aspect ratio
  9455. @item dar
  9456. input display aspect ratio, it is the same as (@var{w} / @var{h}) * @var{sar}
  9457. @item n
  9458. The number of the input frame, starting from 0.
  9459. @item t
  9460. The timestamp expressed in seconds. It's NAN if the input timestamp is unknown.
  9461. @item pos
  9462. the position in the file of the input frame, NAN if unknown
  9463. @end table
  9464. @section swapuv
  9465. Swap U & V plane.
  9466. @section telecine
  9467. Apply telecine process to the video.
  9468. This filter accepts the following options:
  9469. @table @option
  9470. @item first_field
  9471. @table @samp
  9472. @item top, t
  9473. top field first
  9474. @item bottom, b
  9475. bottom field first
  9476. The default value is @code{top}.
  9477. @end table
  9478. @item pattern
  9479. A string of numbers representing the pulldown pattern you wish to apply.
  9480. The default value is @code{23}.
  9481. @end table
  9482. @example
  9483. Some typical patterns:
  9484. NTSC output (30i):
  9485. 27.5p: 32222
  9486. 24p: 23 (classic)
  9487. 24p: 2332 (preferred)
  9488. 20p: 33
  9489. 18p: 334
  9490. 16p: 3444
  9491. PAL output (25i):
  9492. 27.5p: 12222
  9493. 24p: 222222222223 ("Euro pulldown")
  9494. 16.67p: 33
  9495. 16p: 33333334
  9496. @end example
  9497. @section thumbnail
  9498. Select the most representative frame in a given sequence of consecutive frames.
  9499. The filter accepts the following options:
  9500. @table @option
  9501. @item n
  9502. Set the frames batch size to analyze; in a set of @var{n} frames, the filter
  9503. will pick one of them, and then handle the next batch of @var{n} frames until
  9504. the end. Default is @code{100}.
  9505. @end table
  9506. Since the filter keeps track of the whole frames sequence, a bigger @var{n}
  9507. value will result in a higher memory usage, so a high value is not recommended.
  9508. @subsection Examples
  9509. @itemize
  9510. @item
  9511. Extract one picture each 50 frames:
  9512. @example
  9513. thumbnail=50
  9514. @end example
  9515. @item
  9516. Complete example of a thumbnail creation with @command{ffmpeg}:
  9517. @example
  9518. ffmpeg -i in.avi -vf thumbnail,scale=300:200 -frames:v 1 out.png
  9519. @end example
  9520. @end itemize
  9521. @section tile
  9522. Tile several successive frames together.
  9523. The filter accepts the following options:
  9524. @table @option
  9525. @item layout
  9526. Set the grid size (i.e. the number of lines and columns). For the syntax of
  9527. this option, check the
  9528. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  9529. @item nb_frames
  9530. Set the maximum number of frames to render in the given area. It must be less
  9531. than or equal to @var{w}x@var{h}. The default value is @code{0}, meaning all
  9532. the area will be used.
  9533. @item margin
  9534. Set the outer border margin in pixels.
  9535. @item padding
  9536. Set the inner border thickness (i.e. the number of pixels between frames). For
  9537. more advanced padding options (such as having different values for the edges),
  9538. refer to the pad video filter.
  9539. @item color
  9540. Specify the color of the unused area. For the syntax of this option, check the
  9541. "Color" section in the ffmpeg-utils manual. The default value of @var{color}
  9542. is "black".
  9543. @end table
  9544. @subsection Examples
  9545. @itemize
  9546. @item
  9547. Produce 8x8 PNG tiles of all keyframes (@option{-skip_frame nokey}) in a movie:
  9548. @example
  9549. ffmpeg -skip_frame nokey -i file.avi -vf 'scale=128:72,tile=8x8' -an -vsync 0 keyframes%03d.png
  9550. @end example
  9551. The @option{-vsync 0} is necessary to prevent @command{ffmpeg} from
  9552. duplicating each output frame to accommodate the originally detected frame
  9553. rate.
  9554. @item
  9555. Display @code{5} pictures in an area of @code{3x2} frames,
  9556. with @code{7} pixels between them, and @code{2} pixels of initial margin, using
  9557. mixed flat and named options:
  9558. @example
  9559. tile=3x2:nb_frames=5:padding=7:margin=2
  9560. @end example
  9561. @end itemize
  9562. @section tinterlace
  9563. Perform various types of temporal field interlacing.
  9564. Frames are counted starting from 1, so the first input frame is
  9565. considered odd.
  9566. The filter accepts the following options:
  9567. @table @option
  9568. @item mode
  9569. Specify the mode of the interlacing. This option can also be specified
  9570. as a value alone. See below for a list of values for this option.
  9571. Available values are:
  9572. @table @samp
  9573. @item merge, 0
  9574. Move odd frames into the upper field, even into the lower field,
  9575. generating a double height frame at half frame rate.
  9576. @example
  9577. ------> time
  9578. Input:
  9579. Frame 1 Frame 2 Frame 3 Frame 4
  9580. 11111 22222 33333 44444
  9581. 11111 22222 33333 44444
  9582. 11111 22222 33333 44444
  9583. 11111 22222 33333 44444
  9584. Output:
  9585. 11111 33333
  9586. 22222 44444
  9587. 11111 33333
  9588. 22222 44444
  9589. 11111 33333
  9590. 22222 44444
  9591. 11111 33333
  9592. 22222 44444
  9593. @end example
  9594. @item drop_odd, 1
  9595. Only output even frames, odd frames are dropped, generating a frame with
  9596. unchanged height at half frame rate.
  9597. @example
  9598. ------> time
  9599. Input:
  9600. Frame 1 Frame 2 Frame 3 Frame 4
  9601. 11111 22222 33333 44444
  9602. 11111 22222 33333 44444
  9603. 11111 22222 33333 44444
  9604. 11111 22222 33333 44444
  9605. Output:
  9606. 22222 44444
  9607. 22222 44444
  9608. 22222 44444
  9609. 22222 44444
  9610. @end example
  9611. @item drop_even, 2
  9612. Only output odd frames, even frames are dropped, generating a frame with
  9613. unchanged height at half frame rate.
  9614. @example
  9615. ------> time
  9616. Input:
  9617. Frame 1 Frame 2 Frame 3 Frame 4
  9618. 11111 22222 33333 44444
  9619. 11111 22222 33333 44444
  9620. 11111 22222 33333 44444
  9621. 11111 22222 33333 44444
  9622. Output:
  9623. 11111 33333
  9624. 11111 33333
  9625. 11111 33333
  9626. 11111 33333
  9627. @end example
  9628. @item pad, 3
  9629. Expand each frame to full height, but pad alternate lines with black,
  9630. generating a frame with double height at the same input frame rate.
  9631. @example
  9632. ------> time
  9633. Input:
  9634. Frame 1 Frame 2 Frame 3 Frame 4
  9635. 11111 22222 33333 44444
  9636. 11111 22222 33333 44444
  9637. 11111 22222 33333 44444
  9638. 11111 22222 33333 44444
  9639. Output:
  9640. 11111 ..... 33333 .....
  9641. ..... 22222 ..... 44444
  9642. 11111 ..... 33333 .....
  9643. ..... 22222 ..... 44444
  9644. 11111 ..... 33333 .....
  9645. ..... 22222 ..... 44444
  9646. 11111 ..... 33333 .....
  9647. ..... 22222 ..... 44444
  9648. @end example
  9649. @item interleave_top, 4
  9650. Interleave the upper field from odd frames with the lower field from
  9651. even frames, generating a frame with unchanged height at half frame rate.
  9652. @example
  9653. ------> time
  9654. Input:
  9655. Frame 1 Frame 2 Frame 3 Frame 4
  9656. 11111<- 22222 33333<- 44444
  9657. 11111 22222<- 33333 44444<-
  9658. 11111<- 22222 33333<- 44444
  9659. 11111 22222<- 33333 44444<-
  9660. Output:
  9661. 11111 33333
  9662. 22222 44444
  9663. 11111 33333
  9664. 22222 44444
  9665. @end example
  9666. @item interleave_bottom, 5
  9667. Interleave the lower field from odd frames with the upper field from
  9668. even frames, generating a frame with unchanged height at half frame rate.
  9669. @example
  9670. ------> time
  9671. Input:
  9672. Frame 1 Frame 2 Frame 3 Frame 4
  9673. 11111 22222<- 33333 44444<-
  9674. 11111<- 22222 33333<- 44444
  9675. 11111 22222<- 33333 44444<-
  9676. 11111<- 22222 33333<- 44444
  9677. Output:
  9678. 22222 44444
  9679. 11111 33333
  9680. 22222 44444
  9681. 11111 33333
  9682. @end example
  9683. @item interlacex2, 6
  9684. Double frame rate with unchanged height. Frames are inserted each
  9685. containing the second temporal field from the previous input frame and
  9686. the first temporal field from the next input frame. This mode relies on
  9687. the top_field_first flag. Useful for interlaced video displays with no
  9688. field synchronisation.
  9689. @example
  9690. ------> time
  9691. Input:
  9692. Frame 1 Frame 2 Frame 3 Frame 4
  9693. 11111 22222 33333 44444
  9694. 11111 22222 33333 44444
  9695. 11111 22222 33333 44444
  9696. 11111 22222 33333 44444
  9697. Output:
  9698. 11111 22222 22222 33333 33333 44444 44444
  9699. 11111 11111 22222 22222 33333 33333 44444
  9700. 11111 22222 22222 33333 33333 44444 44444
  9701. 11111 11111 22222 22222 33333 33333 44444
  9702. @end example
  9703. @item mergex2, 7
  9704. Move odd frames into the upper field, even into the lower field,
  9705. generating a double height frame at same frame rate.
  9706. @example
  9707. ------> time
  9708. Input:
  9709. Frame 1 Frame 2 Frame 3 Frame 4
  9710. 11111 22222 33333 44444
  9711. 11111 22222 33333 44444
  9712. 11111 22222 33333 44444
  9713. 11111 22222 33333 44444
  9714. Output:
  9715. 11111 33333 33333 55555
  9716. 22222 22222 44444 44444
  9717. 11111 33333 33333 55555
  9718. 22222 22222 44444 44444
  9719. 11111 33333 33333 55555
  9720. 22222 22222 44444 44444
  9721. 11111 33333 33333 55555
  9722. 22222 22222 44444 44444
  9723. @end example
  9724. @end table
  9725. Numeric values are deprecated but are accepted for backward
  9726. compatibility reasons.
  9727. Default mode is @code{merge}.
  9728. @item flags
  9729. Specify flags influencing the filter process.
  9730. Available value for @var{flags} is:
  9731. @table @option
  9732. @item low_pass_filter, vlfp
  9733. Enable vertical low-pass filtering in the filter.
  9734. Vertical low-pass filtering is required when creating an interlaced
  9735. destination from a progressive source which contains high-frequency
  9736. vertical detail. Filtering will reduce interlace 'twitter' and Moire
  9737. patterning.
  9738. Vertical low-pass filtering can only be enabled for @option{mode}
  9739. @var{interleave_top} and @var{interleave_bottom}.
  9740. @end table
  9741. @end table
  9742. @section transpose
  9743. Transpose rows with columns in the input video and optionally flip it.
  9744. It accepts the following parameters:
  9745. @table @option
  9746. @item dir
  9747. Specify the transposition direction.
  9748. Can assume the following values:
  9749. @table @samp
  9750. @item 0, 4, cclock_flip
  9751. Rotate by 90 degrees counterclockwise and vertically flip (default), that is:
  9752. @example
  9753. L.R L.l
  9754. . . -> . .
  9755. l.r R.r
  9756. @end example
  9757. @item 1, 5, clock
  9758. Rotate by 90 degrees clockwise, that is:
  9759. @example
  9760. L.R l.L
  9761. . . -> . .
  9762. l.r r.R
  9763. @end example
  9764. @item 2, 6, cclock
  9765. Rotate by 90 degrees counterclockwise, that is:
  9766. @example
  9767. L.R R.r
  9768. . . -> . .
  9769. l.r L.l
  9770. @end example
  9771. @item 3, 7, clock_flip
  9772. Rotate by 90 degrees clockwise and vertically flip, that is:
  9773. @example
  9774. L.R r.R
  9775. . . -> . .
  9776. l.r l.L
  9777. @end example
  9778. @end table
  9779. For values between 4-7, the transposition is only done if the input
  9780. video geometry is portrait and not landscape. These values are
  9781. deprecated, the @code{passthrough} option should be used instead.
  9782. Numerical values are deprecated, and should be dropped in favor of
  9783. symbolic constants.
  9784. @item passthrough
  9785. Do not apply the transposition if the input geometry matches the one
  9786. specified by the specified value. It accepts the following values:
  9787. @table @samp
  9788. @item none
  9789. Always apply transposition.
  9790. @item portrait
  9791. Preserve portrait geometry (when @var{height} >= @var{width}).
  9792. @item landscape
  9793. Preserve landscape geometry (when @var{width} >= @var{height}).
  9794. @end table
  9795. Default value is @code{none}.
  9796. @end table
  9797. For example to rotate by 90 degrees clockwise and preserve portrait
  9798. layout:
  9799. @example
  9800. transpose=dir=1:passthrough=portrait
  9801. @end example
  9802. The command above can also be specified as:
  9803. @example
  9804. transpose=1:portrait
  9805. @end example
  9806. @section trim
  9807. Trim the input so that the output contains one continuous subpart of the input.
  9808. It accepts the following parameters:
  9809. @table @option
  9810. @item start
  9811. Specify the time of the start of the kept section, i.e. the frame with the
  9812. timestamp @var{start} will be the first frame in the output.
  9813. @item end
  9814. Specify the time of the first frame that will be dropped, i.e. the frame
  9815. immediately preceding the one with the timestamp @var{end} will be the last
  9816. frame in the output.
  9817. @item start_pts
  9818. This is the same as @var{start}, except this option sets the start timestamp
  9819. in timebase units instead of seconds.
  9820. @item end_pts
  9821. This is the same as @var{end}, except this option sets the end timestamp
  9822. in timebase units instead of seconds.
  9823. @item duration
  9824. The maximum duration of the output in seconds.
  9825. @item start_frame
  9826. The number of the first frame that should be passed to the output.
  9827. @item end_frame
  9828. The number of the first frame that should be dropped.
  9829. @end table
  9830. @option{start}, @option{end}, and @option{duration} are expressed as time
  9831. duration specifications; see
  9832. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  9833. for the accepted syntax.
  9834. Note that the first two sets of the start/end options and the @option{duration}
  9835. option look at the frame timestamp, while the _frame variants simply count the
  9836. frames that pass through the filter. Also note that this filter does not modify
  9837. the timestamps. If you wish for the output timestamps to start at zero, insert a
  9838. setpts filter after the trim filter.
  9839. If multiple start or end options are set, this filter tries to be greedy and
  9840. keep all the frames that match at least one of the specified constraints. To keep
  9841. only the part that matches all the constraints at once, chain multiple trim
  9842. filters.
  9843. The defaults are such that all the input is kept. So it is possible to set e.g.
  9844. just the end values to keep everything before the specified time.
  9845. Examples:
  9846. @itemize
  9847. @item
  9848. Drop everything except the second minute of input:
  9849. @example
  9850. ffmpeg -i INPUT -vf trim=60:120
  9851. @end example
  9852. @item
  9853. Keep only the first second:
  9854. @example
  9855. ffmpeg -i INPUT -vf trim=duration=1
  9856. @end example
  9857. @end itemize
  9858. @anchor{unsharp}
  9859. @section unsharp
  9860. Sharpen or blur the input video.
  9861. It accepts the following parameters:
  9862. @table @option
  9863. @item luma_msize_x, lx
  9864. Set the luma matrix horizontal size. It must be an odd integer between
  9865. 3 and 63. The default value is 5.
  9866. @item luma_msize_y, ly
  9867. Set the luma matrix vertical size. It must be an odd integer between 3
  9868. and 63. The default value is 5.
  9869. @item luma_amount, la
  9870. Set the luma effect strength. It must be a floating point number, reasonable
  9871. values lay between -1.5 and 1.5.
  9872. Negative values will blur the input video, while positive values will
  9873. sharpen it, a value of zero will disable the effect.
  9874. Default value is 1.0.
  9875. @item chroma_msize_x, cx
  9876. Set the chroma matrix horizontal size. It must be an odd integer
  9877. between 3 and 63. The default value is 5.
  9878. @item chroma_msize_y, cy
  9879. Set the chroma matrix vertical size. It must be an odd integer
  9880. between 3 and 63. The default value is 5.
  9881. @item chroma_amount, ca
  9882. Set the chroma effect strength. It must be a floating point number, reasonable
  9883. values lay between -1.5 and 1.5.
  9884. Negative values will blur the input video, while positive values will
  9885. sharpen it, a value of zero will disable the effect.
  9886. Default value is 0.0.
  9887. @item opencl
  9888. If set to 1, specify using OpenCL capabilities, only available if
  9889. FFmpeg was configured with @code{--enable-opencl}. Default value is 0.
  9890. @end table
  9891. All parameters are optional and default to the equivalent of the
  9892. string '5:5:1.0:5:5:0.0'.
  9893. @subsection Examples
  9894. @itemize
  9895. @item
  9896. Apply strong luma sharpen effect:
  9897. @example
  9898. unsharp=luma_msize_x=7:luma_msize_y=7:luma_amount=2.5
  9899. @end example
  9900. @item
  9901. Apply a strong blur of both luma and chroma parameters:
  9902. @example
  9903. unsharp=7:7:-2:7:7:-2
  9904. @end example
  9905. @end itemize
  9906. @section uspp
  9907. Apply ultra slow/simple postprocessing filter that compresses and decompresses
  9908. the image at several (or - in the case of @option{quality} level @code{8} - all)
  9909. shifts and average the results.
  9910. The way this differs from the behavior of spp is that uspp actually encodes &
  9911. decodes each case with libavcodec Snow, whereas spp uses a simplified intra only 8x8
  9912. DCT similar to MJPEG.
  9913. The filter accepts the following options:
  9914. @table @option
  9915. @item quality
  9916. Set quality. This option defines the number of levels for averaging. It accepts
  9917. an integer in the range 0-8. If set to @code{0}, the filter will have no
  9918. effect. A value of @code{8} means the higher quality. For each increment of
  9919. that value the speed drops by a factor of approximately 2. Default value is
  9920. @code{3}.
  9921. @item qp
  9922. Force a constant quantization parameter. If not set, the filter will use the QP
  9923. from the video stream (if available).
  9924. @end table
  9925. @section vectorscope
  9926. Display 2 color component values in the two dimensional graph (which is called
  9927. a vectorscope).
  9928. This filter accepts the following options:
  9929. @table @option
  9930. @item mode, m
  9931. Set vectorscope mode.
  9932. It accepts the following values:
  9933. @table @samp
  9934. @item gray
  9935. Gray values are displayed on graph, higher brightness means more pixels have
  9936. same component color value on location in graph. This is the default mode.
  9937. @item color
  9938. Gray values are displayed on graph. Surrounding pixels values which are not
  9939. present in video frame are drawn in gradient of 2 color components which are
  9940. set by option @code{x} and @code{y}. The 3rd color component is static.
  9941. @item color2
  9942. Actual color components values present in video frame are displayed on graph.
  9943. @item color3
  9944. Similar as color2 but higher frequency of same values @code{x} and @code{y}
  9945. on graph increases value of another color component, which is luminance by
  9946. default values of @code{x} and @code{y}.
  9947. @item color4
  9948. Actual colors present in video frame are displayed on graph. If two different
  9949. colors map to same position on graph then color with higher value of component
  9950. not present in graph is picked.
  9951. @item color5
  9952. Gray values are displayed on graph. Similar to @code{color} but with 3rd color
  9953. component picked from radial gradient.
  9954. @end table
  9955. @item x
  9956. Set which color component will be represented on X-axis. Default is @code{1}.
  9957. @item y
  9958. Set which color component will be represented on Y-axis. Default is @code{2}.
  9959. @item intensity, i
  9960. Set intensity, used by modes: gray, color, color3 and color5 for increasing brightness
  9961. of color component which represents frequency of (X, Y) location in graph.
  9962. @item envelope, e
  9963. @table @samp
  9964. @item none
  9965. No envelope, this is default.
  9966. @item instant
  9967. Instant envelope, even darkest single pixel will be clearly highlighted.
  9968. @item peak
  9969. Hold maximum and minimum values presented in graph over time. This way you
  9970. can still spot out of range values without constantly looking at vectorscope.
  9971. @item peak+instant
  9972. Peak and instant envelope combined together.
  9973. @end table
  9974. @item graticule, g
  9975. Set what kind of graticule to draw.
  9976. @table @samp
  9977. @item none
  9978. @item green
  9979. @item color
  9980. @end table
  9981. @item opacity, o
  9982. Set graticule opacity.
  9983. @item flags, f
  9984. Set graticule flags.
  9985. @table @samp
  9986. @item white
  9987. Draw graticule for white point.
  9988. @item black
  9989. Draw graticule for black point.
  9990. @item name
  9991. Draw color points short names.
  9992. @end table
  9993. @item bgopacity, b
  9994. Set background opacity.
  9995. @item lthreshold, l
  9996. Set low threshold for color component not represented on X or Y axis.
  9997. Values lower than this value will be ignored. Default is 0.
  9998. Note this value is multiplied with actual max possible value one pixel component
  9999. can have. So for 8-bit input and low threshold value of 0.1 actual threshold
  10000. is 0.1 * 255 = 25.
  10001. @item hthreshold, h
  10002. Set high threshold for color component not represented on X or Y axis.
  10003. Values higher than this value will be ignored. Default is 1.
  10004. Note this value is multiplied with actual max possible value one pixel component
  10005. can have. So for 8-bit input and high threshold value of 0.9 actual threshold
  10006. is 0.9 * 255 = 230.
  10007. @item colorspace, c
  10008. Set what kind of colorspace to use when drawing graticule.
  10009. @table @samp
  10010. @item auto
  10011. @item 601
  10012. @item 709
  10013. @end table
  10014. Default is auto.
  10015. @end table
  10016. @anchor{vidstabdetect}
  10017. @section vidstabdetect
  10018. Analyze video stabilization/deshaking. Perform pass 1 of 2, see
  10019. @ref{vidstabtransform} for pass 2.
  10020. This filter generates a file with relative translation and rotation
  10021. transform information about subsequent frames, which is then used by
  10022. the @ref{vidstabtransform} filter.
  10023. To enable compilation of this filter you need to configure FFmpeg with
  10024. @code{--enable-libvidstab}.
  10025. This filter accepts the following options:
  10026. @table @option
  10027. @item result
  10028. Set the path to the file used to write the transforms information.
  10029. Default value is @file{transforms.trf}.
  10030. @item shakiness
  10031. Set how shaky the video is and how quick the camera is. It accepts an
  10032. integer in the range 1-10, a value of 1 means little shakiness, a
  10033. value of 10 means strong shakiness. Default value is 5.
  10034. @item accuracy
  10035. Set the accuracy of the detection process. It must be a value in the
  10036. range 1-15. A value of 1 means low accuracy, a value of 15 means high
  10037. accuracy. Default value is 15.
  10038. @item stepsize
  10039. Set stepsize of the search process. The region around minimum is
  10040. scanned with 1 pixel resolution. Default value is 6.
  10041. @item mincontrast
  10042. Set minimum contrast. Below this value a local measurement field is
  10043. discarded. Must be a floating point value in the range 0-1. Default
  10044. value is 0.3.
  10045. @item tripod
  10046. Set reference frame number for tripod mode.
  10047. If enabled, the motion of the frames is compared to a reference frame
  10048. in the filtered stream, identified by the specified number. The idea
  10049. is to compensate all movements in a more-or-less static scene and keep
  10050. the camera view absolutely still.
  10051. If set to 0, it is disabled. The frames are counted starting from 1.
  10052. @item show
  10053. Show fields and transforms in the resulting frames. It accepts an
  10054. integer in the range 0-2. Default value is 0, which disables any
  10055. visualization.
  10056. @end table
  10057. @subsection Examples
  10058. @itemize
  10059. @item
  10060. Use default values:
  10061. @example
  10062. vidstabdetect
  10063. @end example
  10064. @item
  10065. Analyze strongly shaky movie and put the results in file
  10066. @file{mytransforms.trf}:
  10067. @example
  10068. vidstabdetect=shakiness=10:accuracy=15:result="mytransforms.trf"
  10069. @end example
  10070. @item
  10071. Visualize the result of internal transformations in the resulting
  10072. video:
  10073. @example
  10074. vidstabdetect=show=1
  10075. @end example
  10076. @item
  10077. Analyze a video with medium shakiness using @command{ffmpeg}:
  10078. @example
  10079. ffmpeg -i input -vf vidstabdetect=shakiness=5:show=1 dummy.avi
  10080. @end example
  10081. @end itemize
  10082. @anchor{vidstabtransform}
  10083. @section vidstabtransform
  10084. Video stabilization/deshaking: pass 2 of 2,
  10085. see @ref{vidstabdetect} for pass 1.
  10086. Read a file with transform information for each frame and
  10087. apply/compensate them. Together with the @ref{vidstabdetect}
  10088. filter this can be used to deshake videos. See also
  10089. @url{http://public.hronopik.de/vid.stab}. It is important to also use
  10090. the @ref{unsharp} filter, see below.
  10091. To enable compilation of this filter you need to configure FFmpeg with
  10092. @code{--enable-libvidstab}.
  10093. @subsection Options
  10094. @table @option
  10095. @item input
  10096. Set path to the file used to read the transforms. Default value is
  10097. @file{transforms.trf}.
  10098. @item smoothing
  10099. Set the number of frames (value*2 + 1) used for lowpass filtering the
  10100. camera movements. Default value is 10.
  10101. For example a number of 10 means that 21 frames are used (10 in the
  10102. past and 10 in the future) to smoothen the motion in the video. A
  10103. larger value leads to a smoother video, but limits the acceleration of
  10104. the camera (pan/tilt movements). 0 is a special case where a static
  10105. camera is simulated.
  10106. @item optalgo
  10107. Set the camera path optimization algorithm.
  10108. Accepted values are:
  10109. @table @samp
  10110. @item gauss
  10111. gaussian kernel low-pass filter on camera motion (default)
  10112. @item avg
  10113. averaging on transformations
  10114. @end table
  10115. @item maxshift
  10116. Set maximal number of pixels to translate frames. Default value is -1,
  10117. meaning no limit.
  10118. @item maxangle
  10119. Set maximal angle in radians (degree*PI/180) to rotate frames. Default
  10120. value is -1, meaning no limit.
  10121. @item crop
  10122. Specify how to deal with borders that may be visible due to movement
  10123. compensation.
  10124. Available values are:
  10125. @table @samp
  10126. @item keep
  10127. keep image information from previous frame (default)
  10128. @item black
  10129. fill the border black
  10130. @end table
  10131. @item invert
  10132. Invert transforms if set to 1. Default value is 0.
  10133. @item relative
  10134. Consider transforms as relative to previous frame if set to 1,
  10135. absolute if set to 0. Default value is 0.
  10136. @item zoom
  10137. Set percentage to zoom. A positive value will result in a zoom-in
  10138. effect, a negative value in a zoom-out effect. Default value is 0 (no
  10139. zoom).
  10140. @item optzoom
  10141. Set optimal zooming to avoid borders.
  10142. Accepted values are:
  10143. @table @samp
  10144. @item 0
  10145. disabled
  10146. @item 1
  10147. optimal static zoom value is determined (only very strong movements
  10148. will lead to visible borders) (default)
  10149. @item 2
  10150. optimal adaptive zoom value is determined (no borders will be
  10151. visible), see @option{zoomspeed}
  10152. @end table
  10153. Note that the value given at zoom is added to the one calculated here.
  10154. @item zoomspeed
  10155. Set percent to zoom maximally each frame (enabled when
  10156. @option{optzoom} is set to 2). Range is from 0 to 5, default value is
  10157. 0.25.
  10158. @item interpol
  10159. Specify type of interpolation.
  10160. Available values are:
  10161. @table @samp
  10162. @item no
  10163. no interpolation
  10164. @item linear
  10165. linear only horizontal
  10166. @item bilinear
  10167. linear in both directions (default)
  10168. @item bicubic
  10169. cubic in both directions (slow)
  10170. @end table
  10171. @item tripod
  10172. Enable virtual tripod mode if set to 1, which is equivalent to
  10173. @code{relative=0:smoothing=0}. Default value is 0.
  10174. Use also @code{tripod} option of @ref{vidstabdetect}.
  10175. @item debug
  10176. Increase log verbosity if set to 1. Also the detected global motions
  10177. are written to the temporary file @file{global_motions.trf}. Default
  10178. value is 0.
  10179. @end table
  10180. @subsection Examples
  10181. @itemize
  10182. @item
  10183. Use @command{ffmpeg} for a typical stabilization with default values:
  10184. @example
  10185. ffmpeg -i inp.mpeg -vf vidstabtransform,unsharp=5:5:0.8:3:3:0.4 inp_stabilized.mpeg
  10186. @end example
  10187. Note the use of the @ref{unsharp} filter which is always recommended.
  10188. @item
  10189. Zoom in a bit more and load transform data from a given file:
  10190. @example
  10191. vidstabtransform=zoom=5:input="mytransforms.trf"
  10192. @end example
  10193. @item
  10194. Smoothen the video even more:
  10195. @example
  10196. vidstabtransform=smoothing=30
  10197. @end example
  10198. @end itemize
  10199. @section vflip
  10200. Flip the input video vertically.
  10201. For example, to vertically flip a video with @command{ffmpeg}:
  10202. @example
  10203. ffmpeg -i in.avi -vf "vflip" out.avi
  10204. @end example
  10205. @anchor{vignette}
  10206. @section vignette
  10207. Make or reverse a natural vignetting effect.
  10208. The filter accepts the following options:
  10209. @table @option
  10210. @item angle, a
  10211. Set lens angle expression as a number of radians.
  10212. The value is clipped in the @code{[0,PI/2]} range.
  10213. Default value: @code{"PI/5"}
  10214. @item x0
  10215. @item y0
  10216. Set center coordinates expressions. Respectively @code{"w/2"} and @code{"h/2"}
  10217. by default.
  10218. @item mode
  10219. Set forward/backward mode.
  10220. Available modes are:
  10221. @table @samp
  10222. @item forward
  10223. The larger the distance from the central point, the darker the image becomes.
  10224. @item backward
  10225. The larger the distance from the central point, the brighter the image becomes.
  10226. This can be used to reverse a vignette effect, though there is no automatic
  10227. detection to extract the lens @option{angle} and other settings (yet). It can
  10228. also be used to create a burning effect.
  10229. @end table
  10230. Default value is @samp{forward}.
  10231. @item eval
  10232. Set evaluation mode for the expressions (@option{angle}, @option{x0}, @option{y0}).
  10233. It accepts the following values:
  10234. @table @samp
  10235. @item init
  10236. Evaluate expressions only once during the filter initialization.
  10237. @item frame
  10238. Evaluate expressions for each incoming frame. This is way slower than the
  10239. @samp{init} mode since it requires all the scalers to be re-computed, but it
  10240. allows advanced dynamic expressions.
  10241. @end table
  10242. Default value is @samp{init}.
  10243. @item dither
  10244. Set dithering to reduce the circular banding effects. Default is @code{1}
  10245. (enabled).
  10246. @item aspect
  10247. Set vignette aspect. This setting allows one to adjust the shape of the vignette.
  10248. Setting this value to the SAR of the input will make a rectangular vignetting
  10249. following the dimensions of the video.
  10250. Default is @code{1/1}.
  10251. @end table
  10252. @subsection Expressions
  10253. The @option{alpha}, @option{x0} and @option{y0} expressions can contain the
  10254. following parameters.
  10255. @table @option
  10256. @item w
  10257. @item h
  10258. input width and height
  10259. @item n
  10260. the number of input frame, starting from 0
  10261. @item pts
  10262. the PTS (Presentation TimeStamp) time of the filtered video frame, expressed in
  10263. @var{TB} units, NAN if undefined
  10264. @item r
  10265. frame rate of the input video, NAN if the input frame rate is unknown
  10266. @item t
  10267. the PTS (Presentation TimeStamp) of the filtered video frame,
  10268. expressed in seconds, NAN if undefined
  10269. @item tb
  10270. time base of the input video
  10271. @end table
  10272. @subsection Examples
  10273. @itemize
  10274. @item
  10275. Apply simple strong vignetting effect:
  10276. @example
  10277. vignette=PI/4
  10278. @end example
  10279. @item
  10280. Make a flickering vignetting:
  10281. @example
  10282. vignette='PI/4+random(1)*PI/50':eval=frame
  10283. @end example
  10284. @end itemize
  10285. @section vstack
  10286. Stack input videos vertically.
  10287. All streams must be of same pixel format and of same width.
  10288. Note that this filter is faster than using @ref{overlay} and @ref{pad} filter
  10289. to create same output.
  10290. The filter accept the following option:
  10291. @table @option
  10292. @item inputs
  10293. Set number of input streams. Default is 2.
  10294. @item shortest
  10295. If set to 1, force the output to terminate when the shortest input
  10296. terminates. Default value is 0.
  10297. @end table
  10298. @section w3fdif
  10299. Deinterlace the input video ("w3fdif" stands for "Weston 3 Field
  10300. Deinterlacing Filter").
  10301. Based on the process described by Martin Weston for BBC R&D, and
  10302. implemented based on the de-interlace algorithm written by Jim
  10303. Easterbrook for BBC R&D, the Weston 3 field deinterlacing filter
  10304. uses filter coefficients calculated by BBC R&D.
  10305. There are two sets of filter coefficients, so called "simple":
  10306. and "complex". Which set of filter coefficients is used can
  10307. be set by passing an optional parameter:
  10308. @table @option
  10309. @item filter
  10310. Set the interlacing filter coefficients. Accepts one of the following values:
  10311. @table @samp
  10312. @item simple
  10313. Simple filter coefficient set.
  10314. @item complex
  10315. More-complex filter coefficient set.
  10316. @end table
  10317. Default value is @samp{complex}.
  10318. @item deint
  10319. Specify which frames to deinterlace. Accept one of the following values:
  10320. @table @samp
  10321. @item all
  10322. Deinterlace all frames,
  10323. @item interlaced
  10324. Only deinterlace frames marked as interlaced.
  10325. @end table
  10326. Default value is @samp{all}.
  10327. @end table
  10328. @section waveform
  10329. Video waveform monitor.
  10330. The waveform monitor plots color component intensity. By default luminance
  10331. only. Each column of the waveform corresponds to a column of pixels in the
  10332. source video.
  10333. It accepts the following options:
  10334. @table @option
  10335. @item mode, m
  10336. Can be either @code{row}, or @code{column}. Default is @code{column}.
  10337. In row mode, the graph on the left side represents color component value 0 and
  10338. the right side represents value = 255. In column mode, the top side represents
  10339. color component value = 0 and bottom side represents value = 255.
  10340. @item intensity, i
  10341. Set intensity. Smaller values are useful to find out how many values of the same
  10342. luminance are distributed across input rows/columns.
  10343. Default value is @code{0.04}. Allowed range is [0, 1].
  10344. @item mirror, r
  10345. Set mirroring mode. @code{0} means unmirrored, @code{1} means mirrored.
  10346. In mirrored mode, higher values will be represented on the left
  10347. side for @code{row} mode and at the top for @code{column} mode. Default is
  10348. @code{1} (mirrored).
  10349. @item display, d
  10350. Set display mode.
  10351. It accepts the following values:
  10352. @table @samp
  10353. @item overlay
  10354. Presents information identical to that in the @code{parade}, except
  10355. that the graphs representing color components are superimposed directly
  10356. over one another.
  10357. This display mode makes it easier to spot relative differences or similarities
  10358. in overlapping areas of the color components that are supposed to be identical,
  10359. such as neutral whites, grays, or blacks.
  10360. @item stack
  10361. Display separate graph for the color components side by side in
  10362. @code{row} mode or one below the other in @code{column} mode.
  10363. @item parade
  10364. Display separate graph for the color components side by side in
  10365. @code{column} mode or one below the other in @code{row} mode.
  10366. Using this display mode makes it easy to spot color casts in the highlights
  10367. and shadows of an image, by comparing the contours of the top and the bottom
  10368. graphs of each waveform. Since whites, grays, and blacks are characterized
  10369. by exactly equal amounts of red, green, and blue, neutral areas of the picture
  10370. should display three waveforms of roughly equal width/height. If not, the
  10371. correction is easy to perform by making level adjustments the three waveforms.
  10372. @end table
  10373. Default is @code{stack}.
  10374. @item components, c
  10375. Set which color components to display. Default is 1, which means only luminance
  10376. or red color component if input is in RGB colorspace. If is set for example to
  10377. 7 it will display all 3 (if) available color components.
  10378. @item envelope, e
  10379. @table @samp
  10380. @item none
  10381. No envelope, this is default.
  10382. @item instant
  10383. Instant envelope, minimum and maximum values presented in graph will be easily
  10384. visible even with small @code{step} value.
  10385. @item peak
  10386. Hold minimum and maximum values presented in graph across time. This way you
  10387. can still spot out of range values without constantly looking at waveforms.
  10388. @item peak+instant
  10389. Peak and instant envelope combined together.
  10390. @end table
  10391. @item filter, f
  10392. @table @samp
  10393. @item lowpass
  10394. No filtering, this is default.
  10395. @item flat
  10396. Luma and chroma combined together.
  10397. @item aflat
  10398. Similar as above, but shows difference between blue and red chroma.
  10399. @item chroma
  10400. Displays only chroma.
  10401. @item color
  10402. Displays actual color value on waveform.
  10403. @item acolor
  10404. Similar as above, but with luma showing frequency of chroma values.
  10405. @end table
  10406. @item graticule, g
  10407. Set which graticule to display.
  10408. @table @samp
  10409. @item none
  10410. Do not display graticule.
  10411. @item green
  10412. Display green graticule showing legal broadcast ranges.
  10413. @end table
  10414. @item opacity, o
  10415. Set graticule opacity.
  10416. @item flags, fl
  10417. Set graticule flags.
  10418. @table @samp
  10419. @item numbers
  10420. Draw numbers above lines. By default enabled.
  10421. @item dots
  10422. Draw dots instead of lines.
  10423. @end table
  10424. @item scale, s
  10425. Set scale used for displaying graticule.
  10426. @table @samp
  10427. @item digital
  10428. @item millivolts
  10429. @item ire
  10430. @end table
  10431. Default is digital.
  10432. @end table
  10433. @section xbr
  10434. Apply the xBR high-quality magnification filter which is designed for pixel
  10435. art. It follows a set of edge-detection rules, see
  10436. @url{http://www.libretro.com/forums/viewtopic.php?f=6&t=134}.
  10437. It accepts the following option:
  10438. @table @option
  10439. @item n
  10440. Set the scaling dimension: @code{2} for @code{2xBR}, @code{3} for
  10441. @code{3xBR} and @code{4} for @code{4xBR}.
  10442. Default is @code{3}.
  10443. @end table
  10444. @anchor{yadif}
  10445. @section yadif
  10446. Deinterlace the input video ("yadif" means "yet another deinterlacing
  10447. filter").
  10448. It accepts the following parameters:
  10449. @table @option
  10450. @item mode
  10451. The interlacing mode to adopt. It accepts one of the following values:
  10452. @table @option
  10453. @item 0, send_frame
  10454. Output one frame for each frame.
  10455. @item 1, send_field
  10456. Output one frame for each field.
  10457. @item 2, send_frame_nospatial
  10458. Like @code{send_frame}, but it skips the spatial interlacing check.
  10459. @item 3, send_field_nospatial
  10460. Like @code{send_field}, but it skips the spatial interlacing check.
  10461. @end table
  10462. The default value is @code{send_frame}.
  10463. @item parity
  10464. The picture field parity assumed for the input interlaced video. It accepts one
  10465. of the following values:
  10466. @table @option
  10467. @item 0, tff
  10468. Assume the top field is first.
  10469. @item 1, bff
  10470. Assume the bottom field is first.
  10471. @item -1, auto
  10472. Enable automatic detection of field parity.
  10473. @end table
  10474. The default value is @code{auto}.
  10475. If the interlacing is unknown or the decoder does not export this information,
  10476. top field first will be assumed.
  10477. @item deint
  10478. Specify which frames to deinterlace. Accept one of the following
  10479. values:
  10480. @table @option
  10481. @item 0, all
  10482. Deinterlace all frames.
  10483. @item 1, interlaced
  10484. Only deinterlace frames marked as interlaced.
  10485. @end table
  10486. The default value is @code{all}.
  10487. @end table
  10488. @section zoompan
  10489. Apply Zoom & Pan effect.
  10490. This filter accepts the following options:
  10491. @table @option
  10492. @item zoom, z
  10493. Set the zoom expression. Default is 1.
  10494. @item x
  10495. @item y
  10496. Set the x and y expression. Default is 0.
  10497. @item d
  10498. Set the duration expression in number of frames.
  10499. This sets for how many number of frames effect will last for
  10500. single input image.
  10501. @item s
  10502. Set the output image size, default is 'hd720'.
  10503. @item fps
  10504. Set the output frame rate, default is '25'.
  10505. @end table
  10506. Each expression can contain the following constants:
  10507. @table @option
  10508. @item in_w, iw
  10509. Input width.
  10510. @item in_h, ih
  10511. Input height.
  10512. @item out_w, ow
  10513. Output width.
  10514. @item out_h, oh
  10515. Output height.
  10516. @item in
  10517. Input frame count.
  10518. @item on
  10519. Output frame count.
  10520. @item x
  10521. @item y
  10522. Last calculated 'x' and 'y' position from 'x' and 'y' expression
  10523. for current input frame.
  10524. @item px
  10525. @item py
  10526. 'x' and 'y' of last output frame of previous input frame or 0 when there was
  10527. not yet such frame (first input frame).
  10528. @item zoom
  10529. Last calculated zoom from 'z' expression for current input frame.
  10530. @item pzoom
  10531. Last calculated zoom of last output frame of previous input frame.
  10532. @item duration
  10533. Number of output frames for current input frame. Calculated from 'd' expression
  10534. for each input frame.
  10535. @item pduration
  10536. number of output frames created for previous input frame
  10537. @item a
  10538. Rational number: input width / input height
  10539. @item sar
  10540. sample aspect ratio
  10541. @item dar
  10542. display aspect ratio
  10543. @end table
  10544. @subsection Examples
  10545. @itemize
  10546. @item
  10547. Zoom-in up to 1.5 and pan at same time to some spot near center of picture:
  10548. @example
  10549. zoompan=z='min(zoom+0.0015,1.5)':d=700:x='if(gte(zoom,1.5),x,x+1/a)':y='if(gte(zoom,1.5),y,y+1)':s=640x360
  10550. @end example
  10551. @item
  10552. Zoom-in up to 1.5 and pan always at center of picture:
  10553. @example
  10554. zoompan=z='min(zoom+0.0015,1.5)':d=700:x='iw/2-(iw/zoom/2)':y='ih/2-(ih/zoom/2)'
  10555. @end example
  10556. @end itemize
  10557. @section zscale
  10558. Scale (resize) the input video, using the z.lib library:
  10559. https://github.com/sekrit-twc/zimg.
  10560. The zscale filter forces the output display aspect ratio to be the same
  10561. as the input, by changing the output sample aspect ratio.
  10562. If the input image format is different from the format requested by
  10563. the next filter, the zscale filter will convert the input to the
  10564. requested format.
  10565. @subsection Options
  10566. The filter accepts the following options.
  10567. @table @option
  10568. @item width, w
  10569. @item height, h
  10570. Set the output video dimension expression. Default value is the input
  10571. dimension.
  10572. If the @var{width} or @var{w} is 0, the input width is used for the output.
  10573. If the @var{height} or @var{h} is 0, the input height is used for the output.
  10574. If one of the values is -1, the zscale filter will use a value that
  10575. maintains the aspect ratio of the input image, calculated from the
  10576. other specified dimension. If both of them are -1, the input size is
  10577. used
  10578. If one of the values is -n with n > 1, the zscale filter will also use a value
  10579. that maintains the aspect ratio of the input image, calculated from the other
  10580. specified dimension. After that it will, however, make sure that the calculated
  10581. dimension is divisible by n and adjust the value if necessary.
  10582. See below for the list of accepted constants for use in the dimension
  10583. expression.
  10584. @item size, s
  10585. Set the video size. For the syntax of this option, check the
  10586. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  10587. @item dither, d
  10588. Set the dither type.
  10589. Possible values are:
  10590. @table @var
  10591. @item none
  10592. @item ordered
  10593. @item random
  10594. @item error_diffusion
  10595. @end table
  10596. Default is none.
  10597. @item filter, f
  10598. Set the resize filter type.
  10599. Possible values are:
  10600. @table @var
  10601. @item point
  10602. @item bilinear
  10603. @item bicubic
  10604. @item spline16
  10605. @item spline36
  10606. @item lanczos
  10607. @end table
  10608. Default is bilinear.
  10609. @item range, r
  10610. Set the color range.
  10611. Possible values are:
  10612. @table @var
  10613. @item input
  10614. @item limited
  10615. @item full
  10616. @end table
  10617. Default is same as input.
  10618. @item primaries, p
  10619. Set the color primaries.
  10620. Possible values are:
  10621. @table @var
  10622. @item input
  10623. @item 709
  10624. @item unspecified
  10625. @item 170m
  10626. @item 240m
  10627. @item 2020
  10628. @end table
  10629. Default is same as input.
  10630. @item transfer, t
  10631. Set the transfer characteristics.
  10632. Possible values are:
  10633. @table @var
  10634. @item input
  10635. @item 709
  10636. @item unspecified
  10637. @item 601
  10638. @item linear
  10639. @item 2020_10
  10640. @item 2020_12
  10641. @end table
  10642. Default is same as input.
  10643. @item matrix, m
  10644. Set the colorspace matrix.
  10645. Possible value are:
  10646. @table @var
  10647. @item input
  10648. @item 709
  10649. @item unspecified
  10650. @item 470bg
  10651. @item 170m
  10652. @item 2020_ncl
  10653. @item 2020_cl
  10654. @end table
  10655. Default is same as input.
  10656. @item rangein, rin
  10657. Set the input color range.
  10658. Possible values are:
  10659. @table @var
  10660. @item input
  10661. @item limited
  10662. @item full
  10663. @end table
  10664. Default is same as input.
  10665. @item primariesin, pin
  10666. Set the input color primaries.
  10667. Possible values are:
  10668. @table @var
  10669. @item input
  10670. @item 709
  10671. @item unspecified
  10672. @item 170m
  10673. @item 240m
  10674. @item 2020
  10675. @end table
  10676. Default is same as input.
  10677. @item transferin, tin
  10678. Set the input transfer characteristics.
  10679. Possible values are:
  10680. @table @var
  10681. @item input
  10682. @item 709
  10683. @item unspecified
  10684. @item 601
  10685. @item linear
  10686. @item 2020_10
  10687. @item 2020_12
  10688. @end table
  10689. Default is same as input.
  10690. @item matrixin, min
  10691. Set the input colorspace matrix.
  10692. Possible value are:
  10693. @table @var
  10694. @item input
  10695. @item 709
  10696. @item unspecified
  10697. @item 470bg
  10698. @item 170m
  10699. @item 2020_ncl
  10700. @item 2020_cl
  10701. @end table
  10702. @end table
  10703. The values of the @option{w} and @option{h} options are expressions
  10704. containing the following constants:
  10705. @table @var
  10706. @item in_w
  10707. @item in_h
  10708. The input width and height
  10709. @item iw
  10710. @item ih
  10711. These are the same as @var{in_w} and @var{in_h}.
  10712. @item out_w
  10713. @item out_h
  10714. The output (scaled) width and height
  10715. @item ow
  10716. @item oh
  10717. These are the same as @var{out_w} and @var{out_h}
  10718. @item a
  10719. The same as @var{iw} / @var{ih}
  10720. @item sar
  10721. input sample aspect ratio
  10722. @item dar
  10723. The input display aspect ratio. Calculated from @code{(iw / ih) * sar}.
  10724. @item hsub
  10725. @item vsub
  10726. horizontal and vertical input chroma subsample values. For example for the
  10727. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  10728. @item ohsub
  10729. @item ovsub
  10730. horizontal and vertical output chroma subsample values. For example for the
  10731. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  10732. @end table
  10733. @table @option
  10734. @end table
  10735. @c man end VIDEO FILTERS
  10736. @chapter Video Sources
  10737. @c man begin VIDEO SOURCES
  10738. Below is a description of the currently available video sources.
  10739. @section buffer
  10740. Buffer video frames, and make them available to the filter chain.
  10741. This source is mainly intended for a programmatic use, in particular
  10742. through the interface defined in @file{libavfilter/vsrc_buffer.h}.
  10743. It accepts the following parameters:
  10744. @table @option
  10745. @item video_size
  10746. Specify the size (width and height) of the buffered video frames. For the
  10747. syntax of this option, check the
  10748. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  10749. @item width
  10750. The input video width.
  10751. @item height
  10752. The input video height.
  10753. @item pix_fmt
  10754. A string representing the pixel format of the buffered video frames.
  10755. It may be a number corresponding to a pixel format, or a pixel format
  10756. name.
  10757. @item time_base
  10758. Specify the timebase assumed by the timestamps of the buffered frames.
  10759. @item frame_rate
  10760. Specify the frame rate expected for the video stream.
  10761. @item pixel_aspect, sar
  10762. The sample (pixel) aspect ratio of the input video.
  10763. @item sws_param
  10764. Specify the optional parameters to be used for the scale filter which
  10765. is automatically inserted when an input change is detected in the
  10766. input size or format.
  10767. @item hw_frames_ctx
  10768. When using a hardware pixel format, this should be a reference to an
  10769. AVHWFramesContext describing input frames.
  10770. @end table
  10771. For example:
  10772. @example
  10773. buffer=width=320:height=240:pix_fmt=yuv410p:time_base=1/24:sar=1
  10774. @end example
  10775. will instruct the source to accept video frames with size 320x240 and
  10776. with format "yuv410p", assuming 1/24 as the timestamps timebase and
  10777. square pixels (1:1 sample aspect ratio).
  10778. Since the pixel format with name "yuv410p" corresponds to the number 6
  10779. (check the enum AVPixelFormat definition in @file{libavutil/pixfmt.h}),
  10780. this example corresponds to:
  10781. @example
  10782. buffer=size=320x240:pixfmt=6:time_base=1/24:pixel_aspect=1/1
  10783. @end example
  10784. Alternatively, the options can be specified as a flat string, but this
  10785. syntax is deprecated:
  10786. @var{width}:@var{height}:@var{pix_fmt}:@var{time_base.num}:@var{time_base.den}:@var{pixel_aspect.num}:@var{pixel_aspect.den}[:@var{sws_param}]
  10787. @section cellauto
  10788. Create a pattern generated by an elementary cellular automaton.
  10789. The initial state of the cellular automaton can be defined through the
  10790. @option{filename}, and @option{pattern} options. If such options are
  10791. not specified an initial state is created randomly.
  10792. At each new frame a new row in the video is filled with the result of
  10793. the cellular automaton next generation. The behavior when the whole
  10794. frame is filled is defined by the @option{scroll} option.
  10795. This source accepts the following options:
  10796. @table @option
  10797. @item filename, f
  10798. Read the initial cellular automaton state, i.e. the starting row, from
  10799. the specified file.
  10800. In the file, each non-whitespace character is considered an alive
  10801. cell, a newline will terminate the row, and further characters in the
  10802. file will be ignored.
  10803. @item pattern, p
  10804. Read the initial cellular automaton state, i.e. the starting row, from
  10805. the specified string.
  10806. Each non-whitespace character in the string is considered an alive
  10807. cell, a newline will terminate the row, and further characters in the
  10808. string will be ignored.
  10809. @item rate, r
  10810. Set the video rate, that is the number of frames generated per second.
  10811. Default is 25.
  10812. @item random_fill_ratio, ratio
  10813. Set the random fill ratio for the initial cellular automaton row. It
  10814. is a floating point number value ranging from 0 to 1, defaults to
  10815. 1/PHI.
  10816. This option is ignored when a file or a pattern is specified.
  10817. @item random_seed, seed
  10818. Set the seed for filling randomly the initial row, must be an integer
  10819. included between 0 and UINT32_MAX. If not specified, or if explicitly
  10820. set to -1, the filter will try to use a good random seed on a best
  10821. effort basis.
  10822. @item rule
  10823. Set the cellular automaton rule, it is a number ranging from 0 to 255.
  10824. Default value is 110.
  10825. @item size, s
  10826. Set the size of the output video. For the syntax of this option, check the
  10827. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  10828. If @option{filename} or @option{pattern} is specified, the size is set
  10829. by default to the width of the specified initial state row, and the
  10830. height is set to @var{width} * PHI.
  10831. If @option{size} is set, it must contain the width of the specified
  10832. pattern string, and the specified pattern will be centered in the
  10833. larger row.
  10834. If a filename or a pattern string is not specified, the size value
  10835. defaults to "320x518" (used for a randomly generated initial state).
  10836. @item scroll
  10837. If set to 1, scroll the output upward when all the rows in the output
  10838. have been already filled. If set to 0, the new generated row will be
  10839. written over the top row just after the bottom row is filled.
  10840. Defaults to 1.
  10841. @item start_full, full
  10842. If set to 1, completely fill the output with generated rows before
  10843. outputting the first frame.
  10844. This is the default behavior, for disabling set the value to 0.
  10845. @item stitch
  10846. If set to 1, stitch the left and right row edges together.
  10847. This is the default behavior, for disabling set the value to 0.
  10848. @end table
  10849. @subsection Examples
  10850. @itemize
  10851. @item
  10852. Read the initial state from @file{pattern}, and specify an output of
  10853. size 200x400.
  10854. @example
  10855. cellauto=f=pattern:s=200x400
  10856. @end example
  10857. @item
  10858. Generate a random initial row with a width of 200 cells, with a fill
  10859. ratio of 2/3:
  10860. @example
  10861. cellauto=ratio=2/3:s=200x200
  10862. @end example
  10863. @item
  10864. Create a pattern generated by rule 18 starting by a single alive cell
  10865. centered on an initial row with width 100:
  10866. @example
  10867. cellauto=p=@@:s=100x400:full=0:rule=18
  10868. @end example
  10869. @item
  10870. Specify a more elaborated initial pattern:
  10871. @example
  10872. cellauto=p='@@@@ @@ @@@@':s=100x400:full=0:rule=18
  10873. @end example
  10874. @end itemize
  10875. @anchor{coreimagesrc}
  10876. @section coreimagesrc
  10877. Video source generated on GPU using Apple's CoreImage API on OSX.
  10878. This video source is a specialized version of the @ref{coreimage} video filter.
  10879. Use a core image generator at the beginning of the applied filterchain to
  10880. generate the content.
  10881. The coreimagesrc video source accepts the following options:
  10882. @table @option
  10883. @item list_generators
  10884. List all available generators along with all their respective options as well as
  10885. possible minimum and maximum values along with the default values.
  10886. @example
  10887. list_generators=true
  10888. @end example
  10889. @item size, s
  10890. Specify the size of the sourced video. For the syntax of this option, check the
  10891. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  10892. The default value is @code{320x240}.
  10893. @item rate, r
  10894. Specify the frame rate of the sourced video, as the number of frames
  10895. generated per second. It has to be a string in the format
  10896. @var{frame_rate_num}/@var{frame_rate_den}, an integer number, a floating point
  10897. number or a valid video frame rate abbreviation. The default value is
  10898. "25".
  10899. @item sar
  10900. Set the sample aspect ratio of the sourced video.
  10901. @item duration, d
  10902. Set the duration of the sourced video. See
  10903. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  10904. for the accepted syntax.
  10905. If not specified, or the expressed duration is negative, the video is
  10906. supposed to be generated forever.
  10907. @end table
  10908. Additionally, all options of the @ref{coreimage} video filter are accepted.
  10909. A complete filterchain can be used for further processing of the
  10910. generated input without CPU-HOST transfer. See @ref{coreimage} documentation
  10911. and examples for details.
  10912. @subsection Examples
  10913. @itemize
  10914. @item
  10915. Use CIQRCodeGenerator to create a QR code for the FFmpeg homepage,
  10916. given as complete and escaped command-line for Apple's standard bash shell:
  10917. @example
  10918. ffmpeg -f lavfi -i coreimagesrc=s=100x100:filter=CIQRCodeGenerator@@inputMessage=https\\\\\://FFmpeg.org/@@inputCorrectionLevel=H -frames:v 1 QRCode.png
  10919. @end example
  10920. This example is equivalent to the QRCode example of @ref{coreimage} without the
  10921. need for a nullsrc video source.
  10922. @end itemize
  10923. @section mandelbrot
  10924. Generate a Mandelbrot set fractal, and progressively zoom towards the
  10925. point specified with @var{start_x} and @var{start_y}.
  10926. This source accepts the following options:
  10927. @table @option
  10928. @item end_pts
  10929. Set the terminal pts value. Default value is 400.
  10930. @item end_scale
  10931. Set the terminal scale value.
  10932. Must be a floating point value. Default value is 0.3.
  10933. @item inner
  10934. Set the inner coloring mode, that is the algorithm used to draw the
  10935. Mandelbrot fractal internal region.
  10936. It shall assume one of the following values:
  10937. @table @option
  10938. @item black
  10939. Set black mode.
  10940. @item convergence
  10941. Show time until convergence.
  10942. @item mincol
  10943. Set color based on point closest to the origin of the iterations.
  10944. @item period
  10945. Set period mode.
  10946. @end table
  10947. Default value is @var{mincol}.
  10948. @item bailout
  10949. Set the bailout value. Default value is 10.0.
  10950. @item maxiter
  10951. Set the maximum of iterations performed by the rendering
  10952. algorithm. Default value is 7189.
  10953. @item outer
  10954. Set outer coloring mode.
  10955. It shall assume one of following values:
  10956. @table @option
  10957. @item iteration_count
  10958. Set iteration cound mode.
  10959. @item normalized_iteration_count
  10960. set normalized iteration count mode.
  10961. @end table
  10962. Default value is @var{normalized_iteration_count}.
  10963. @item rate, r
  10964. Set frame rate, expressed as number of frames per second. Default
  10965. value is "25".
  10966. @item size, s
  10967. Set frame size. For the syntax of this option, check the "Video
  10968. size" section in the ffmpeg-utils manual. Default value is "640x480".
  10969. @item start_scale
  10970. Set the initial scale value. Default value is 3.0.
  10971. @item start_x
  10972. Set the initial x position. Must be a floating point value between
  10973. -100 and 100. Default value is -0.743643887037158704752191506114774.
  10974. @item start_y
  10975. Set the initial y position. Must be a floating point value between
  10976. -100 and 100. Default value is -0.131825904205311970493132056385139.
  10977. @end table
  10978. @section mptestsrc
  10979. Generate various test patterns, as generated by the MPlayer test filter.
  10980. The size of the generated video is fixed, and is 256x256.
  10981. This source is useful in particular for testing encoding features.
  10982. This source accepts the following options:
  10983. @table @option
  10984. @item rate, r
  10985. Specify the frame rate of the sourced video, as the number of frames
  10986. generated per second. It has to be a string in the format
  10987. @var{frame_rate_num}/@var{frame_rate_den}, an integer number, a floating point
  10988. number or a valid video frame rate abbreviation. The default value is
  10989. "25".
  10990. @item duration, d
  10991. Set the duration of the sourced video. See
  10992. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  10993. for the accepted syntax.
  10994. If not specified, or the expressed duration is negative, the video is
  10995. supposed to be generated forever.
  10996. @item test, t
  10997. Set the number or the name of the test to perform. Supported tests are:
  10998. @table @option
  10999. @item dc_luma
  11000. @item dc_chroma
  11001. @item freq_luma
  11002. @item freq_chroma
  11003. @item amp_luma
  11004. @item amp_chroma
  11005. @item cbp
  11006. @item mv
  11007. @item ring1
  11008. @item ring2
  11009. @item all
  11010. @end table
  11011. Default value is "all", which will cycle through the list of all tests.
  11012. @end table
  11013. Some examples:
  11014. @example
  11015. mptestsrc=t=dc_luma
  11016. @end example
  11017. will generate a "dc_luma" test pattern.
  11018. @section frei0r_src
  11019. Provide a frei0r source.
  11020. To enable compilation of this filter you need to install the frei0r
  11021. header and configure FFmpeg with @code{--enable-frei0r}.
  11022. This source accepts the following parameters:
  11023. @table @option
  11024. @item size
  11025. The size of the video to generate. For the syntax of this option, check the
  11026. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  11027. @item framerate
  11028. The framerate of the generated video. It may be a string of the form
  11029. @var{num}/@var{den} or a frame rate abbreviation.
  11030. @item filter_name
  11031. The name to the frei0r source to load. For more information regarding frei0r and
  11032. how to set the parameters, read the @ref{frei0r} section in the video filters
  11033. documentation.
  11034. @item filter_params
  11035. A '|'-separated list of parameters to pass to the frei0r source.
  11036. @end table
  11037. For example, to generate a frei0r partik0l source with size 200x200
  11038. and frame rate 10 which is overlaid on the overlay filter main input:
  11039. @example
  11040. frei0r_src=size=200x200:framerate=10:filter_name=partik0l:filter_params=1234 [overlay]; [in][overlay] overlay
  11041. @end example
  11042. @section life
  11043. Generate a life pattern.
  11044. This source is based on a generalization of John Conway's life game.
  11045. The sourced input represents a life grid, each pixel represents a cell
  11046. which can be in one of two possible states, alive or dead. Every cell
  11047. interacts with its eight neighbours, which are the cells that are
  11048. horizontally, vertically, or diagonally adjacent.
  11049. At each interaction the grid evolves according to the adopted rule,
  11050. which specifies the number of neighbor alive cells which will make a
  11051. cell stay alive or born. The @option{rule} option allows one to specify
  11052. the rule to adopt.
  11053. This source accepts the following options:
  11054. @table @option
  11055. @item filename, f
  11056. Set the file from which to read the initial grid state. In the file,
  11057. each non-whitespace character is considered an alive cell, and newline
  11058. is used to delimit the end of each row.
  11059. If this option is not specified, the initial grid is generated
  11060. randomly.
  11061. @item rate, r
  11062. Set the video rate, that is the number of frames generated per second.
  11063. Default is 25.
  11064. @item random_fill_ratio, ratio
  11065. Set the random fill ratio for the initial random grid. It is a
  11066. floating point number value ranging from 0 to 1, defaults to 1/PHI.
  11067. It is ignored when a file is specified.
  11068. @item random_seed, seed
  11069. Set the seed for filling the initial random grid, must be an integer
  11070. included between 0 and UINT32_MAX. If not specified, or if explicitly
  11071. set to -1, the filter will try to use a good random seed on a best
  11072. effort basis.
  11073. @item rule
  11074. Set the life rule.
  11075. A rule can be specified with a code of the kind "S@var{NS}/B@var{NB}",
  11076. where @var{NS} and @var{NB} are sequences of numbers in the range 0-8,
  11077. @var{NS} specifies the number of alive neighbor cells which make a
  11078. live cell stay alive, and @var{NB} the number of alive neighbor cells
  11079. which make a dead cell to become alive (i.e. to "born").
  11080. "s" and "b" can be used in place of "S" and "B", respectively.
  11081. Alternatively a rule can be specified by an 18-bits integer. The 9
  11082. high order bits are used to encode the next cell state if it is alive
  11083. for each number of neighbor alive cells, the low order bits specify
  11084. the rule for "borning" new cells. Higher order bits encode for an
  11085. higher number of neighbor cells.
  11086. For example the number 6153 = @code{(12<<9)+9} specifies a stay alive
  11087. rule of 12 and a born rule of 9, which corresponds to "S23/B03".
  11088. Default value is "S23/B3", which is the original Conway's game of life
  11089. rule, and will keep a cell alive if it has 2 or 3 neighbor alive
  11090. cells, and will born a new cell if there are three alive cells around
  11091. a dead cell.
  11092. @item size, s
  11093. Set the size of the output video. For the syntax of this option, check the
  11094. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  11095. If @option{filename} is specified, the size is set by default to the
  11096. same size of the input file. If @option{size} is set, it must contain
  11097. the size specified in the input file, and the initial grid defined in
  11098. that file is centered in the larger resulting area.
  11099. If a filename is not specified, the size value defaults to "320x240"
  11100. (used for a randomly generated initial grid).
  11101. @item stitch
  11102. If set to 1, stitch the left and right grid edges together, and the
  11103. top and bottom edges also. Defaults to 1.
  11104. @item mold
  11105. Set cell mold speed. If set, a dead cell will go from @option{death_color} to
  11106. @option{mold_color} with a step of @option{mold}. @option{mold} can have a
  11107. value from 0 to 255.
  11108. @item life_color
  11109. Set the color of living (or new born) cells.
  11110. @item death_color
  11111. Set the color of dead cells. If @option{mold} is set, this is the first color
  11112. used to represent a dead cell.
  11113. @item mold_color
  11114. Set mold color, for definitely dead and moldy cells.
  11115. For the syntax of these 3 color options, check the "Color" section in the
  11116. ffmpeg-utils manual.
  11117. @end table
  11118. @subsection Examples
  11119. @itemize
  11120. @item
  11121. Read a grid from @file{pattern}, and center it on a grid of size
  11122. 300x300 pixels:
  11123. @example
  11124. life=f=pattern:s=300x300
  11125. @end example
  11126. @item
  11127. Generate a random grid of size 200x200, with a fill ratio of 2/3:
  11128. @example
  11129. life=ratio=2/3:s=200x200
  11130. @end example
  11131. @item
  11132. Specify a custom rule for evolving a randomly generated grid:
  11133. @example
  11134. life=rule=S14/B34
  11135. @end example
  11136. @item
  11137. Full example with slow death effect (mold) using @command{ffplay}:
  11138. @example
  11139. ffplay -f lavfi life=s=300x200:mold=10:r=60:ratio=0.1:death_color=#C83232:life_color=#00ff00,scale=1200:800:flags=16
  11140. @end example
  11141. @end itemize
  11142. @anchor{allrgb}
  11143. @anchor{allyuv}
  11144. @anchor{color}
  11145. @anchor{haldclutsrc}
  11146. @anchor{nullsrc}
  11147. @anchor{rgbtestsrc}
  11148. @anchor{smptebars}
  11149. @anchor{smptehdbars}
  11150. @anchor{testsrc}
  11151. @anchor{testsrc2}
  11152. @section allrgb, allyuv, color, haldclutsrc, nullsrc, rgbtestsrc, smptebars, smptehdbars, testsrc, testsrc2
  11153. The @code{allrgb} source returns frames of size 4096x4096 of all rgb colors.
  11154. The @code{allyuv} source returns frames of size 4096x4096 of all yuv colors.
  11155. The @code{color} source provides an uniformly colored input.
  11156. The @code{haldclutsrc} source provides an identity Hald CLUT. See also
  11157. @ref{haldclut} filter.
  11158. The @code{nullsrc} source returns unprocessed video frames. It is
  11159. mainly useful to be employed in analysis / debugging tools, or as the
  11160. source for filters which ignore the input data.
  11161. The @code{rgbtestsrc} source generates an RGB test pattern useful for
  11162. detecting RGB vs BGR issues. You should see a red, green and blue
  11163. stripe from top to bottom.
  11164. The @code{smptebars} source generates a color bars pattern, based on
  11165. the SMPTE Engineering Guideline EG 1-1990.
  11166. The @code{smptehdbars} source generates a color bars pattern, based on
  11167. the SMPTE RP 219-2002.
  11168. The @code{testsrc} source generates a test video pattern, showing a
  11169. color pattern, a scrolling gradient and a timestamp. This is mainly
  11170. intended for testing purposes.
  11171. The @code{testsrc2} source is similar to testsrc, but supports more
  11172. pixel formats instead of just @code{rgb24}. This allows using it as an
  11173. input for other tests without requiring a format conversion.
  11174. The sources accept the following parameters:
  11175. @table @option
  11176. @item color, c
  11177. Specify the color of the source, only available in the @code{color}
  11178. source. For the syntax of this option, check the "Color" section in the
  11179. ffmpeg-utils manual.
  11180. @item level
  11181. Specify the level of the Hald CLUT, only available in the @code{haldclutsrc}
  11182. source. A level of @code{N} generates a picture of @code{N*N*N} by @code{N*N*N}
  11183. pixels to be used as identity matrix for 3D lookup tables. Each component is
  11184. coded on a @code{1/(N*N)} scale.
  11185. @item size, s
  11186. Specify the size of the sourced video. For the syntax of this option, check the
  11187. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  11188. The default value is @code{320x240}.
  11189. This option is not available with the @code{haldclutsrc} filter.
  11190. @item rate, r
  11191. Specify the frame rate of the sourced video, as the number of frames
  11192. generated per second. It has to be a string in the format
  11193. @var{frame_rate_num}/@var{frame_rate_den}, an integer number, a floating point
  11194. number or a valid video frame rate abbreviation. The default value is
  11195. "25".
  11196. @item sar
  11197. Set the sample aspect ratio of the sourced video.
  11198. @item duration, d
  11199. Set the duration of the sourced video. See
  11200. @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
  11201. for the accepted syntax.
  11202. If not specified, or the expressed duration is negative, the video is
  11203. supposed to be generated forever.
  11204. @item decimals, n
  11205. Set the number of decimals to show in the timestamp, only available in the
  11206. @code{testsrc} source.
  11207. The displayed timestamp value will correspond to the original
  11208. timestamp value multiplied by the power of 10 of the specified
  11209. value. Default value is 0.
  11210. @end table
  11211. For example the following:
  11212. @example
  11213. testsrc=duration=5.3:size=qcif:rate=10
  11214. @end example
  11215. will generate a video with a duration of 5.3 seconds, with size
  11216. 176x144 and a frame rate of 10 frames per second.
  11217. The following graph description will generate a red source
  11218. with an opacity of 0.2, with size "qcif" and a frame rate of 10
  11219. frames per second.
  11220. @example
  11221. color=c=red@@0.2:s=qcif:r=10
  11222. @end example
  11223. If the input content is to be ignored, @code{nullsrc} can be used. The
  11224. following command generates noise in the luminance plane by employing
  11225. the @code{geq} filter:
  11226. @example
  11227. nullsrc=s=256x256, geq=random(1)*255:128:128
  11228. @end example
  11229. @subsection Commands
  11230. The @code{color} source supports the following commands:
  11231. @table @option
  11232. @item c, color
  11233. Set the color of the created image. Accepts the same syntax of the
  11234. corresponding @option{color} option.
  11235. @end table
  11236. @c man end VIDEO SOURCES
  11237. @chapter Video Sinks
  11238. @c man begin VIDEO SINKS
  11239. Below is a description of the currently available video sinks.
  11240. @section buffersink
  11241. Buffer video frames, and make them available to the end of the filter
  11242. graph.
  11243. This sink is mainly intended for programmatic use, in particular
  11244. through the interface defined in @file{libavfilter/buffersink.h}
  11245. or the options system.
  11246. It accepts a pointer to an AVBufferSinkContext structure, which
  11247. defines the incoming buffers' formats, to be passed as the opaque
  11248. parameter to @code{avfilter_init_filter} for initialization.
  11249. @section nullsink
  11250. Null video sink: do absolutely nothing with the input video. It is
  11251. mainly useful as a template and for use in analysis / debugging
  11252. tools.
  11253. @c man end VIDEO SINKS
  11254. @chapter Multimedia Filters
  11255. @c man begin MULTIMEDIA FILTERS
  11256. Below is a description of the currently available multimedia filters.
  11257. @section ahistogram
  11258. Convert input audio to a video output, displaying the volume histogram.
  11259. The filter accepts the following options:
  11260. @table @option
  11261. @item dmode
  11262. Specify how histogram is calculated.
  11263. It accepts the following values:
  11264. @table @samp
  11265. @item single
  11266. Use single histogram for all channels.
  11267. @item separate
  11268. Use separate histogram for each channel.
  11269. @end table
  11270. Default is @code{single}.
  11271. @item rate, r
  11272. Set frame rate, expressed as number of frames per second. Default
  11273. value is "25".
  11274. @item size, s
  11275. Specify the video size for the output. For the syntax of this option, check the
  11276. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  11277. Default value is @code{hd720}.
  11278. @item scale
  11279. Set display scale.
  11280. It accepts the following values:
  11281. @table @samp
  11282. @item log
  11283. logarithmic
  11284. @item sqrt
  11285. square root
  11286. @item cbrt
  11287. cubic root
  11288. @item lin
  11289. linear
  11290. @item rlog
  11291. reverse logarithmic
  11292. @end table
  11293. Default is @code{log}.
  11294. @item ascale
  11295. Set amplitude scale.
  11296. It accepts the following values:
  11297. @table @samp
  11298. @item log
  11299. logarithmic
  11300. @item lin
  11301. linear
  11302. @end table
  11303. Default is @code{log}.
  11304. @item acount
  11305. Set how much frames to accumulate in histogram.
  11306. Defauls is 1. Setting this to -1 accumulates all frames.
  11307. @item rheight
  11308. Set histogram ratio of window height.
  11309. @item slide
  11310. Set sonogram sliding.
  11311. It accepts the following values:
  11312. @table @samp
  11313. @item replace
  11314. replace old rows with new ones.
  11315. @item scroll
  11316. scroll from top to bottom.
  11317. @end table
  11318. Default is @code{replace}.
  11319. @end table
  11320. @section aphasemeter
  11321. Convert input audio to a video output, displaying the audio phase.
  11322. The filter accepts the following options:
  11323. @table @option
  11324. @item rate, r
  11325. Set the output frame rate. Default value is @code{25}.
  11326. @item size, s
  11327. Set the video size for the output. For the syntax of this option, check the
  11328. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  11329. Default value is @code{800x400}.
  11330. @item rc
  11331. @item gc
  11332. @item bc
  11333. Specify the red, green, blue contrast. Default values are @code{2},
  11334. @code{7} and @code{1}.
  11335. Allowed range is @code{[0, 255]}.
  11336. @item mpc
  11337. Set color which will be used for drawing median phase. If color is
  11338. @code{none} which is default, no median phase value will be drawn.
  11339. @end table
  11340. The filter also exports the frame metadata @code{lavfi.aphasemeter.phase} which
  11341. represents mean phase of current audio frame. Value is in range @code{[-1, 1]}.
  11342. The @code{-1} means left and right channels are completely out of phase and
  11343. @code{1} means channels are in phase.
  11344. @section avectorscope
  11345. Convert input audio to a video output, representing the audio vector
  11346. scope.
  11347. The filter is used to measure the difference between channels of stereo
  11348. audio stream. A monoaural signal, consisting of identical left and right
  11349. signal, results in straight vertical line. Any stereo separation is visible
  11350. as a deviation from this line, creating a Lissajous figure.
  11351. If the straight (or deviation from it) but horizontal line appears this
  11352. indicates that the left and right channels are out of phase.
  11353. The filter accepts the following options:
  11354. @table @option
  11355. @item mode, m
  11356. Set the vectorscope mode.
  11357. Available values are:
  11358. @table @samp
  11359. @item lissajous
  11360. Lissajous rotated by 45 degrees.
  11361. @item lissajous_xy
  11362. Same as above but not rotated.
  11363. @item polar
  11364. Shape resembling half of circle.
  11365. @end table
  11366. Default value is @samp{lissajous}.
  11367. @item size, s
  11368. Set the video size for the output. For the syntax of this option, check the
  11369. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  11370. Default value is @code{400x400}.
  11371. @item rate, r
  11372. Set the output frame rate. Default value is @code{25}.
  11373. @item rc
  11374. @item gc
  11375. @item bc
  11376. @item ac
  11377. Specify the red, green, blue and alpha contrast. Default values are @code{40},
  11378. @code{160}, @code{80} and @code{255}.
  11379. Allowed range is @code{[0, 255]}.
  11380. @item rf
  11381. @item gf
  11382. @item bf
  11383. @item af
  11384. Specify the red, green, blue and alpha fade. Default values are @code{15},
  11385. @code{10}, @code{5} and @code{5}.
  11386. Allowed range is @code{[0, 255]}.
  11387. @item zoom
  11388. Set the zoom factor. Default value is @code{1}. Allowed range is @code{[1, 10]}.
  11389. @item draw
  11390. Set the vectorscope drawing mode.
  11391. Available values are:
  11392. @table @samp
  11393. @item dot
  11394. Draw dot for each sample.
  11395. @item line
  11396. Draw line between previous and current sample.
  11397. @end table
  11398. Default value is @samp{dot}.
  11399. @end table
  11400. @subsection Examples
  11401. @itemize
  11402. @item
  11403. Complete example using @command{ffplay}:
  11404. @example
  11405. ffplay -f lavfi 'amovie=input.mp3, asplit [a][out1];
  11406. [a] avectorscope=zoom=1.3:rc=2:gc=200:bc=10:rf=1:gf=8:bf=7 [out0]'
  11407. @end example
  11408. @end itemize
  11409. @section bench, abench
  11410. Benchmark part of a filtergraph.
  11411. The filter accepts the following options:
  11412. @table @option
  11413. @item action
  11414. Start or stop a timer.
  11415. Available values are:
  11416. @table @samp
  11417. @item start
  11418. Get the current time, set it as frame metadata (using the key
  11419. @code{lavfi.bench.start_time}), and forward the frame to the next filter.
  11420. @item stop
  11421. Get the current time and fetch the @code{lavfi.bench.start_time} metadata from
  11422. the input frame metadata to get the time difference. Time difference, average,
  11423. maximum and minimum time (respectively @code{t}, @code{avg}, @code{max} and
  11424. @code{min}) are then printed. The timestamps are expressed in seconds.
  11425. @end table
  11426. @end table
  11427. @subsection Examples
  11428. @itemize
  11429. @item
  11430. Benchmark @ref{selectivecolor} filter:
  11431. @example
  11432. bench=start,selectivecolor=reds=-.2 .12 -.49,bench=stop
  11433. @end example
  11434. @end itemize
  11435. @section concat
  11436. Concatenate audio and video streams, joining them together one after the
  11437. other.
  11438. The filter works on segments of synchronized video and audio streams. All
  11439. segments must have the same number of streams of each type, and that will
  11440. also be the number of streams at output.
  11441. The filter accepts the following options:
  11442. @table @option
  11443. @item n
  11444. Set the number of segments. Default is 2.
  11445. @item v
  11446. Set the number of output video streams, that is also the number of video
  11447. streams in each segment. Default is 1.
  11448. @item a
  11449. Set the number of output audio streams, that is also the number of audio
  11450. streams in each segment. Default is 0.
  11451. @item unsafe
  11452. Activate unsafe mode: do not fail if segments have a different format.
  11453. @end table
  11454. The filter has @var{v}+@var{a} outputs: first @var{v} video outputs, then
  11455. @var{a} audio outputs.
  11456. There are @var{n}x(@var{v}+@var{a}) inputs: first the inputs for the first
  11457. segment, in the same order as the outputs, then the inputs for the second
  11458. segment, etc.
  11459. Related streams do not always have exactly the same duration, for various
  11460. reasons including codec frame size or sloppy authoring. For that reason,
  11461. related synchronized streams (e.g. a video and its audio track) should be
  11462. concatenated at once. The concat filter will use the duration of the longest
  11463. stream in each segment (except the last one), and if necessary pad shorter
  11464. audio streams with silence.
  11465. For this filter to work correctly, all segments must start at timestamp 0.
  11466. All corresponding streams must have the same parameters in all segments; the
  11467. filtering system will automatically select a common pixel format for video
  11468. streams, and a common sample format, sample rate and channel layout for
  11469. audio streams, but other settings, such as resolution, must be converted
  11470. explicitly by the user.
  11471. Different frame rates are acceptable but will result in variable frame rate
  11472. at output; be sure to configure the output file to handle it.
  11473. @subsection Examples
  11474. @itemize
  11475. @item
  11476. Concatenate an opening, an episode and an ending, all in bilingual version
  11477. (video in stream 0, audio in streams 1 and 2):
  11478. @example
  11479. ffmpeg -i opening.mkv -i episode.mkv -i ending.mkv -filter_complex \
  11480. '[0:0] [0:1] [0:2] [1:0] [1:1] [1:2] [2:0] [2:1] [2:2]
  11481. concat=n=3:v=1:a=2 [v] [a1] [a2]' \
  11482. -map '[v]' -map '[a1]' -map '[a2]' output.mkv
  11483. @end example
  11484. @item
  11485. Concatenate two parts, handling audio and video separately, using the
  11486. (a)movie sources, and adjusting the resolution:
  11487. @example
  11488. movie=part1.mp4, scale=512:288 [v1] ; amovie=part1.mp4 [a1] ;
  11489. movie=part2.mp4, scale=512:288 [v2] ; amovie=part2.mp4 [a2] ;
  11490. [v1] [v2] concat [outv] ; [a1] [a2] concat=v=0:a=1 [outa]
  11491. @end example
  11492. Note that a desync will happen at the stitch if the audio and video streams
  11493. do not have exactly the same duration in the first file.
  11494. @end itemize
  11495. @anchor{ebur128}
  11496. @section ebur128
  11497. EBU R128 scanner filter. This filter takes an audio stream as input and outputs
  11498. it unchanged. By default, it logs a message at a frequency of 10Hz with the
  11499. Momentary loudness (identified by @code{M}), Short-term loudness (@code{S}),
  11500. Integrated loudness (@code{I}) and Loudness Range (@code{LRA}).
  11501. The filter also has a video output (see the @var{video} option) with a real
  11502. time graph to observe the loudness evolution. The graphic contains the logged
  11503. message mentioned above, so it is not printed anymore when this option is set,
  11504. unless the verbose logging is set. The main graphing area contains the
  11505. short-term loudness (3 seconds of analysis), and the gauge on the right is for
  11506. the momentary loudness (400 milliseconds).
  11507. More information about the Loudness Recommendation EBU R128 on
  11508. @url{http://tech.ebu.ch/loudness}.
  11509. The filter accepts the following options:
  11510. @table @option
  11511. @item video
  11512. Activate the video output. The audio stream is passed unchanged whether this
  11513. option is set or no. The video stream will be the first output stream if
  11514. activated. Default is @code{0}.
  11515. @item size
  11516. Set the video size. This option is for video only. For the syntax of this
  11517. option, check the
  11518. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  11519. Default and minimum resolution is @code{640x480}.
  11520. @item meter
  11521. Set the EBU scale meter. Default is @code{9}. Common values are @code{9} and
  11522. @code{18}, respectively for EBU scale meter +9 and EBU scale meter +18. Any
  11523. other integer value between this range is allowed.
  11524. @item metadata
  11525. Set metadata injection. If set to @code{1}, the audio input will be segmented
  11526. into 100ms output frames, each of them containing various loudness information
  11527. in metadata. All the metadata keys are prefixed with @code{lavfi.r128.}.
  11528. Default is @code{0}.
  11529. @item framelog
  11530. Force the frame logging level.
  11531. Available values are:
  11532. @table @samp
  11533. @item info
  11534. information logging level
  11535. @item verbose
  11536. verbose logging level
  11537. @end table
  11538. By default, the logging level is set to @var{info}. If the @option{video} or
  11539. the @option{metadata} options are set, it switches to @var{verbose}.
  11540. @item peak
  11541. Set peak mode(s).
  11542. Available modes can be cumulated (the option is a @code{flag} type). Possible
  11543. values are:
  11544. @table @samp
  11545. @item none
  11546. Disable any peak mode (default).
  11547. @item sample
  11548. Enable sample-peak mode.
  11549. Simple peak mode looking for the higher sample value. It logs a message
  11550. for sample-peak (identified by @code{SPK}).
  11551. @item true
  11552. Enable true-peak mode.
  11553. If enabled, the peak lookup is done on an over-sampled version of the input
  11554. stream for better peak accuracy. It logs a message for true-peak.
  11555. (identified by @code{TPK}) and true-peak per frame (identified by @code{FTPK}).
  11556. This mode requires a build with @code{libswresample}.
  11557. @end table
  11558. @item dualmono
  11559. Treat mono input files as "dual mono". If a mono file is intended for playback
  11560. on a stereo system, its EBU R128 measurement will be perceptually incorrect.
  11561. If set to @code{true}, this option will compensate for this effect.
  11562. Multi-channel input files are not affected by this option.
  11563. @item panlaw
  11564. Set a specific pan law to be used for the measurement of dual mono files.
  11565. This parameter is optional, and has a default value of -3.01dB.
  11566. @end table
  11567. @subsection Examples
  11568. @itemize
  11569. @item
  11570. Real-time graph using @command{ffplay}, with a EBU scale meter +18:
  11571. @example
  11572. ffplay -f lavfi -i "amovie=input.mp3,ebur128=video=1:meter=18 [out0][out1]"
  11573. @end example
  11574. @item
  11575. Run an analysis with @command{ffmpeg}:
  11576. @example
  11577. ffmpeg -nostats -i input.mp3 -filter_complex ebur128 -f null -
  11578. @end example
  11579. @end itemize
  11580. @section interleave, ainterleave
  11581. Temporally interleave frames from several inputs.
  11582. @code{interleave} works with video inputs, @code{ainterleave} with audio.
  11583. These filters read frames from several inputs and send the oldest
  11584. queued frame to the output.
  11585. Input streams must have a well defined, monotonically increasing frame
  11586. timestamp values.
  11587. In order to submit one frame to output, these filters need to enqueue
  11588. at least one frame for each input, so they cannot work in case one
  11589. input is not yet terminated and will not receive incoming frames.
  11590. For example consider the case when one input is a @code{select} filter
  11591. which always drop input frames. The @code{interleave} filter will keep
  11592. reading from that input, but it will never be able to send new frames
  11593. to output until the input will send an end-of-stream signal.
  11594. Also, depending on inputs synchronization, the filters will drop
  11595. frames in case one input receives more frames than the other ones, and
  11596. the queue is already filled.
  11597. These filters accept the following options:
  11598. @table @option
  11599. @item nb_inputs, n
  11600. Set the number of different inputs, it is 2 by default.
  11601. @end table
  11602. @subsection Examples
  11603. @itemize
  11604. @item
  11605. Interleave frames belonging to different streams using @command{ffmpeg}:
  11606. @example
  11607. ffmpeg -i bambi.avi -i pr0n.mkv -filter_complex "[0:v][1:v] interleave" out.avi
  11608. @end example
  11609. @item
  11610. Add flickering blur effect:
  11611. @example
  11612. select='if(gt(random(0), 0.2), 1, 2)':n=2 [tmp], boxblur=2:2, [tmp] interleave
  11613. @end example
  11614. @end itemize
  11615. @section perms, aperms
  11616. Set read/write permissions for the output frames.
  11617. These filters are mainly aimed at developers to test direct path in the
  11618. following filter in the filtergraph.
  11619. The filters accept the following options:
  11620. @table @option
  11621. @item mode
  11622. Select the permissions mode.
  11623. It accepts the following values:
  11624. @table @samp
  11625. @item none
  11626. Do nothing. This is the default.
  11627. @item ro
  11628. Set all the output frames read-only.
  11629. @item rw
  11630. Set all the output frames directly writable.
  11631. @item toggle
  11632. Make the frame read-only if writable, and writable if read-only.
  11633. @item random
  11634. Set each output frame read-only or writable randomly.
  11635. @end table
  11636. @item seed
  11637. Set the seed for the @var{random} mode, must be an integer included between
  11638. @code{0} and @code{UINT32_MAX}. If not specified, or if explicitly set to
  11639. @code{-1}, the filter will try to use a good random seed on a best effort
  11640. basis.
  11641. @end table
  11642. Note: in case of auto-inserted filter between the permission filter and the
  11643. following one, the permission might not be received as expected in that
  11644. following filter. Inserting a @ref{format} or @ref{aformat} filter before the
  11645. perms/aperms filter can avoid this problem.
  11646. @section realtime, arealtime
  11647. Slow down filtering to match real time approximatively.
  11648. These filters will pause the filtering for a variable amount of time to
  11649. match the output rate with the input timestamps.
  11650. They are similar to the @option{re} option to @code{ffmpeg}.
  11651. They accept the following options:
  11652. @table @option
  11653. @item limit
  11654. Time limit for the pauses. Any pause longer than that will be considered
  11655. a timestamp discontinuity and reset the timer. Default is 2 seconds.
  11656. @end table
  11657. @section select, aselect
  11658. Select frames to pass in output.
  11659. This filter accepts the following options:
  11660. @table @option
  11661. @item expr, e
  11662. Set expression, which is evaluated for each input frame.
  11663. If the expression is evaluated to zero, the frame is discarded.
  11664. If the evaluation result is negative or NaN, the frame is sent to the
  11665. first output; otherwise it is sent to the output with index
  11666. @code{ceil(val)-1}, assuming that the input index starts from 0.
  11667. For example a value of @code{1.2} corresponds to the output with index
  11668. @code{ceil(1.2)-1 = 2-1 = 1}, that is the second output.
  11669. @item outputs, n
  11670. Set the number of outputs. The output to which to send the selected
  11671. frame is based on the result of the evaluation. Default value is 1.
  11672. @end table
  11673. The expression can contain the following constants:
  11674. @table @option
  11675. @item n
  11676. The (sequential) number of the filtered frame, starting from 0.
  11677. @item selected_n
  11678. The (sequential) number of the selected frame, starting from 0.
  11679. @item prev_selected_n
  11680. The sequential number of the last selected frame. It's NAN if undefined.
  11681. @item TB
  11682. The timebase of the input timestamps.
  11683. @item pts
  11684. The PTS (Presentation TimeStamp) of the filtered video frame,
  11685. expressed in @var{TB} units. It's NAN if undefined.
  11686. @item t
  11687. The PTS of the filtered video frame,
  11688. expressed in seconds. It's NAN if undefined.
  11689. @item prev_pts
  11690. The PTS of the previously filtered video frame. It's NAN if undefined.
  11691. @item prev_selected_pts
  11692. The PTS of the last previously filtered video frame. It's NAN if undefined.
  11693. @item prev_selected_t
  11694. The PTS of the last previously selected video frame. It's NAN if undefined.
  11695. @item start_pts
  11696. The PTS of the first video frame in the video. It's NAN if undefined.
  11697. @item start_t
  11698. The time of the first video frame in the video. It's NAN if undefined.
  11699. @item pict_type @emph{(video only)}
  11700. The type of the filtered frame. It can assume one of the following
  11701. values:
  11702. @table @option
  11703. @item I
  11704. @item P
  11705. @item B
  11706. @item S
  11707. @item SI
  11708. @item SP
  11709. @item BI
  11710. @end table
  11711. @item interlace_type @emph{(video only)}
  11712. The frame interlace type. It can assume one of the following values:
  11713. @table @option
  11714. @item PROGRESSIVE
  11715. The frame is progressive (not interlaced).
  11716. @item TOPFIRST
  11717. The frame is top-field-first.
  11718. @item BOTTOMFIRST
  11719. The frame is bottom-field-first.
  11720. @end table
  11721. @item consumed_sample_n @emph{(audio only)}
  11722. the number of selected samples before the current frame
  11723. @item samples_n @emph{(audio only)}
  11724. the number of samples in the current frame
  11725. @item sample_rate @emph{(audio only)}
  11726. the input sample rate
  11727. @item key
  11728. This is 1 if the filtered frame is a key-frame, 0 otherwise.
  11729. @item pos
  11730. the position in the file of the filtered frame, -1 if the information
  11731. is not available (e.g. for synthetic video)
  11732. @item scene @emph{(video only)}
  11733. value between 0 and 1 to indicate a new scene; a low value reflects a low
  11734. probability for the current frame to introduce a new scene, while a higher
  11735. value means the current frame is more likely to be one (see the example below)
  11736. @item concatdec_select
  11737. The concat demuxer can select only part of a concat input file by setting an
  11738. inpoint and an outpoint, but the output packets may not be entirely contained
  11739. in the selected interval. By using this variable, it is possible to skip frames
  11740. generated by the concat demuxer which are not exactly contained in the selected
  11741. interval.
  11742. This works by comparing the frame pts against the @var{lavf.concat.start_time}
  11743. and the @var{lavf.concat.duration} packet metadata values which are also
  11744. present in the decoded frames.
  11745. The @var{concatdec_select} variable is -1 if the frame pts is at least
  11746. start_time and either the duration metadata is missing or the frame pts is less
  11747. than start_time + duration, 0 otherwise, and NaN if the start_time metadata is
  11748. missing.
  11749. That basically means that an input frame is selected if its pts is within the
  11750. interval set by the concat demuxer.
  11751. @end table
  11752. The default value of the select expression is "1".
  11753. @subsection Examples
  11754. @itemize
  11755. @item
  11756. Select all frames in input:
  11757. @example
  11758. select
  11759. @end example
  11760. The example above is the same as:
  11761. @example
  11762. select=1
  11763. @end example
  11764. @item
  11765. Skip all frames:
  11766. @example
  11767. select=0
  11768. @end example
  11769. @item
  11770. Select only I-frames:
  11771. @example
  11772. select='eq(pict_type\,I)'
  11773. @end example
  11774. @item
  11775. Select one frame every 100:
  11776. @example
  11777. select='not(mod(n\,100))'
  11778. @end example
  11779. @item
  11780. Select only frames contained in the 10-20 time interval:
  11781. @example
  11782. select=between(t\,10\,20)
  11783. @end example
  11784. @item
  11785. Select only I frames contained in the 10-20 time interval:
  11786. @example
  11787. select=between(t\,10\,20)*eq(pict_type\,I)
  11788. @end example
  11789. @item
  11790. Select frames with a minimum distance of 10 seconds:
  11791. @example
  11792. select='isnan(prev_selected_t)+gte(t-prev_selected_t\,10)'
  11793. @end example
  11794. @item
  11795. Use aselect to select only audio frames with samples number > 100:
  11796. @example
  11797. aselect='gt(samples_n\,100)'
  11798. @end example
  11799. @item
  11800. Create a mosaic of the first scenes:
  11801. @example
  11802. ffmpeg -i video.avi -vf select='gt(scene\,0.4)',scale=160:120,tile -frames:v 1 preview.png
  11803. @end example
  11804. Comparing @var{scene} against a value between 0.3 and 0.5 is generally a sane
  11805. choice.
  11806. @item
  11807. Send even and odd frames to separate outputs, and compose them:
  11808. @example
  11809. select=n=2:e='mod(n, 2)+1' [odd][even]; [odd] pad=h=2*ih [tmp]; [tmp][even] overlay=y=h
  11810. @end example
  11811. @item
  11812. Select useful frames from an ffconcat file which is using inpoints and
  11813. outpoints but where the source files are not intra frame only.
  11814. @example
  11815. ffmpeg -copyts -vsync 0 -segment_time_metadata 1 -i input.ffconcat -vf select=concatdec_select -af aselect=concatdec_select output.avi
  11816. @end example
  11817. @end itemize
  11818. @section sendcmd, asendcmd
  11819. Send commands to filters in the filtergraph.
  11820. These filters read commands to be sent to other filters in the
  11821. filtergraph.
  11822. @code{sendcmd} must be inserted between two video filters,
  11823. @code{asendcmd} must be inserted between two audio filters, but apart
  11824. from that they act the same way.
  11825. The specification of commands can be provided in the filter arguments
  11826. with the @var{commands} option, or in a file specified by the
  11827. @var{filename} option.
  11828. These filters accept the following options:
  11829. @table @option
  11830. @item commands, c
  11831. Set the commands to be read and sent to the other filters.
  11832. @item filename, f
  11833. Set the filename of the commands to be read and sent to the other
  11834. filters.
  11835. @end table
  11836. @subsection Commands syntax
  11837. A commands description consists of a sequence of interval
  11838. specifications, comprising a list of commands to be executed when a
  11839. particular event related to that interval occurs. The occurring event
  11840. is typically the current frame time entering or leaving a given time
  11841. interval.
  11842. An interval is specified by the following syntax:
  11843. @example
  11844. @var{START}[-@var{END}] @var{COMMANDS};
  11845. @end example
  11846. The time interval is specified by the @var{START} and @var{END} times.
  11847. @var{END} is optional and defaults to the maximum time.
  11848. The current frame time is considered within the specified interval if
  11849. it is included in the interval [@var{START}, @var{END}), that is when
  11850. the time is greater or equal to @var{START} and is lesser than
  11851. @var{END}.
  11852. @var{COMMANDS} consists of a sequence of one or more command
  11853. specifications, separated by ",", relating to that interval. The
  11854. syntax of a command specification is given by:
  11855. @example
  11856. [@var{FLAGS}] @var{TARGET} @var{COMMAND} @var{ARG}
  11857. @end example
  11858. @var{FLAGS} is optional and specifies the type of events relating to
  11859. the time interval which enable sending the specified command, and must
  11860. be a non-null sequence of identifier flags separated by "+" or "|" and
  11861. enclosed between "[" and "]".
  11862. The following flags are recognized:
  11863. @table @option
  11864. @item enter
  11865. The command is sent when the current frame timestamp enters the
  11866. specified interval. In other words, the command is sent when the
  11867. previous frame timestamp was not in the given interval, and the
  11868. current is.
  11869. @item leave
  11870. The command is sent when the current frame timestamp leaves the
  11871. specified interval. In other words, the command is sent when the
  11872. previous frame timestamp was in the given interval, and the
  11873. current is not.
  11874. @end table
  11875. If @var{FLAGS} is not specified, a default value of @code{[enter]} is
  11876. assumed.
  11877. @var{TARGET} specifies the target of the command, usually the name of
  11878. the filter class or a specific filter instance name.
  11879. @var{COMMAND} specifies the name of the command for the target filter.
  11880. @var{ARG} is optional and specifies the optional list of argument for
  11881. the given @var{COMMAND}.
  11882. Between one interval specification and another, whitespaces, or
  11883. sequences of characters starting with @code{#} until the end of line,
  11884. are ignored and can be used to annotate comments.
  11885. A simplified BNF description of the commands specification syntax
  11886. follows:
  11887. @example
  11888. @var{COMMAND_FLAG} ::= "enter" | "leave"
  11889. @var{COMMAND_FLAGS} ::= @var{COMMAND_FLAG} [(+|"|")@var{COMMAND_FLAG}]
  11890. @var{COMMAND} ::= ["[" @var{COMMAND_FLAGS} "]"] @var{TARGET} @var{COMMAND} [@var{ARG}]
  11891. @var{COMMANDS} ::= @var{COMMAND} [,@var{COMMANDS}]
  11892. @var{INTERVAL} ::= @var{START}[-@var{END}] @var{COMMANDS}
  11893. @var{INTERVALS} ::= @var{INTERVAL}[;@var{INTERVALS}]
  11894. @end example
  11895. @subsection Examples
  11896. @itemize
  11897. @item
  11898. Specify audio tempo change at second 4:
  11899. @example
  11900. asendcmd=c='4.0 atempo tempo 1.5',atempo
  11901. @end example
  11902. @item
  11903. Specify a list of drawtext and hue commands in a file.
  11904. @example
  11905. # show text in the interval 5-10
  11906. 5.0-10.0 [enter] drawtext reinit 'fontfile=FreeSerif.ttf:text=hello world',
  11907. [leave] drawtext reinit 'fontfile=FreeSerif.ttf:text=';
  11908. # desaturate the image in the interval 15-20
  11909. 15.0-20.0 [enter] hue s 0,
  11910. [enter] drawtext reinit 'fontfile=FreeSerif.ttf:text=nocolor',
  11911. [leave] hue s 1,
  11912. [leave] drawtext reinit 'fontfile=FreeSerif.ttf:text=color';
  11913. # apply an exponential saturation fade-out effect, starting from time 25
  11914. 25 [enter] hue s exp(25-t)
  11915. @end example
  11916. A filtergraph allowing to read and process the above command list
  11917. stored in a file @file{test.cmd}, can be specified with:
  11918. @example
  11919. sendcmd=f=test.cmd,drawtext=fontfile=FreeSerif.ttf:text='',hue
  11920. @end example
  11921. @end itemize
  11922. @anchor{setpts}
  11923. @section setpts, asetpts
  11924. Change the PTS (presentation timestamp) of the input frames.
  11925. @code{setpts} works on video frames, @code{asetpts} on audio frames.
  11926. This filter accepts the following options:
  11927. @table @option
  11928. @item expr
  11929. The expression which is evaluated for each frame to construct its timestamp.
  11930. @end table
  11931. The expression is evaluated through the eval API and can contain the following
  11932. constants:
  11933. @table @option
  11934. @item FRAME_RATE
  11935. frame rate, only defined for constant frame-rate video
  11936. @item PTS
  11937. The presentation timestamp in input
  11938. @item N
  11939. The count of the input frame for video or the number of consumed samples,
  11940. not including the current frame for audio, starting from 0.
  11941. @item NB_CONSUMED_SAMPLES
  11942. The number of consumed samples, not including the current frame (only
  11943. audio)
  11944. @item NB_SAMPLES, S
  11945. The number of samples in the current frame (only audio)
  11946. @item SAMPLE_RATE, SR
  11947. The audio sample rate.
  11948. @item STARTPTS
  11949. The PTS of the first frame.
  11950. @item STARTT
  11951. the time in seconds of the first frame
  11952. @item INTERLACED
  11953. State whether the current frame is interlaced.
  11954. @item T
  11955. the time in seconds of the current frame
  11956. @item POS
  11957. original position in the file of the frame, or undefined if undefined
  11958. for the current frame
  11959. @item PREV_INPTS
  11960. The previous input PTS.
  11961. @item PREV_INT
  11962. previous input time in seconds
  11963. @item PREV_OUTPTS
  11964. The previous output PTS.
  11965. @item PREV_OUTT
  11966. previous output time in seconds
  11967. @item RTCTIME
  11968. The wallclock (RTC) time in microseconds. This is deprecated, use time(0)
  11969. instead.
  11970. @item RTCSTART
  11971. The wallclock (RTC) time at the start of the movie in microseconds.
  11972. @item TB
  11973. The timebase of the input timestamps.
  11974. @end table
  11975. @subsection Examples
  11976. @itemize
  11977. @item
  11978. Start counting PTS from zero
  11979. @example
  11980. setpts=PTS-STARTPTS
  11981. @end example
  11982. @item
  11983. Apply fast motion effect:
  11984. @example
  11985. setpts=0.5*PTS
  11986. @end example
  11987. @item
  11988. Apply slow motion effect:
  11989. @example
  11990. setpts=2.0*PTS
  11991. @end example
  11992. @item
  11993. Set fixed rate of 25 frames per second:
  11994. @example
  11995. setpts=N/(25*TB)
  11996. @end example
  11997. @item
  11998. Set fixed rate 25 fps with some jitter:
  11999. @example
  12000. setpts='1/(25*TB) * (N + 0.05 * sin(N*2*PI/25))'
  12001. @end example
  12002. @item
  12003. Apply an offset of 10 seconds to the input PTS:
  12004. @example
  12005. setpts=PTS+10/TB
  12006. @end example
  12007. @item
  12008. Generate timestamps from a "live source" and rebase onto the current timebase:
  12009. @example
  12010. setpts='(RTCTIME - RTCSTART) / (TB * 1000000)'
  12011. @end example
  12012. @item
  12013. Generate timestamps by counting samples:
  12014. @example
  12015. asetpts=N/SR/TB
  12016. @end example
  12017. @end itemize
  12018. @section settb, asettb
  12019. Set the timebase to use for the output frames timestamps.
  12020. It is mainly useful for testing timebase configuration.
  12021. It accepts the following parameters:
  12022. @table @option
  12023. @item expr, tb
  12024. The expression which is evaluated into the output timebase.
  12025. @end table
  12026. The value for @option{tb} is an arithmetic expression representing a
  12027. rational. The expression can contain the constants "AVTB" (the default
  12028. timebase), "intb" (the input timebase) and "sr" (the sample rate,
  12029. audio only). Default value is "intb".
  12030. @subsection Examples
  12031. @itemize
  12032. @item
  12033. Set the timebase to 1/25:
  12034. @example
  12035. settb=expr=1/25
  12036. @end example
  12037. @item
  12038. Set the timebase to 1/10:
  12039. @example
  12040. settb=expr=0.1
  12041. @end example
  12042. @item
  12043. Set the timebase to 1001/1000:
  12044. @example
  12045. settb=1+0.001
  12046. @end example
  12047. @item
  12048. Set the timebase to 2*intb:
  12049. @example
  12050. settb=2*intb
  12051. @end example
  12052. @item
  12053. Set the default timebase value:
  12054. @example
  12055. settb=AVTB
  12056. @end example
  12057. @end itemize
  12058. @section showcqt
  12059. Convert input audio to a video output representing frequency spectrum
  12060. logarithmically using Brown-Puckette constant Q transform algorithm with
  12061. direct frequency domain coefficient calculation (but the transform itself
  12062. is not really constant Q, instead the Q factor is actually variable/clamped),
  12063. with musical tone scale, from E0 to D#10.
  12064. The filter accepts the following options:
  12065. @table @option
  12066. @item size, s
  12067. Specify the video size for the output. It must be even. For the syntax of this option,
  12068. check the @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  12069. Default value is @code{1920x1080}.
  12070. @item fps, rate, r
  12071. Set the output frame rate. Default value is @code{25}.
  12072. @item bar_h
  12073. Set the bargraph height. It must be even. Default value is @code{-1} which
  12074. computes the bargraph height automatically.
  12075. @item axis_h
  12076. Set the axis height. It must be even. Default value is @code{-1} which computes
  12077. the axis height automatically.
  12078. @item sono_h
  12079. Set the sonogram height. It must be even. Default value is @code{-1} which
  12080. computes the sonogram height automatically.
  12081. @item fullhd
  12082. Set the fullhd resolution. This option is deprecated, use @var{size}, @var{s}
  12083. instead. Default value is @code{1}.
  12084. @item sono_v, volume
  12085. Specify the sonogram volume expression. It can contain variables:
  12086. @table @option
  12087. @item bar_v
  12088. the @var{bar_v} evaluated expression
  12089. @item frequency, freq, f
  12090. the frequency where it is evaluated
  12091. @item timeclamp, tc
  12092. the value of @var{timeclamp} option
  12093. @end table
  12094. and functions:
  12095. @table @option
  12096. @item a_weighting(f)
  12097. A-weighting of equal loudness
  12098. @item b_weighting(f)
  12099. B-weighting of equal loudness
  12100. @item c_weighting(f)
  12101. C-weighting of equal loudness.
  12102. @end table
  12103. Default value is @code{16}.
  12104. @item bar_v, volume2
  12105. Specify the bargraph volume expression. It can contain variables:
  12106. @table @option
  12107. @item sono_v
  12108. the @var{sono_v} evaluated expression
  12109. @item frequency, freq, f
  12110. the frequency where it is evaluated
  12111. @item timeclamp, tc
  12112. the value of @var{timeclamp} option
  12113. @end table
  12114. and functions:
  12115. @table @option
  12116. @item a_weighting(f)
  12117. A-weighting of equal loudness
  12118. @item b_weighting(f)
  12119. B-weighting of equal loudness
  12120. @item c_weighting(f)
  12121. C-weighting of equal loudness.
  12122. @end table
  12123. Default value is @code{sono_v}.
  12124. @item sono_g, gamma
  12125. Specify the sonogram gamma. Lower gamma makes the spectrum more contrast,
  12126. higher gamma makes the spectrum having more range. Default value is @code{3}.
  12127. Acceptable range is @code{[1, 7]}.
  12128. @item bar_g, gamma2
  12129. Specify the bargraph gamma. Default value is @code{1}. Acceptable range is
  12130. @code{[1, 7]}.
  12131. @item timeclamp, tc
  12132. Specify the transform timeclamp. At low frequency, there is trade-off between
  12133. accuracy in time domain and frequency domain. If timeclamp is lower,
  12134. event in time domain is represented more accurately (such as fast bass drum),
  12135. otherwise event in frequency domain is represented more accurately
  12136. (such as bass guitar). Acceptable range is @code{[0.1, 1]}. Default value is @code{0.17}.
  12137. @item basefreq
  12138. Specify the transform base frequency. Default value is @code{20.01523126408007475},
  12139. which is frequency 50 cents below E0. Acceptable range is @code{[10, 100000]}.
  12140. @item endfreq
  12141. Specify the transform end frequency. Default value is @code{20495.59681441799654},
  12142. which is frequency 50 cents above D#10. Acceptable range is @code{[10, 100000]}.
  12143. @item coeffclamp
  12144. This option is deprecated and ignored.
  12145. @item tlength
  12146. Specify the transform length in time domain. Use this option to control accuracy
  12147. trade-off between time domain and frequency domain at every frequency sample.
  12148. It can contain variables:
  12149. @table @option
  12150. @item frequency, freq, f
  12151. the frequency where it is evaluated
  12152. @item timeclamp, tc
  12153. the value of @var{timeclamp} option.
  12154. @end table
  12155. Default value is @code{384*tc/(384+tc*f)}.
  12156. @item count
  12157. Specify the transform count for every video frame. Default value is @code{6}.
  12158. Acceptable range is @code{[1, 30]}.
  12159. @item fcount
  12160. Specify the transform count for every single pixel. Default value is @code{0},
  12161. which makes it computed automatically. Acceptable range is @code{[0, 10]}.
  12162. @item fontfile
  12163. Specify font file for use with freetype to draw the axis. If not specified,
  12164. use embedded font. Note that drawing with font file or embedded font is not
  12165. implemented with custom @var{basefreq} and @var{endfreq}, use @var{axisfile}
  12166. option instead.
  12167. @item fontcolor
  12168. Specify font color expression. This is arithmetic expression that should return
  12169. integer value 0xRRGGBB. It can contain variables:
  12170. @table @option
  12171. @item frequency, freq, f
  12172. the frequency where it is evaluated
  12173. @item timeclamp, tc
  12174. the value of @var{timeclamp} option
  12175. @end table
  12176. and functions:
  12177. @table @option
  12178. @item midi(f)
  12179. midi number of frequency f, some midi numbers: E0(16), C1(24), C2(36), A4(69)
  12180. @item r(x), g(x), b(x)
  12181. red, green, and blue value of intensity x.
  12182. @end table
  12183. Default value is @code{st(0, (midi(f)-59.5)/12);
  12184. st(1, if(between(ld(0),0,1), 0.5-0.5*cos(2*PI*ld(0)), 0));
  12185. r(1-ld(1)) + b(ld(1))}.
  12186. @item axisfile
  12187. Specify image file to draw the axis. This option override @var{fontfile} and
  12188. @var{fontcolor} option.
  12189. @item axis, text
  12190. Enable/disable drawing text to the axis. If it is set to @code{0}, drawing to
  12191. the axis is disabled, ignoring @var{fontfile} and @var{axisfile} option.
  12192. Default value is @code{1}.
  12193. @end table
  12194. @subsection Examples
  12195. @itemize
  12196. @item
  12197. Playing audio while showing the spectrum:
  12198. @example
  12199. ffplay -f lavfi 'amovie=a.mp3, asplit [a][out1]; [a] showcqt [out0]'
  12200. @end example
  12201. @item
  12202. Same as above, but with frame rate 30 fps:
  12203. @example
  12204. ffplay -f lavfi 'amovie=a.mp3, asplit [a][out1]; [a] showcqt=fps=30:count=5 [out0]'
  12205. @end example
  12206. @item
  12207. Playing at 1280x720:
  12208. @example
  12209. ffplay -f lavfi 'amovie=a.mp3, asplit [a][out1]; [a] showcqt=s=1280x720:count=4 [out0]'
  12210. @end example
  12211. @item
  12212. Disable sonogram display:
  12213. @example
  12214. sono_h=0
  12215. @end example
  12216. @item
  12217. A1 and its harmonics: A1, A2, (near)E3, A3:
  12218. @example
  12219. ffplay -f lavfi 'aevalsrc=0.1*sin(2*PI*55*t)+0.1*sin(4*PI*55*t)+0.1*sin(6*PI*55*t)+0.1*sin(8*PI*55*t),
  12220. asplit[a][out1]; [a] showcqt [out0]'
  12221. @end example
  12222. @item
  12223. Same as above, but with more accuracy in frequency domain:
  12224. @example
  12225. ffplay -f lavfi 'aevalsrc=0.1*sin(2*PI*55*t)+0.1*sin(4*PI*55*t)+0.1*sin(6*PI*55*t)+0.1*sin(8*PI*55*t),
  12226. asplit[a][out1]; [a] showcqt=timeclamp=0.5 [out0]'
  12227. @end example
  12228. @item
  12229. Custom volume:
  12230. @example
  12231. bar_v=10:sono_v=bar_v*a_weighting(f)
  12232. @end example
  12233. @item
  12234. Custom gamma, now spectrum is linear to the amplitude.
  12235. @example
  12236. bar_g=2:sono_g=2
  12237. @end example
  12238. @item
  12239. Custom tlength equation:
  12240. @example
  12241. tc=0.33:tlength='st(0,0.17); 384*tc / (384 / ld(0) + tc*f /(1-ld(0))) + 384*tc / (tc*f / ld(0) + 384 /(1-ld(0)))'
  12242. @end example
  12243. @item
  12244. Custom fontcolor and fontfile, C-note is colored green, others are colored blue:
  12245. @example
  12246. fontcolor='if(mod(floor(midi(f)+0.5),12), 0x0000FF, g(1))':fontfile=myfont.ttf
  12247. @end example
  12248. @item
  12249. Custom frequency range with custom axis using image file:
  12250. @example
  12251. axisfile=myaxis.png:basefreq=40:endfreq=10000
  12252. @end example
  12253. @end itemize
  12254. @section showfreqs
  12255. Convert input audio to video output representing the audio power spectrum.
  12256. Audio amplitude is on Y-axis while frequency is on X-axis.
  12257. The filter accepts the following options:
  12258. @table @option
  12259. @item size, s
  12260. Specify size of video. For the syntax of this option, check the
  12261. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  12262. Default is @code{1024x512}.
  12263. @item mode
  12264. Set display mode.
  12265. This set how each frequency bin will be represented.
  12266. It accepts the following values:
  12267. @table @samp
  12268. @item line
  12269. @item bar
  12270. @item dot
  12271. @end table
  12272. Default is @code{bar}.
  12273. @item ascale
  12274. Set amplitude scale.
  12275. It accepts the following values:
  12276. @table @samp
  12277. @item lin
  12278. Linear scale.
  12279. @item sqrt
  12280. Square root scale.
  12281. @item cbrt
  12282. Cubic root scale.
  12283. @item log
  12284. Logarithmic scale.
  12285. @end table
  12286. Default is @code{log}.
  12287. @item fscale
  12288. Set frequency scale.
  12289. It accepts the following values:
  12290. @table @samp
  12291. @item lin
  12292. Linear scale.
  12293. @item log
  12294. Logarithmic scale.
  12295. @item rlog
  12296. Reverse logarithmic scale.
  12297. @end table
  12298. Default is @code{lin}.
  12299. @item win_size
  12300. Set window size.
  12301. It accepts the following values:
  12302. @table @samp
  12303. @item w16
  12304. @item w32
  12305. @item w64
  12306. @item w128
  12307. @item w256
  12308. @item w512
  12309. @item w1024
  12310. @item w2048
  12311. @item w4096
  12312. @item w8192
  12313. @item w16384
  12314. @item w32768
  12315. @item w65536
  12316. @end table
  12317. Default is @code{w2048}
  12318. @item win_func
  12319. Set windowing function.
  12320. It accepts the following values:
  12321. @table @samp
  12322. @item rect
  12323. @item bartlett
  12324. @item hanning
  12325. @item hamming
  12326. @item blackman
  12327. @item welch
  12328. @item flattop
  12329. @item bharris
  12330. @item bnuttall
  12331. @item bhann
  12332. @item sine
  12333. @item nuttall
  12334. @item lanczos
  12335. @item gauss
  12336. @item tukey
  12337. @end table
  12338. Default is @code{hanning}.
  12339. @item overlap
  12340. Set window overlap. In range @code{[0, 1]}. Default is @code{1},
  12341. which means optimal overlap for selected window function will be picked.
  12342. @item averaging
  12343. Set time averaging. Setting this to 0 will display current maximal peaks.
  12344. Default is @code{1}, which means time averaging is disabled.
  12345. @item colors
  12346. Specify list of colors separated by space or by '|' which will be used to
  12347. draw channel frequencies. Unrecognized or missing colors will be replaced
  12348. by white color.
  12349. @item cmode
  12350. Set channel display mode.
  12351. It accepts the following values:
  12352. @table @samp
  12353. @item combined
  12354. @item separate
  12355. @end table
  12356. Default is @code{combined}.
  12357. @end table
  12358. @anchor{showspectrum}
  12359. @section showspectrum
  12360. Convert input audio to a video output, representing the audio frequency
  12361. spectrum.
  12362. The filter accepts the following options:
  12363. @table @option
  12364. @item size, s
  12365. Specify the video size for the output. For the syntax of this option, check the
  12366. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  12367. Default value is @code{640x512}.
  12368. @item slide
  12369. Specify how the spectrum should slide along the window.
  12370. It accepts the following values:
  12371. @table @samp
  12372. @item replace
  12373. the samples start again on the left when they reach the right
  12374. @item scroll
  12375. the samples scroll from right to left
  12376. @item rscroll
  12377. the samples scroll from left to right
  12378. @item fullframe
  12379. frames are only produced when the samples reach the right
  12380. @end table
  12381. Default value is @code{replace}.
  12382. @item mode
  12383. Specify display mode.
  12384. It accepts the following values:
  12385. @table @samp
  12386. @item combined
  12387. all channels are displayed in the same row
  12388. @item separate
  12389. all channels are displayed in separate rows
  12390. @end table
  12391. Default value is @samp{combined}.
  12392. @item color
  12393. Specify display color mode.
  12394. It accepts the following values:
  12395. @table @samp
  12396. @item channel
  12397. each channel is displayed in a separate color
  12398. @item intensity
  12399. each channel is displayed using the same color scheme
  12400. @item rainbow
  12401. each channel is displayed using the rainbow color scheme
  12402. @item moreland
  12403. each channel is displayed using the moreland color scheme
  12404. @item nebulae
  12405. each channel is displayed using the nebulae color scheme
  12406. @item fire
  12407. each channel is displayed using the fire color scheme
  12408. @item fiery
  12409. each channel is displayed using the fiery color scheme
  12410. @item fruit
  12411. each channel is displayed using the fruit color scheme
  12412. @item cool
  12413. each channel is displayed using the cool color scheme
  12414. @end table
  12415. Default value is @samp{channel}.
  12416. @item scale
  12417. Specify scale used for calculating intensity color values.
  12418. It accepts the following values:
  12419. @table @samp
  12420. @item lin
  12421. linear
  12422. @item sqrt
  12423. square root, default
  12424. @item cbrt
  12425. cubic root
  12426. @item 4thrt
  12427. 4th root
  12428. @item 5thrt
  12429. 5th root
  12430. @item log
  12431. logarithmic
  12432. @end table
  12433. Default value is @samp{sqrt}.
  12434. @item saturation
  12435. Set saturation modifier for displayed colors. Negative values provide
  12436. alternative color scheme. @code{0} is no saturation at all.
  12437. Saturation must be in [-10.0, 10.0] range.
  12438. Default value is @code{1}.
  12439. @item win_func
  12440. Set window function.
  12441. It accepts the following values:
  12442. @table @samp
  12443. @item rect
  12444. @item bartlett
  12445. @item hann
  12446. @item hanning
  12447. @item hamming
  12448. @item blackman
  12449. @item welch
  12450. @item flattop
  12451. @item bharris
  12452. @item bnuttall
  12453. @item bhann
  12454. @item sine
  12455. @item nuttall
  12456. @item lanczos
  12457. @item gauss
  12458. @item tukey
  12459. @end table
  12460. Default value is @code{hann}.
  12461. @item orientation
  12462. Set orientation of time vs frequency axis. Can be @code{vertical} or
  12463. @code{horizontal}. Default is @code{vertical}.
  12464. @item overlap
  12465. Set ratio of overlap window. Default value is @code{0}.
  12466. When value is @code{1} overlap is set to recommended size for specific
  12467. window function currently used.
  12468. @item gain
  12469. Set scale gain for calculating intensity color values.
  12470. Default value is @code{1}.
  12471. @item data
  12472. Set which data to display. Can be @code{magnitude}, default or @code{phase}.
  12473. @end table
  12474. The usage is very similar to the showwaves filter; see the examples in that
  12475. section.
  12476. @subsection Examples
  12477. @itemize
  12478. @item
  12479. Large window with logarithmic color scaling:
  12480. @example
  12481. showspectrum=s=1280x480:scale=log
  12482. @end example
  12483. @item
  12484. Complete example for a colored and sliding spectrum per channel using @command{ffplay}:
  12485. @example
  12486. ffplay -f lavfi 'amovie=input.mp3, asplit [a][out1];
  12487. [a] showspectrum=mode=separate:color=intensity:slide=1:scale=cbrt [out0]'
  12488. @end example
  12489. @end itemize
  12490. @section showspectrumpic
  12491. Convert input audio to a single video frame, representing the audio frequency
  12492. spectrum.
  12493. The filter accepts the following options:
  12494. @table @option
  12495. @item size, s
  12496. Specify the video size for the output. For the syntax of this option, check the
  12497. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  12498. Default value is @code{4096x2048}.
  12499. @item mode
  12500. Specify display mode.
  12501. It accepts the following values:
  12502. @table @samp
  12503. @item combined
  12504. all channels are displayed in the same row
  12505. @item separate
  12506. all channels are displayed in separate rows
  12507. @end table
  12508. Default value is @samp{combined}.
  12509. @item color
  12510. Specify display color mode.
  12511. It accepts the following values:
  12512. @table @samp
  12513. @item channel
  12514. each channel is displayed in a separate color
  12515. @item intensity
  12516. each channel is displayed using the same color scheme
  12517. @item rainbow
  12518. each channel is displayed using the rainbow color scheme
  12519. @item moreland
  12520. each channel is displayed using the moreland color scheme
  12521. @item nebulae
  12522. each channel is displayed using the nebulae color scheme
  12523. @item fire
  12524. each channel is displayed using the fire color scheme
  12525. @item fiery
  12526. each channel is displayed using the fiery color scheme
  12527. @item fruit
  12528. each channel is displayed using the fruit color scheme
  12529. @item cool
  12530. each channel is displayed using the cool color scheme
  12531. @end table
  12532. Default value is @samp{intensity}.
  12533. @item scale
  12534. Specify scale used for calculating intensity color values.
  12535. It accepts the following values:
  12536. @table @samp
  12537. @item lin
  12538. linear
  12539. @item sqrt
  12540. square root, default
  12541. @item cbrt
  12542. cubic root
  12543. @item 4thrt
  12544. 4th root
  12545. @item 5thrt
  12546. 5th root
  12547. @item log
  12548. logarithmic
  12549. @end table
  12550. Default value is @samp{log}.
  12551. @item saturation
  12552. Set saturation modifier for displayed colors. Negative values provide
  12553. alternative color scheme. @code{0} is no saturation at all.
  12554. Saturation must be in [-10.0, 10.0] range.
  12555. Default value is @code{1}.
  12556. @item win_func
  12557. Set window function.
  12558. It accepts the following values:
  12559. @table @samp
  12560. @item rect
  12561. @item bartlett
  12562. @item hann
  12563. @item hanning
  12564. @item hamming
  12565. @item blackman
  12566. @item welch
  12567. @item flattop
  12568. @item bharris
  12569. @item bnuttall
  12570. @item bhann
  12571. @item sine
  12572. @item nuttall
  12573. @item lanczos
  12574. @item gauss
  12575. @item tukey
  12576. @end table
  12577. Default value is @code{hann}.
  12578. @item orientation
  12579. Set orientation of time vs frequency axis. Can be @code{vertical} or
  12580. @code{horizontal}. Default is @code{vertical}.
  12581. @item gain
  12582. Set scale gain for calculating intensity color values.
  12583. Default value is @code{1}.
  12584. @item legend
  12585. Draw time and frequency axes and legends. Default is enabled.
  12586. @end table
  12587. @subsection Examples
  12588. @itemize
  12589. @item
  12590. Extract an audio spectrogram of a whole audio track
  12591. in a 1024x1024 picture using @command{ffmpeg}:
  12592. @example
  12593. ffmpeg -i audio.flac -lavfi showspectrumpic=s=1024x1024 spectrogram.png
  12594. @end example
  12595. @end itemize
  12596. @section showvolume
  12597. Convert input audio volume to a video output.
  12598. The filter accepts the following options:
  12599. @table @option
  12600. @item rate, r
  12601. Set video rate.
  12602. @item b
  12603. Set border width, allowed range is [0, 5]. Default is 1.
  12604. @item w
  12605. Set channel width, allowed range is [80, 8192]. Default is 400.
  12606. @item h
  12607. Set channel height, allowed range is [1, 900]. Default is 20.
  12608. @item f
  12609. Set fade, allowed range is [0.001, 1]. Default is 0.95.
  12610. @item c
  12611. Set volume color expression.
  12612. The expression can use the following variables:
  12613. @table @option
  12614. @item VOLUME
  12615. Current max volume of channel in dB.
  12616. @item CHANNEL
  12617. Current channel number, starting from 0.
  12618. @end table
  12619. @item t
  12620. If set, displays channel names. Default is enabled.
  12621. @item v
  12622. If set, displays volume values. Default is enabled.
  12623. @item o
  12624. Set orientation, can be @code{horizontal} or @code{vertical},
  12625. default is @code{horizontal}.
  12626. @item s
  12627. Set step size, allowed range s [0, 5]. Default is 0, which means
  12628. step is disabled.
  12629. @end table
  12630. @section showwaves
  12631. Convert input audio to a video output, representing the samples waves.
  12632. The filter accepts the following options:
  12633. @table @option
  12634. @item size, s
  12635. Specify the video size for the output. For the syntax of this option, check the
  12636. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  12637. Default value is @code{600x240}.
  12638. @item mode
  12639. Set display mode.
  12640. Available values are:
  12641. @table @samp
  12642. @item point
  12643. Draw a point for each sample.
  12644. @item line
  12645. Draw a vertical line for each sample.
  12646. @item p2p
  12647. Draw a point for each sample and a line between them.
  12648. @item cline
  12649. Draw a centered vertical line for each sample.
  12650. @end table
  12651. Default value is @code{point}.
  12652. @item n
  12653. Set the number of samples which are printed on the same column. A
  12654. larger value will decrease the frame rate. Must be a positive
  12655. integer. This option can be set only if the value for @var{rate}
  12656. is not explicitly specified.
  12657. @item rate, r
  12658. Set the (approximate) output frame rate. This is done by setting the
  12659. option @var{n}. Default value is "25".
  12660. @item split_channels
  12661. Set if channels should be drawn separately or overlap. Default value is 0.
  12662. @item colors
  12663. Set colors separated by '|' which are going to be used for drawing of each channel.
  12664. @item scale
  12665. Set amplitude scale. Can be linear @code{lin} or logarithmic @code{log}.
  12666. Default is linear.
  12667. @end table
  12668. @subsection Examples
  12669. @itemize
  12670. @item
  12671. Output the input file audio and the corresponding video representation
  12672. at the same time:
  12673. @example
  12674. amovie=a.mp3,asplit[out0],showwaves[out1]
  12675. @end example
  12676. @item
  12677. Create a synthetic signal and show it with showwaves, forcing a
  12678. frame rate of 30 frames per second:
  12679. @example
  12680. aevalsrc=sin(1*2*PI*t)*sin(880*2*PI*t):cos(2*PI*200*t),asplit[out0],showwaves=r=30[out1]
  12681. @end example
  12682. @end itemize
  12683. @section showwavespic
  12684. Convert input audio to a single video frame, representing the samples waves.
  12685. The filter accepts the following options:
  12686. @table @option
  12687. @item size, s
  12688. Specify the video size for the output. For the syntax of this option, check the
  12689. @ref{video size syntax,,"Video size" section in the ffmpeg-utils manual,ffmpeg-utils}.
  12690. Default value is @code{600x240}.
  12691. @item split_channels
  12692. Set if channels should be drawn separately or overlap. Default value is 0.
  12693. @item colors
  12694. Set colors separated by '|' which are going to be used for drawing of each channel.
  12695. @item scale
  12696. Set amplitude scale. Can be linear @code{lin} or logarithmic @code{log}.
  12697. Default is linear.
  12698. @end table
  12699. @subsection Examples
  12700. @itemize
  12701. @item
  12702. Extract a channel split representation of the wave form of a whole audio track
  12703. in a 1024x800 picture using @command{ffmpeg}:
  12704. @example
  12705. ffmpeg -i audio.flac -lavfi showwavespic=split_channels=1:s=1024x800 waveform.png
  12706. @end example
  12707. @item
  12708. Colorize the waveform with colorchannelmixer. This example will make
  12709. the waveform a green color approximately RGB(66,217,150). Additional
  12710. channels will be shades of this color.
  12711. @example
  12712. ffmpeg -i audio.mp3 -filter_complex "showwavespic,colorchannelmixer=rr=66/255:gg=217/255:bb=150/255" waveform.png
  12713. @end example
  12714. @end itemize
  12715. @section spectrumsynth
  12716. Sythesize audio from 2 input video spectrums, first input stream represents
  12717. magnitude across time and second represents phase across time.
  12718. The filter will transform from frequency domain as displayed in videos back
  12719. to time domain as presented in audio output.
  12720. This filter is primarly created for reversing processed @ref{showspectrum}
  12721. filter outputs, but can synthesize sound from other spectrograms too.
  12722. But in such case results are going to be poor if the phase data is not
  12723. available, because in such cases phase data need to be recreated, usually
  12724. its just recreated from random noise.
  12725. For best results use gray only output (@code{channel} color mode in
  12726. @ref{showspectrum} filter) and @code{log} scale for magnitude video and
  12727. @code{lin} scale for phase video. To produce phase, for 2nd video, use
  12728. @code{data} option. Inputs videos should generally use @code{fullframe}
  12729. slide mode as that saves resources needed for decoding video.
  12730. The filter accepts the following options:
  12731. @table @option
  12732. @item sample_rate
  12733. Specify sample rate of output audio, the sample rate of audio from which
  12734. spectrum was generated may differ.
  12735. @item channels
  12736. Set number of channels represented in input video spectrums.
  12737. @item scale
  12738. Set scale which was used when generating magnitude input spectrum.
  12739. Can be @code{lin} or @code{log}. Default is @code{log}.
  12740. @item slide
  12741. Set slide which was used when generating inputs spectrums.
  12742. Can be @code{replace}, @code{scroll}, @code{fullframe} or @code{rscroll}.
  12743. Default is @code{fullframe}.
  12744. @item win_func
  12745. Set window function used for resynthesis.
  12746. @item overlap
  12747. Set window overlap. In range @code{[0, 1]}. Default is @code{1},
  12748. which means optimal overlap for selected window function will be picked.
  12749. @item orientation
  12750. Set orientation of input videos. Can be @code{vertical} or @code{horizontal}.
  12751. Default is @code{vertical}.
  12752. @end table
  12753. @subsection Examples
  12754. @itemize
  12755. @item
  12756. First create magnitude and phase videos from audio, assuming audio is stereo with 44100 sample rate,
  12757. then resynthesize videos back to audio with spectrumsynth:
  12758. @example
  12759. ffmpeg -i input.flac -lavfi showspectrum=mode=separate:scale=log:overlap=0.875:color=channel:slide=fullframe:data=magnitude -an -c:v rawvideo magnitude.nut
  12760. ffmpeg -i input.flac -lavfi showspectrum=mode=separate:scale=lin:overlap=0.875:color=channel:slide=fullframe:data=phase -an -c:v rawvideo phase.nut
  12761. ffmpeg -i magnitude.nut -i phase.nut -lavfi spectrumsynth=channels=2:sample_rate=44100:win_func=hann:overlap=0.875:slide=fullframe output.flac
  12762. @end example
  12763. @end itemize
  12764. @section split, asplit
  12765. Split input into several identical outputs.
  12766. @code{asplit} works with audio input, @code{split} with video.
  12767. The filter accepts a single parameter which specifies the number of outputs. If
  12768. unspecified, it defaults to 2.
  12769. @subsection Examples
  12770. @itemize
  12771. @item
  12772. Create two separate outputs from the same input:
  12773. @example
  12774. [in] split [out0][out1]
  12775. @end example
  12776. @item
  12777. To create 3 or more outputs, you need to specify the number of
  12778. outputs, like in:
  12779. @example
  12780. [in] asplit=3 [out0][out1][out2]
  12781. @end example
  12782. @item
  12783. Create two separate outputs from the same input, one cropped and
  12784. one padded:
  12785. @example
  12786. [in] split [splitout1][splitout2];
  12787. [splitout1] crop=100:100:0:0 [cropout];
  12788. [splitout2] pad=200:200:100:100 [padout];
  12789. @end example
  12790. @item
  12791. Create 5 copies of the input audio with @command{ffmpeg}:
  12792. @example
  12793. ffmpeg -i INPUT -filter_complex asplit=5 OUTPUT
  12794. @end example
  12795. @end itemize
  12796. @section zmq, azmq
  12797. Receive commands sent through a libzmq client, and forward them to
  12798. filters in the filtergraph.
  12799. @code{zmq} and @code{azmq} work as a pass-through filters. @code{zmq}
  12800. must be inserted between two video filters, @code{azmq} between two
  12801. audio filters.
  12802. To enable these filters you need to install the libzmq library and
  12803. headers and configure FFmpeg with @code{--enable-libzmq}.
  12804. For more information about libzmq see:
  12805. @url{http://www.zeromq.org/}
  12806. The @code{zmq} and @code{azmq} filters work as a libzmq server, which
  12807. receives messages sent through a network interface defined by the
  12808. @option{bind_address} option.
  12809. The received message must be in the form:
  12810. @example
  12811. @var{TARGET} @var{COMMAND} [@var{ARG}]
  12812. @end example
  12813. @var{TARGET} specifies the target of the command, usually the name of
  12814. the filter class or a specific filter instance name.
  12815. @var{COMMAND} specifies the name of the command for the target filter.
  12816. @var{ARG} is optional and specifies the optional argument list for the
  12817. given @var{COMMAND}.
  12818. Upon reception, the message is processed and the corresponding command
  12819. is injected into the filtergraph. Depending on the result, the filter
  12820. will send a reply to the client, adopting the format:
  12821. @example
  12822. @var{ERROR_CODE} @var{ERROR_REASON}
  12823. @var{MESSAGE}
  12824. @end example
  12825. @var{MESSAGE} is optional.
  12826. @subsection Examples
  12827. Look at @file{tools/zmqsend} for an example of a zmq client which can
  12828. be used to send commands processed by these filters.
  12829. Consider the following filtergraph generated by @command{ffplay}
  12830. @example
  12831. ffplay -dumpgraph 1 -f lavfi "
  12832. color=s=100x100:c=red [l];
  12833. color=s=100x100:c=blue [r];
  12834. nullsrc=s=200x100, zmq [bg];
  12835. [bg][l] overlay [bg+l];
  12836. [bg+l][r] overlay=x=100 "
  12837. @end example
  12838. To change the color of the left side of the video, the following
  12839. command can be used:
  12840. @example
  12841. echo Parsed_color_0 c yellow | tools/zmqsend
  12842. @end example
  12843. To change the right side:
  12844. @example
  12845. echo Parsed_color_1 c pink | tools/zmqsend
  12846. @end example
  12847. @c man end MULTIMEDIA FILTERS
  12848. @chapter Multimedia Sources
  12849. @c man begin MULTIMEDIA SOURCES
  12850. Below is a description of the currently available multimedia sources.
  12851. @section amovie
  12852. This is the same as @ref{movie} source, except it selects an audio
  12853. stream by default.
  12854. @anchor{movie}
  12855. @section movie
  12856. Read audio and/or video stream(s) from a movie container.
  12857. It accepts the following parameters:
  12858. @table @option
  12859. @item filename
  12860. The name of the resource to read (not necessarily a file; it can also be a
  12861. device or a stream accessed through some protocol).
  12862. @item format_name, f
  12863. Specifies the format assumed for the movie to read, and can be either
  12864. the name of a container or an input device. If not specified, the
  12865. format is guessed from @var{movie_name} or by probing.
  12866. @item seek_point, sp
  12867. Specifies the seek point in seconds. The frames will be output
  12868. starting from this seek point. The parameter is evaluated with
  12869. @code{av_strtod}, so the numerical value may be suffixed by an IS
  12870. postfix. The default value is "0".
  12871. @item streams, s
  12872. Specifies the streams to read. Several streams can be specified,
  12873. separated by "+". The source will then have as many outputs, in the
  12874. same order. The syntax is explained in the ``Stream specifiers''
  12875. section in the ffmpeg manual. Two special names, "dv" and "da" specify
  12876. respectively the default (best suited) video and audio stream. Default
  12877. is "dv", or "da" if the filter is called as "amovie".
  12878. @item stream_index, si
  12879. Specifies the index of the video stream to read. If the value is -1,
  12880. the most suitable video stream will be automatically selected. The default
  12881. value is "-1". Deprecated. If the filter is called "amovie", it will select
  12882. audio instead of video.
  12883. @item loop
  12884. Specifies how many times to read the stream in sequence.
  12885. If the value is less than 1, the stream will be read again and again.
  12886. Default value is "1".
  12887. Note that when the movie is looped the source timestamps are not
  12888. changed, so it will generate non monotonically increasing timestamps.
  12889. @end table
  12890. It allows overlaying a second video on top of the main input of
  12891. a filtergraph, as shown in this graph:
  12892. @example
  12893. input -----------> deltapts0 --> overlay --> output
  12894. ^
  12895. |
  12896. movie --> scale--> deltapts1 -------+
  12897. @end example
  12898. @subsection Examples
  12899. @itemize
  12900. @item
  12901. Skip 3.2 seconds from the start of the AVI file in.avi, and overlay it
  12902. on top of the input labelled "in":
  12903. @example
  12904. movie=in.avi:seek_point=3.2, scale=180:-1, setpts=PTS-STARTPTS [over];
  12905. [in] setpts=PTS-STARTPTS [main];
  12906. [main][over] overlay=16:16 [out]
  12907. @end example
  12908. @item
  12909. Read from a video4linux2 device, and overlay it on top of the input
  12910. labelled "in":
  12911. @example
  12912. movie=/dev/video0:f=video4linux2, scale=180:-1, setpts=PTS-STARTPTS [over];
  12913. [in] setpts=PTS-STARTPTS [main];
  12914. [main][over] overlay=16:16 [out]
  12915. @end example
  12916. @item
  12917. Read the first video stream and the audio stream with id 0x81 from
  12918. dvd.vob; the video is connected to the pad named "video" and the audio is
  12919. connected to the pad named "audio":
  12920. @example
  12921. movie=dvd.vob:s=v:0+#0x81 [video] [audio]
  12922. @end example
  12923. @end itemize
  12924. @c man end MULTIMEDIA SOURCES