You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

5218 lines
148KB

  1. @chapter Filtering Introduction
  2. @c man begin FILTERING INTRODUCTION
  3. Filtering in FFmpeg is enabled through the libavfilter library.
  4. Libavfilter is the filtering API of FFmpeg. It is the substitute of
  5. the now deprecated 'vhooks' and started as a Google Summer of Code
  6. project.
  7. Audio filtering integration into the main FFmpeg repository is a work in
  8. progress, so audio API and ABI should not be considered stable yet.
  9. In libavfilter, it is possible for filters to have multiple inputs and
  10. multiple outputs.
  11. To illustrate the sorts of things that are possible, we can
  12. use a complex filter graph. For example, the following one:
  13. @example
  14. input --> split --> fifo -----------------------> overlay --> output
  15. | ^
  16. | |
  17. +------> fifo --> crop --> vflip --------+
  18. @end example
  19. splits the stream in two streams, sends one stream through the crop filter
  20. and the vflip filter before merging it back with the other stream by
  21. overlaying it on top. You can use the following command to achieve this:
  22. @example
  23. ffmpeg -i input -vf "[in] split [T1], fifo, [T2] overlay=0:H/2 [out]; [T1] fifo, crop=iw:ih/2:0:ih/2, vflip [T2]" output
  24. @end example
  25. The result will be that in output the top half of the video is mirrored
  26. onto the bottom half.
  27. Filters are loaded using the @var{-vf} or @var{-af} option passed to
  28. @command{ffmpeg} or to @command{ffplay}. Filters in the same linear
  29. chain are separated by commas. In our example, @var{split, fifo,
  30. overlay} are in one linear chain, and @var{fifo, crop, vflip} are in
  31. another. The points where the linear chains join are labeled by names
  32. enclosed in square brackets. In our example, that is @var{[T1]} and
  33. @var{[T2]}. The special labels @var{[in]} and @var{[out]} are the points
  34. where video is input and output.
  35. Some filters take in input a list of parameters: they are specified
  36. after the filter name and an equal sign, and are separated from each other
  37. by a colon.
  38. There exist so-called @var{source filters} that do not have an
  39. audio/video input, and @var{sink filters} that will not have audio/video
  40. output.
  41. @c man end FILTERING INTRODUCTION
  42. @chapter graph2dot
  43. @c man begin GRAPH2DOT
  44. The @file{graph2dot} program included in the FFmpeg @file{tools}
  45. directory can be used to parse a filter graph description and issue a
  46. corresponding textual representation in the dot language.
  47. Invoke the command:
  48. @example
  49. graph2dot -h
  50. @end example
  51. to see how to use @file{graph2dot}.
  52. You can then pass the dot description to the @file{dot} program (from
  53. the graphviz suite of programs) and obtain a graphical representation
  54. of the filter graph.
  55. For example the sequence of commands:
  56. @example
  57. echo @var{GRAPH_DESCRIPTION} | \
  58. tools/graph2dot -o graph.tmp && \
  59. dot -Tpng graph.tmp -o graph.png && \
  60. display graph.png
  61. @end example
  62. can be used to create and display an image representing the graph
  63. described by the @var{GRAPH_DESCRIPTION} string. Note that this string must be
  64. a complete self-contained graph, with its inputs and outputs explicitly defined.
  65. For example if your command line is of the form:
  66. @example
  67. ffmpeg -i infile -vf scale=640:360 outfile
  68. @end example
  69. your @var{GRAPH_DESCRIPTION} string will need to be of the form:
  70. @example
  71. nullsrc,scale=640:360,nullsink
  72. @end example
  73. you may also need to set the @var{nullsrc} parameters and add a @var{format}
  74. filter in order to simulate a specific input file.
  75. @c man end GRAPH2DOT
  76. @chapter Filtergraph description
  77. @c man begin FILTERGRAPH DESCRIPTION
  78. A filtergraph is a directed graph of connected filters. It can contain
  79. cycles, and there can be multiple links between a pair of
  80. filters. Each link has one input pad on one side connecting it to one
  81. filter from which it takes its input, and one output pad on the other
  82. side connecting it to the one filter accepting its output.
  83. Each filter in a filtergraph is an instance of a filter class
  84. registered in the application, which defines the features and the
  85. number of input and output pads of the filter.
  86. A filter with no input pads is called a "source", a filter with no
  87. output pads is called a "sink".
  88. @anchor{Filtergraph syntax}
  89. @section Filtergraph syntax
  90. A filtergraph can be represented using a textual representation, which is
  91. recognized by the @option{-filter}/@option{-vf} and @option{-filter_complex}
  92. options in @command{ffmpeg} and @option{-vf} in @command{ffplay}, and by the
  93. @code{avfilter_graph_parse()}/@code{avfilter_graph_parse2()} function defined in
  94. @file{libavfilter/avfiltergraph.h}.
  95. A filterchain consists of a sequence of connected filters, each one
  96. connected to the previous one in the sequence. A filterchain is
  97. represented by a list of ","-separated filter descriptions.
  98. A filtergraph consists of a sequence of filterchains. A sequence of
  99. filterchains is represented by a list of ";"-separated filterchain
  100. descriptions.
  101. A filter is represented by a string of the form:
  102. [@var{in_link_1}]...[@var{in_link_N}]@var{filter_name}=@var{arguments}[@var{out_link_1}]...[@var{out_link_M}]
  103. @var{filter_name} is the name of the filter class of which the
  104. described filter is an instance of, and has to be the name of one of
  105. the filter classes registered in the program.
  106. The name of the filter class is optionally followed by a string
  107. "=@var{arguments}".
  108. @var{arguments} is a string which contains the parameters used to
  109. initialize the filter instance, and are described in the filter
  110. descriptions below.
  111. The list of arguments can be quoted using the character "'" as initial
  112. and ending mark, and the character '\' for escaping the characters
  113. within the quoted text; otherwise the argument string is considered
  114. terminated when the next special character (belonging to the set
  115. "[]=;,") is encountered.
  116. The name and arguments of the filter are optionally preceded and
  117. followed by a list of link labels.
  118. A link label allows to name a link and associate it to a filter output
  119. or input pad. The preceding labels @var{in_link_1}
  120. ... @var{in_link_N}, are associated to the filter input pads,
  121. the following labels @var{out_link_1} ... @var{out_link_M}, are
  122. associated to the output pads.
  123. When two link labels with the same name are found in the
  124. filtergraph, a link between the corresponding input and output pad is
  125. created.
  126. If an output pad is not labelled, it is linked by default to the first
  127. unlabelled input pad of the next filter in the filterchain.
  128. For example in the filterchain:
  129. @example
  130. nullsrc, split[L1], [L2]overlay, nullsink
  131. @end example
  132. the split filter instance has two output pads, and the overlay filter
  133. instance two input pads. The first output pad of split is labelled
  134. "L1", the first input pad of overlay is labelled "L2", and the second
  135. output pad of split is linked to the second input pad of overlay,
  136. which are both unlabelled.
  137. In a complete filterchain all the unlabelled filter input and output
  138. pads must be connected. A filtergraph is considered valid if all the
  139. filter input and output pads of all the filterchains are connected.
  140. Libavfilter will automatically insert scale filters where format
  141. conversion is required. It is possible to specify swscale flags
  142. for those automatically inserted scalers by prepending
  143. @code{sws_flags=@var{flags};}
  144. to the filtergraph description.
  145. Follows a BNF description for the filtergraph syntax:
  146. @example
  147. @var{NAME} ::= sequence of alphanumeric characters and '_'
  148. @var{LINKLABEL} ::= "[" @var{NAME} "]"
  149. @var{LINKLABELS} ::= @var{LINKLABEL} [@var{LINKLABELS}]
  150. @var{FILTER_ARGUMENTS} ::= sequence of chars (eventually quoted)
  151. @var{FILTER} ::= [@var{LINKNAMES}] @var{NAME} ["=" @var{ARGUMENTS}] [@var{LINKNAMES}]
  152. @var{FILTERCHAIN} ::= @var{FILTER} [,@var{FILTERCHAIN}]
  153. @var{FILTERGRAPH} ::= [sws_flags=@var{flags};] @var{FILTERCHAIN} [;@var{FILTERGRAPH}]
  154. @end example
  155. @section Notes on filtergraph escaping
  156. Some filter arguments require the use of special characters, typically
  157. @code{:} to separate key=value pairs in a named options list. In this
  158. case the user should perform a first level escaping when specifying
  159. the filter arguments. For example, consider the following literal
  160. string to be embedded in the @ref{drawtext} filter arguments:
  161. @example
  162. this is a 'string': may contain one, or more, special characters
  163. @end example
  164. Since @code{:} is special for the filter arguments syntax, it needs to
  165. be escaped, so you get:
  166. @example
  167. text=this is a \'string\'\: may contain one, or more, special characters
  168. @end example
  169. A second level of escaping is required when embedding the filter
  170. arguments in a filtergraph description, in order to escape all the
  171. filtergraph special characters. Thus the example above becomes:
  172. @example
  173. drawtext=text=this is a \\\'string\\\'\\: may contain one\, or more\, special characters
  174. @end example
  175. Finally an additional level of escaping may be needed when writing the
  176. filtergraph description in a shell command, which depends on the
  177. escaping rules of the adopted shell. For example, assuming that
  178. @code{\} is special and needs to be escaped with another @code{\}, the
  179. previous string will finally result in:
  180. @example
  181. -vf "drawtext=text=this is a \\\\\\'string\\\\\\'\\\\: may contain one\\, or more\\, special characters"
  182. @end example
  183. Sometimes, it might be more convenient to employ quoting in place of
  184. escaping. For example the string:
  185. @example
  186. Caesar: tu quoque, Brute, fili mi
  187. @end example
  188. Can be quoted in the filter arguments as:
  189. @example
  190. text='Caesar: tu quoque, Brute, fili mi'
  191. @end example
  192. And finally inserted in a filtergraph like:
  193. @example
  194. drawtext=text=\'Caesar: tu quoque\, Brute\, fili mi\'
  195. @end example
  196. See the @ref{quoting_and_escaping, Quoting and escaping} section for
  197. more information about the escaping and quoting rules adopted by
  198. FFmpeg.
  199. @c man end FILTERGRAPH DESCRIPTION
  200. @chapter Audio Filters
  201. @c man begin AUDIO FILTERS
  202. When you configure your FFmpeg build, you can disable any of the
  203. existing filters using @code{--disable-filters}.
  204. The configure output will show the audio filters included in your
  205. build.
  206. Below is a description of the currently available audio filters.
  207. @section aconvert
  208. Convert the input audio format to the specified formats.
  209. The filter accepts a string of the form:
  210. "@var{sample_format}:@var{channel_layout}".
  211. @var{sample_format} specifies the sample format, and can be a string or the
  212. corresponding numeric value defined in @file{libavutil/samplefmt.h}. Use 'p'
  213. suffix for a planar sample format.
  214. @var{channel_layout} specifies the channel layout, and can be a string
  215. or the corresponding number value defined in @file{libavutil/channel_layout.h}.
  216. The special parameter "auto", signifies that the filter will
  217. automatically select the output format depending on the output filter.
  218. Some examples follow.
  219. @itemize
  220. @item
  221. Convert input to float, planar, stereo:
  222. @example
  223. aconvert=fltp:stereo
  224. @end example
  225. @item
  226. Convert input to unsigned 8-bit, automatically select out channel layout:
  227. @example
  228. aconvert=u8:auto
  229. @end example
  230. @end itemize
  231. @section aformat
  232. Convert the input audio to one of the specified formats. The framework will
  233. negotiate the most appropriate format to minimize conversions.
  234. The filter accepts the following named parameters:
  235. @table @option
  236. @item sample_fmts
  237. A comma-separated list of requested sample formats.
  238. @item sample_rates
  239. A comma-separated list of requested sample rates.
  240. @item channel_layouts
  241. A comma-separated list of requested channel layouts.
  242. @end table
  243. If a parameter is omitted, all values are allowed.
  244. For example to force the output to either unsigned 8-bit or signed 16-bit stereo:
  245. @example
  246. aformat=sample_fmts\=u8\,s16:channel_layouts\=stereo
  247. @end example
  248. @section amerge
  249. Merge two or more audio streams into a single multi-channel stream.
  250. The filter accepts the following named options:
  251. @table @option
  252. @item inputs
  253. Set the number of inputs. Default is 2.
  254. @end table
  255. If the channel layouts of the inputs are disjoint, and therefore compatible,
  256. the channel layout of the output will be set accordingly and the channels
  257. will be reordered as necessary. If the channel layouts of the inputs are not
  258. disjoint, the output will have all the channels of the first input then all
  259. the channels of the second input, in that order, and the channel layout of
  260. the output will be the default value corresponding to the total number of
  261. channels.
  262. For example, if the first input is in 2.1 (FL+FR+LF) and the second input
  263. is FC+BL+BR, then the output will be in 5.1, with the channels in the
  264. following order: a1, a2, b1, a3, b2, b3 (a1 is the first channel of the
  265. first input, b1 is the first channel of the second input).
  266. On the other hand, if both input are in stereo, the output channels will be
  267. in the default order: a1, a2, b1, b2, and the channel layout will be
  268. arbitrarily set to 4.0, which may or may not be the expected value.
  269. All inputs must have the same sample rate, and format.
  270. If inputs do not have the same duration, the output will stop with the
  271. shortest.
  272. Example: merge two mono files into a stereo stream:
  273. @example
  274. amovie=left.wav [l] ; amovie=right.mp3 [r] ; [l] [r] amerge
  275. @end example
  276. Example: multiple merges:
  277. @example
  278. ffmpeg -f lavfi -i "
  279. amovie=input.mkv:si=0 [a0];
  280. amovie=input.mkv:si=1 [a1];
  281. amovie=input.mkv:si=2 [a2];
  282. amovie=input.mkv:si=3 [a3];
  283. amovie=input.mkv:si=4 [a4];
  284. amovie=input.mkv:si=5 [a5];
  285. [a0][a1][a2][a3][a4][a5] amerge=inputs=6" -c:a pcm_s16le output.mkv
  286. @end example
  287. @section amix
  288. Mixes multiple audio inputs into a single output.
  289. For example
  290. @example
  291. ffmpeg -i INPUT1 -i INPUT2 -i INPUT3 -filter_complex amix=inputs=3:duration=first:dropout_transition=3 OUTPUT
  292. @end example
  293. will mix 3 input audio streams to a single output with the same duration as the
  294. first input and a dropout transition time of 3 seconds.
  295. The filter accepts the following named parameters:
  296. @table @option
  297. @item inputs
  298. Number of inputs. If unspecified, it defaults to 2.
  299. @item duration
  300. How to determine the end-of-stream.
  301. @table @option
  302. @item longest
  303. Duration of longest input. (default)
  304. @item shortest
  305. Duration of shortest input.
  306. @item first
  307. Duration of first input.
  308. @end table
  309. @item dropout_transition
  310. Transition time, in seconds, for volume renormalization when an input
  311. stream ends. The default value is 2 seconds.
  312. @end table
  313. @section anull
  314. Pass the audio source unchanged to the output.
  315. @section aresample
  316. Resample the input audio to the specified sample rate.
  317. The filter accepts exactly one parameter, the output sample rate. If not
  318. specified then the filter will automatically convert between its input
  319. and output sample rates.
  320. For example, to resample the input audio to 44100Hz:
  321. @example
  322. aresample=44100
  323. @end example
  324. @section asetnsamples
  325. Set the number of samples per each output audio frame.
  326. The last output packet may contain a different number of samples, as
  327. the filter will flush all the remaining samples when the input audio
  328. signal its end.
  329. The filter accepts parameters as a list of @var{key}=@var{value} pairs,
  330. separated by ":".
  331. @table @option
  332. @item nb_out_samples, n
  333. Set the number of frames per each output audio frame. The number is
  334. intended as the number of samples @emph{per each channel}.
  335. Default value is 1024.
  336. @item pad, p
  337. If set to 1, the filter will pad the last audio frame with zeroes, so
  338. that the last frame will contain the same number of samples as the
  339. previous ones. Default value is 1.
  340. @end table
  341. For example, to set the number of per-frame samples to 1234 and
  342. disable padding for the last frame, use:
  343. @example
  344. asetnsamples=n=1234:p=0
  345. @end example
  346. @section ashowinfo
  347. Show a line containing various information for each input audio frame.
  348. The input audio is not modified.
  349. The shown line contains a sequence of key/value pairs of the form
  350. @var{key}:@var{value}.
  351. A description of each shown parameter follows:
  352. @table @option
  353. @item n
  354. sequential number of the input frame, starting from 0
  355. @item pts
  356. Presentation timestamp of the input frame, in time base units; the time base
  357. depends on the filter input pad, and is usually 1/@var{sample_rate}.
  358. @item pts_time
  359. presentation timestamp of the input frame in seconds
  360. @item pos
  361. position of the frame in the input stream, -1 if this information in
  362. unavailable and/or meaningless (for example in case of synthetic audio)
  363. @item fmt
  364. sample format
  365. @item chlayout
  366. channel layout
  367. @item rate
  368. sample rate for the audio frame
  369. @item nb_samples
  370. number of samples (per channel) in the frame
  371. @item checksum
  372. Adler-32 checksum (printed in hexadecimal) of the audio data. For planar audio
  373. the data is treated as if all the planes were concatenated.
  374. @item plane_checksums
  375. A list of Adler-32 checksums for each data plane.
  376. @end table
  377. @section asplit
  378. Split input audio into several identical outputs.
  379. The filter accepts a single parameter which specifies the number of outputs. If
  380. unspecified, it defaults to 2.
  381. For example:
  382. @example
  383. [in] asplit [out0][out1]
  384. @end example
  385. will create two separate outputs from the same input.
  386. To create 3 or more outputs, you need to specify the number of
  387. outputs, like in:
  388. @example
  389. [in] asplit=3 [out0][out1][out2]
  390. @end example
  391. @example
  392. ffmpeg -i INPUT -filter_complex asplit=5 OUTPUT
  393. @end example
  394. will create 5 copies of the input audio.
  395. @section astreamsync
  396. Forward two audio streams and control the order the buffers are forwarded.
  397. The argument to the filter is an expression deciding which stream should be
  398. forwarded next: if the result is negative, the first stream is forwarded; if
  399. the result is positive or zero, the second stream is forwarded. It can use
  400. the following variables:
  401. @table @var
  402. @item b1 b2
  403. number of buffers forwarded so far on each stream
  404. @item s1 s2
  405. number of samples forwarded so far on each stream
  406. @item t1 t2
  407. current timestamp of each stream
  408. @end table
  409. The default value is @code{t1-t2}, which means to always forward the stream
  410. that has a smaller timestamp.
  411. Example: stress-test @code{amerge} by randomly sending buffers on the wrong
  412. input, while avoiding too much of a desynchronization:
  413. @example
  414. amovie=file.ogg [a] ; amovie=file.mp3 [b] ;
  415. [a] [b] astreamsync=(2*random(1))-1+tanh(5*(t1-t2)) [a2] [b2] ;
  416. [a2] [b2] amerge
  417. @end example
  418. @section atempo
  419. Adjust audio tempo.
  420. The filter accepts exactly one parameter, the audio tempo. If not
  421. specified then the filter will assume nominal 1.0 tempo. Tempo must
  422. be in the [0.5, 2.0] range.
  423. For example, to slow down audio to 80% tempo:
  424. @example
  425. atempo=0.8
  426. @end example
  427. For example, to speed up audio to 125% tempo:
  428. @example
  429. atempo=1.25
  430. @end example
  431. @section earwax
  432. Make audio easier to listen to on headphones.
  433. This filter adds `cues' to 44.1kHz stereo (i.e. audio CD format) audio
  434. so that when listened to on headphones the stereo image is moved from
  435. inside your head (standard for headphones) to outside and in front of
  436. the listener (standard for speakers).
  437. Ported from SoX.
  438. @section pan
  439. Mix channels with specific gain levels. The filter accepts the output
  440. channel layout followed by a set of channels definitions.
  441. This filter is also designed to remap efficiently the channels of an audio
  442. stream.
  443. The filter accepts parameters of the form:
  444. "@var{l}:@var{outdef}:@var{outdef}:..."
  445. @table @option
  446. @item l
  447. output channel layout or number of channels
  448. @item outdef
  449. output channel specification, of the form:
  450. "@var{out_name}=[@var{gain}*]@var{in_name}[+[@var{gain}*]@var{in_name}...]"
  451. @item out_name
  452. output channel to define, either a channel name (FL, FR, etc.) or a channel
  453. number (c0, c1, etc.)
  454. @item gain
  455. multiplicative coefficient for the channel, 1 leaving the volume unchanged
  456. @item in_name
  457. input channel to use, see out_name for details; it is not possible to mix
  458. named and numbered input channels
  459. @end table
  460. If the `=' in a channel specification is replaced by `<', then the gains for
  461. that specification will be renormalized so that the total is 1, thus
  462. avoiding clipping noise.
  463. @subsection Mixing examples
  464. For example, if you want to down-mix from stereo to mono, but with a bigger
  465. factor for the left channel:
  466. @example
  467. pan=1:c0=0.9*c0+0.1*c1
  468. @end example
  469. A customized down-mix to stereo that works automatically for 3-, 4-, 5- and
  470. 7-channels surround:
  471. @example
  472. pan=stereo: FL < FL + 0.5*FC + 0.6*BL + 0.6*SL : FR < FR + 0.5*FC + 0.6*BR + 0.6*SR
  473. @end example
  474. Note that @command{ffmpeg} integrates a default down-mix (and up-mix) system
  475. that should be preferred (see "-ac" option) unless you have very specific
  476. needs.
  477. @subsection Remapping examples
  478. The channel remapping will be effective if, and only if:
  479. @itemize
  480. @item gain coefficients are zeroes or ones,
  481. @item only one input per channel output,
  482. @end itemize
  483. If all these conditions are satisfied, the filter will notify the user ("Pure
  484. channel mapping detected"), and use an optimized and lossless method to do the
  485. remapping.
  486. For example, if you have a 5.1 source and want a stereo audio stream by
  487. dropping the extra channels:
  488. @example
  489. pan="stereo: c0=FL : c1=FR"
  490. @end example
  491. Given the same source, you can also switch front left and front right channels
  492. and keep the input channel layout:
  493. @example
  494. pan="5.1: c0=c1 : c1=c0 : c2=c2 : c3=c3 : c4=c4 : c5=c5"
  495. @end example
  496. If the input is a stereo audio stream, you can mute the front left channel (and
  497. still keep the stereo channel layout) with:
  498. @example
  499. pan="stereo:c1=c1"
  500. @end example
  501. Still with a stereo audio stream input, you can copy the right channel in both
  502. front left and right:
  503. @example
  504. pan="stereo: c0=FR : c1=FR"
  505. @end example
  506. @section silencedetect
  507. Detect silence in an audio stream.
  508. This filter logs a message when it detects that the input audio volume is less
  509. or equal to a noise tolerance value for a duration greater or equal to the
  510. minimum detected noise duration.
  511. The printed times and duration are expressed in seconds.
  512. @table @option
  513. @item duration, d
  514. Set silence duration until notification (default is 2 seconds).
  515. @item noise, n
  516. Set noise tolerance. Can be specified in dB (in case "dB" is appended to the
  517. specified value) or amplitude ratio. Default is -60dB, or 0.001.
  518. @end table
  519. Detect 5 seconds of silence with -50dB noise tolerance:
  520. @example
  521. silencedetect=n=-50dB:d=5
  522. @end example
  523. Complete example with @command{ffmpeg} to detect silence with 0.0001 noise
  524. tolerance in @file{silence.mp3}:
  525. @example
  526. ffmpeg -f lavfi -i amovie=silence.mp3,silencedetect=noise=0.0001 -f null -
  527. @end example
  528. @section volume
  529. Adjust the input audio volume.
  530. The filter accepts exactly one parameter @var{vol}, which expresses
  531. how the audio volume will be increased or decreased.
  532. Output values are clipped to the maximum value.
  533. If @var{vol} is expressed as a decimal number, the output audio
  534. volume is given by the relation:
  535. @example
  536. @var{output_volume} = @var{vol} * @var{input_volume}
  537. @end example
  538. If @var{vol} is expressed as a decimal number followed by the string
  539. "dB", the value represents the requested change in decibels of the
  540. input audio power, and the output audio volume is given by the
  541. relation:
  542. @example
  543. @var{output_volume} = 10^(@var{vol}/20) * @var{input_volume}
  544. @end example
  545. Otherwise @var{vol} is considered an expression and its evaluated
  546. value is used for computing the output audio volume according to the
  547. first relation.
  548. Default value for @var{vol} is 1.0.
  549. @subsection Examples
  550. @itemize
  551. @item
  552. Half the input audio volume:
  553. @example
  554. volume=0.5
  555. @end example
  556. The above example is equivalent to:
  557. @example
  558. volume=1/2
  559. @end example
  560. @item
  561. Decrease input audio power by 12 decibels:
  562. @example
  563. volume=-12dB
  564. @end example
  565. @end itemize
  566. @section volumedetect
  567. Detect the volume of the input video.
  568. The filter has no parameters. The input is not modified. Statistics about
  569. the volume will be printed in the log when the input stream end is reached.
  570. In particular it will show the mean volume (root mean square), maximum
  571. volume (on a per-sample basis), and the beginning of an histogram of the
  572. registered volume values (from the maximum value to a cumulated 1/1000 of
  573. the samples).
  574. All volumes are in decibels relative to the maximum PCM value.
  575. Here is an excerpt of the output:
  576. @example
  577. [Parsed_volumedetect_0 @ 0xa23120] mean_volume: -27 dB
  578. [Parsed_volumedetect_0 @ 0xa23120] max_volume: -4 dB
  579. [Parsed_volumedetect_0 @ 0xa23120] histogram_4db: 6
  580. [Parsed_volumedetect_0 @ 0xa23120] histogram_5db: 62
  581. [Parsed_volumedetect_0 @ 0xa23120] histogram_6db: 286
  582. [Parsed_volumedetect_0 @ 0xa23120] histogram_7db: 1042
  583. [Parsed_volumedetect_0 @ 0xa23120] histogram_8db: 2551
  584. [Parsed_volumedetect_0 @ 0xa23120] histogram_9db: 4609
  585. [Parsed_volumedetect_0 @ 0xa23120] histogram_10db: 8409
  586. @end example
  587. It means that:
  588. @itemize
  589. @item
  590. The mean square energy is approximately -27 dB, or 10^-2.7.
  591. @item
  592. The largest sample is at -4 dB, or more precisely between -4 dB and -5 dB.
  593. @item
  594. There are 6 samples at -4 dB, 62 at -5 dB, 286 at -6 dB, etc.
  595. @end itemize
  596. In other words, raising the volume by +4 dB does not cause any clipping,
  597. raising it by +5 dB causes clipping for 6 samples, etc.
  598. @section asyncts
  599. Synchronize audio data with timestamps by squeezing/stretching it and/or
  600. dropping samples/adding silence when needed.
  601. The filter accepts the following named parameters:
  602. @table @option
  603. @item compensate
  604. Enable stretching/squeezing the data to make it match the timestamps. Disabled
  605. by default. When disabled, time gaps are covered with silence.
  606. @item min_delta
  607. Minimum difference between timestamps and audio data (in seconds) to trigger
  608. adding/dropping samples. Default value is 0.1. If you get non-perfect sync with
  609. this filter, try setting this parameter to 0.
  610. @item max_comp
  611. Maximum compensation in samples per second. Relevant only with compensate=1.
  612. Default value 500.
  613. @item first_pts
  614. Assume the first pts should be this value.
  615. This allows for padding/trimming at the start of stream. By default, no
  616. assumption is made about the first frame's expected pts, so no padding or
  617. trimming is done. For example, this could be set to 0 to pad the beginning with
  618. silence if an audio stream starts after the video stream.
  619. @end table
  620. @section channelsplit
  621. Split each channel in input audio stream into a separate output stream.
  622. This filter accepts the following named parameters:
  623. @table @option
  624. @item channel_layout
  625. Channel layout of the input stream. Default is "stereo".
  626. @end table
  627. For example, assuming a stereo input MP3 file
  628. @example
  629. ffmpeg -i in.mp3 -filter_complex channelsplit out.mkv
  630. @end example
  631. will create an output Matroska file with two audio streams, one containing only
  632. the left channel and the other the right channel.
  633. To split a 5.1 WAV file into per-channel files
  634. @example
  635. ffmpeg -i in.wav -filter_complex
  636. 'channelsplit=channel_layout=5.1[FL][FR][FC][LFE][SL][SR]'
  637. -map '[FL]' front_left.wav -map '[FR]' front_right.wav -map '[FC]'
  638. front_center.wav -map '[LFE]' lfe.wav -map '[SL]' side_left.wav -map '[SR]'
  639. side_right.wav
  640. @end example
  641. @section channelmap
  642. Remap input channels to new locations.
  643. This filter accepts the following named parameters:
  644. @table @option
  645. @item channel_layout
  646. Channel layout of the output stream.
  647. @item map
  648. Map channels from input to output. The argument is a comma-separated list of
  649. mappings, each in the @code{@var{in_channel}-@var{out_channel}} or
  650. @var{in_channel} form. @var{in_channel} can be either the name of the input
  651. channel (e.g. FL for front left) or its index in the input channel layout.
  652. @var{out_channel} is the name of the output channel or its index in the output
  653. channel layout. If @var{out_channel} is not given then it is implicitly an
  654. index, starting with zero and increasing by one for each mapping.
  655. @end table
  656. If no mapping is present, the filter will implicitly map input channels to
  657. output channels preserving index.
  658. For example, assuming a 5.1+downmix input MOV file
  659. @example
  660. ffmpeg -i in.mov -filter 'channelmap=map=DL-FL\,DR-FR' out.wav
  661. @end example
  662. will create an output WAV file tagged as stereo from the downmix channels of
  663. the input.
  664. To fix a 5.1 WAV improperly encoded in AAC's native channel order
  665. @example
  666. ffmpeg -i in.wav -filter 'channelmap=1\,2\,0\,5\,3\,4:channel_layout=5.1' out.wav
  667. @end example
  668. @section join
  669. Join multiple input streams into one multi-channel stream.
  670. The filter accepts the following named parameters:
  671. @table @option
  672. @item inputs
  673. Number of input streams. Defaults to 2.
  674. @item channel_layout
  675. Desired output channel layout. Defaults to stereo.
  676. @item map
  677. Map channels from inputs to output. The argument is a comma-separated list of
  678. mappings, each in the @code{@var{input_idx}.@var{in_channel}-@var{out_channel}}
  679. form. @var{input_idx} is the 0-based index of the input stream. @var{in_channel}
  680. can be either the name of the input channel (e.g. FL for front left) or its
  681. index in the specified input stream. @var{out_channel} is the name of the output
  682. channel.
  683. @end table
  684. The filter will attempt to guess the mappings when those are not specified
  685. explicitly. It does so by first trying to find an unused matching input channel
  686. and if that fails it picks the first unused input channel.
  687. E.g. to join 3 inputs (with properly set channel layouts)
  688. @example
  689. ffmpeg -i INPUT1 -i INPUT2 -i INPUT3 -filter_complex join=inputs=3 OUTPUT
  690. @end example
  691. To build a 5.1 output from 6 single-channel streams:
  692. @example
  693. ffmpeg -i fl -i fr -i fc -i sl -i sr -i lfe -filter_complex
  694. 'join=inputs=6:channel_layout=5.1:map=0.0-FL\,1.0-FR\,2.0-FC\,3.0-SL\,4.0-SR\,5.0-LFE'
  695. out
  696. @end example
  697. @section resample
  698. Convert the audio sample format, sample rate and channel layout. This filter is
  699. not meant to be used directly.
  700. @c man end AUDIO FILTERS
  701. @chapter Audio Sources
  702. @c man begin AUDIO SOURCES
  703. Below is a description of the currently available audio sources.
  704. @section abuffer
  705. Buffer audio frames, and make them available to the filter chain.
  706. This source is mainly intended for a programmatic use, in particular
  707. through the interface defined in @file{libavfilter/asrc_abuffer.h}.
  708. It accepts the following mandatory parameters:
  709. @var{sample_rate}:@var{sample_fmt}:@var{channel_layout}
  710. @table @option
  711. @item sample_rate
  712. The sample rate of the incoming audio buffers.
  713. @item sample_fmt
  714. The sample format of the incoming audio buffers.
  715. Either a sample format name or its corresponging integer representation from
  716. the enum AVSampleFormat in @file{libavutil/samplefmt.h}
  717. @item channel_layout
  718. The channel layout of the incoming audio buffers.
  719. Either a channel layout name from channel_layout_map in
  720. @file{libavutil/channel_layout.c} or its corresponding integer representation
  721. from the AV_CH_LAYOUT_* macros in @file{libavutil/channel_layout.h}
  722. @end table
  723. For example:
  724. @example
  725. abuffer=44100:s16p:stereo
  726. @end example
  727. will instruct the source to accept planar 16bit signed stereo at 44100Hz.
  728. Since the sample format with name "s16p" corresponds to the number
  729. 6 and the "stereo" channel layout corresponds to the value 0x3, this is
  730. equivalent to:
  731. @example
  732. abuffer=44100:6:0x3
  733. @end example
  734. @section aevalsrc
  735. Generate an audio signal specified by an expression.
  736. This source accepts in input one or more expressions (one for each
  737. channel), which are evaluated and used to generate a corresponding
  738. audio signal.
  739. It accepts the syntax: @var{exprs}[::@var{options}].
  740. @var{exprs} is a list of expressions separated by ":", one for each
  741. separate channel. In case the @var{channel_layout} is not
  742. specified, the selected channel layout depends on the number of
  743. provided expressions.
  744. @var{options} is an optional sequence of @var{key}=@var{value} pairs,
  745. separated by ":".
  746. The description of the accepted options follows.
  747. @table @option
  748. @item channel_layout, c
  749. Set the channel layout. The number of channels in the specified layout
  750. must be equal to the number of specified expressions.
  751. @item duration, d
  752. Set the minimum duration of the sourced audio. See the function
  753. @code{av_parse_time()} for the accepted format.
  754. Note that the resulting duration may be greater than the specified
  755. duration, as the generated audio is always cut at the end of a
  756. complete frame.
  757. If not specified, or the expressed duration is negative, the audio is
  758. supposed to be generated forever.
  759. @item nb_samples, n
  760. Set the number of samples per channel per each output frame,
  761. default to 1024.
  762. @item sample_rate, s
  763. Specify the sample rate, default to 44100.
  764. @end table
  765. Each expression in @var{exprs} can contain the following constants:
  766. @table @option
  767. @item n
  768. number of the evaluated sample, starting from 0
  769. @item t
  770. time of the evaluated sample expressed in seconds, starting from 0
  771. @item s
  772. sample rate
  773. @end table
  774. @subsection Examples
  775. @itemize
  776. @item
  777. Generate silence:
  778. @example
  779. aevalsrc=0
  780. @end example
  781. @item
  782. Generate a sin signal with frequency of 440 Hz, set sample rate to
  783. 8000 Hz:
  784. @example
  785. aevalsrc="sin(440*2*PI*t)::s=8000"
  786. @end example
  787. @item
  788. Generate a two channels signal, specify the channel layout (Front
  789. Center + Back Center) explicitly:
  790. @example
  791. aevalsrc="sin(420*2*PI*t):cos(430*2*PI*t)::c=FC|BC"
  792. @end example
  793. @item
  794. Generate white noise:
  795. @example
  796. aevalsrc="-2+random(0)"
  797. @end example
  798. @item
  799. Generate an amplitude modulated signal:
  800. @example
  801. aevalsrc="sin(10*2*PI*t)*sin(880*2*PI*t)"
  802. @end example
  803. @item
  804. Generate 2.5 Hz binaural beats on a 360 Hz carrier:
  805. @example
  806. aevalsrc="0.1*sin(2*PI*(360-2.5/2)*t) : 0.1*sin(2*PI*(360+2.5/2)*t)"
  807. @end example
  808. @end itemize
  809. @section anullsrc
  810. Null audio source, return unprocessed audio frames. It is mainly useful
  811. as a template and to be employed in analysis / debugging tools, or as
  812. the source for filters which ignore the input data (for example the sox
  813. synth filter).
  814. It accepts an optional sequence of @var{key}=@var{value} pairs,
  815. separated by ":".
  816. The description of the accepted options follows.
  817. @table @option
  818. @item sample_rate, s
  819. Specify the sample rate, and defaults to 44100.
  820. @item channel_layout, cl
  821. Specify the channel layout, and can be either an integer or a string
  822. representing a channel layout. The default value of @var{channel_layout}
  823. is "stereo".
  824. Check the channel_layout_map definition in
  825. @file{libavutil/channel_layout.c} for the mapping between strings and
  826. channel layout values.
  827. @item nb_samples, n
  828. Set the number of samples per requested frames.
  829. @end table
  830. Follow some examples:
  831. @example
  832. # set the sample rate to 48000 Hz and the channel layout to AV_CH_LAYOUT_MONO.
  833. anullsrc=r=48000:cl=4
  834. # same as
  835. anullsrc=r=48000:cl=mono
  836. @end example
  837. @section abuffer
  838. Buffer audio frames, and make them available to the filter chain.
  839. This source is not intended to be part of user-supplied graph descriptions but
  840. for insertion by calling programs through the interface defined in
  841. @file{libavfilter/buffersrc.h}.
  842. It accepts the following named parameters:
  843. @table @option
  844. @item time_base
  845. Timebase which will be used for timestamps of submitted frames. It must be
  846. either a floating-point number or in @var{numerator}/@var{denominator} form.
  847. @item sample_rate
  848. Audio sample rate.
  849. @item sample_fmt
  850. Name of the sample format, as returned by @code{av_get_sample_fmt_name()}.
  851. @item channel_layout
  852. Channel layout of the audio data, in the form that can be accepted by
  853. @code{av_get_channel_layout()}.
  854. @end table
  855. All the parameters need to be explicitly defined.
  856. @section flite
  857. Synthesize a voice utterance using the libflite library.
  858. To enable compilation of this filter you need to configure FFmpeg with
  859. @code{--enable-libflite}.
  860. Note that the flite library is not thread-safe.
  861. The source accepts parameters as a list of @var{key}=@var{value} pairs,
  862. separated by ":".
  863. The description of the accepted parameters follows.
  864. @table @option
  865. @item list_voices
  866. If set to 1, list the names of the available voices and exit
  867. immediately. Default value is 0.
  868. @item nb_samples, n
  869. Set the maximum number of samples per frame. Default value is 512.
  870. @item textfile
  871. Set the filename containing the text to speak.
  872. @item text
  873. Set the text to speak.
  874. @item voice, v
  875. Set the voice to use for the speech synthesis. Default value is
  876. @code{kal}. See also the @var{list_voices} option.
  877. @end table
  878. @subsection Examples
  879. @itemize
  880. @item
  881. Read from file @file{speech.txt}, and synthetize the text using the
  882. standard flite voice:
  883. @example
  884. flite=textfile=speech.txt
  885. @end example
  886. @item
  887. Read the specified text selecting the @code{slt} voice:
  888. @example
  889. flite=text='So fare thee well, poor devil of a Sub-Sub, whose commentator I am':voice=slt
  890. @end example
  891. @item
  892. Input text to ffmpeg:
  893. @example
  894. ffmpeg -f lavfi -i flite=text='So fare thee well, poor devil of a Sub-Sub, whose commentator I am':voice=slt
  895. @end example
  896. @item
  897. Make @file{ffplay} speak the specified text, using @code{flite} and
  898. the @code{lavfi} device:
  899. @example
  900. ffplay -f lavfi flite=text='No more be grieved for which that thou hast done.'
  901. @end example
  902. @end itemize
  903. For more information about libflite, check:
  904. @url{http://www.speech.cs.cmu.edu/flite/}
  905. @c man end AUDIO SOURCES
  906. @chapter Audio Sinks
  907. @c man begin AUDIO SINKS
  908. Below is a description of the currently available audio sinks.
  909. @section abuffersink
  910. Buffer audio frames, and make them available to the end of filter chain.
  911. This sink is mainly intended for programmatic use, in particular
  912. through the interface defined in @file{libavfilter/buffersink.h}.
  913. It requires a pointer to an AVABufferSinkContext structure, which
  914. defines the incoming buffers' formats, to be passed as the opaque
  915. parameter to @code{avfilter_init_filter} for initialization.
  916. @section anullsink
  917. Null audio sink, do absolutely nothing with the input audio. It is
  918. mainly useful as a template and to be employed in analysis / debugging
  919. tools.
  920. @section abuffersink
  921. This sink is intended for programmatic use. Frames that arrive on this sink can
  922. be retrieved by the calling program using the interface defined in
  923. @file{libavfilter/buffersink.h}.
  924. This filter accepts no parameters.
  925. @c man end AUDIO SINKS
  926. @chapter Video Filters
  927. @c man begin VIDEO FILTERS
  928. When you configure your FFmpeg build, you can disable any of the
  929. existing filters using @code{--disable-filters}.
  930. The configure output will show the video filters included in your
  931. build.
  932. Below is a description of the currently available video filters.
  933. @section alphaextract
  934. Extract the alpha component from the input as a grayscale video. This
  935. is especially useful with the @var{alphamerge} filter.
  936. @section alphamerge
  937. Add or replace the alpha component of the primary input with the
  938. grayscale value of a second input. This is intended for use with
  939. @var{alphaextract} to allow the transmission or storage of frame
  940. sequences that have alpha in a format that doesn't support an alpha
  941. channel.
  942. For example, to reconstruct full frames from a normal YUV-encoded video
  943. and a separate video created with @var{alphaextract}, you might use:
  944. @example
  945. movie=in_alpha.mkv [alpha]; [in][alpha] alphamerge [out]
  946. @end example
  947. Since this filter is designed for reconstruction, it operates on frame
  948. sequences without considering timestamps, and terminates when either
  949. input reaches end of stream. This will cause problems if your encoding
  950. pipeline drops frames. If you're trying to apply an image as an
  951. overlay to a video stream, consider the @var{overlay} filter instead.
  952. @section ass
  953. Same as the @ref{subtitles} filter, except that it doesn't require libavcodec
  954. and libavformat to work. On the other hand, it is limited to ASS (Advanced
  955. Substation Alpha) subtitles files.
  956. @section bbox
  957. Compute the bounding box for the non-black pixels in the input frame
  958. luminance plane.
  959. This filter computes the bounding box containing all the pixels with a
  960. luminance value greater than the minimum allowed value.
  961. The parameters describing the bounding box are printed on the filter
  962. log.
  963. @section blackdetect
  964. Detect video intervals that are (almost) completely black. Can be
  965. useful to detect chapter transitions, commercials, or invalid
  966. recordings. Output lines contains the time for the start, end and
  967. duration of the detected black interval expressed in seconds.
  968. In order to display the output lines, you need to set the loglevel at
  969. least to the AV_LOG_INFO value.
  970. This filter accepts a list of options in the form of
  971. @var{key}=@var{value} pairs separated by ":". A description of the
  972. accepted options follows.
  973. @table @option
  974. @item black_min_duration, d
  975. Set the minimum detected black duration expressed in seconds. It must
  976. be a non-negative floating point number.
  977. Default value is 2.0.
  978. @item picture_black_ratio_th, pic_th
  979. Set the threshold for considering a picture "black".
  980. Express the minimum value for the ratio:
  981. @example
  982. @var{nb_black_pixels} / @var{nb_pixels}
  983. @end example
  984. for which a picture is considered black.
  985. Default value is 0.98.
  986. @item pixel_black_th, pix_th
  987. Set the threshold for considering a pixel "black".
  988. The threshold expresses the maximum pixel luminance value for which a
  989. pixel is considered "black". The provided value is scaled according to
  990. the following equation:
  991. @example
  992. @var{absolute_threshold} = @var{luminance_minimum_value} + @var{pixel_black_th} * @var{luminance_range_size}
  993. @end example
  994. @var{luminance_range_size} and @var{luminance_minimum_value} depend on
  995. the input video format, the range is [0-255] for YUV full-range
  996. formats and [16-235] for YUV non full-range formats.
  997. Default value is 0.10.
  998. @end table
  999. The following example sets the maximum pixel threshold to the minimum
  1000. value, and detects only black intervals of 2 or more seconds:
  1001. @example
  1002. blackdetect=d=2:pix_th=0.00
  1003. @end example
  1004. @section blackframe
  1005. Detect frames that are (almost) completely black. Can be useful to
  1006. detect chapter transitions or commercials. Output lines consist of
  1007. the frame number of the detected frame, the percentage of blackness,
  1008. the position in the file if known or -1 and the timestamp in seconds.
  1009. In order to display the output lines, you need to set the loglevel at
  1010. least to the AV_LOG_INFO value.
  1011. The filter accepts the syntax:
  1012. @example
  1013. blackframe[=@var{amount}:[@var{threshold}]]
  1014. @end example
  1015. @var{amount} is the percentage of the pixels that have to be below the
  1016. threshold, and defaults to 98.
  1017. @var{threshold} is the threshold below which a pixel value is
  1018. considered black, and defaults to 32.
  1019. @section boxblur
  1020. Apply boxblur algorithm to the input video.
  1021. This filter accepts the parameters:
  1022. @var{luma_radius}:@var{luma_power}:@var{chroma_radius}:@var{chroma_power}:@var{alpha_radius}:@var{alpha_power}
  1023. Chroma and alpha parameters are optional, if not specified they default
  1024. to the corresponding values set for @var{luma_radius} and
  1025. @var{luma_power}.
  1026. @var{luma_radius}, @var{chroma_radius}, and @var{alpha_radius} represent
  1027. the radius in pixels of the box used for blurring the corresponding
  1028. input plane. They are expressions, and can contain the following
  1029. constants:
  1030. @table @option
  1031. @item w, h
  1032. the input width and height in pixels
  1033. @item cw, ch
  1034. the input chroma image width and height in pixels
  1035. @item hsub, vsub
  1036. horizontal and vertical chroma subsample values. For example for the
  1037. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  1038. @end table
  1039. The radius must be a non-negative number, and must not be greater than
  1040. the value of the expression @code{min(w,h)/2} for the luma and alpha planes,
  1041. and of @code{min(cw,ch)/2} for the chroma planes.
  1042. @var{luma_power}, @var{chroma_power}, and @var{alpha_power} represent
  1043. how many times the boxblur filter is applied to the corresponding
  1044. plane.
  1045. Some examples follow:
  1046. @itemize
  1047. @item
  1048. Apply a boxblur filter with luma, chroma, and alpha radius
  1049. set to 2:
  1050. @example
  1051. boxblur=2:1
  1052. @end example
  1053. @item
  1054. Set luma radius to 2, alpha and chroma radius to 0
  1055. @example
  1056. boxblur=2:1:0:0:0:0
  1057. @end example
  1058. @item
  1059. Set luma and chroma radius to a fraction of the video dimension
  1060. @example
  1061. boxblur=min(h\,w)/10:1:min(cw\,ch)/10:1
  1062. @end example
  1063. @end itemize
  1064. @section colormatrix
  1065. The colormatrix filter allows conversion between any of the following color
  1066. space: BT.709 (@var{bt709}), BT.601 (@var{bt601}), SMPTE-240M (@var{smpte240m})
  1067. and FCC (@var{fcc}).
  1068. The syntax of the parameters is @var{source}:@var{destination}:
  1069. @example
  1070. colormatrix=bt601:smpte240m
  1071. @end example
  1072. @section copy
  1073. Copy the input source unchanged to the output. Mainly useful for
  1074. testing purposes.
  1075. @section crop
  1076. Crop the input video to @var{out_w}:@var{out_h}:@var{x}:@var{y}:@var{keep_aspect}
  1077. The @var{keep_aspect} parameter is optional, if specified and set to a
  1078. non-zero value will force the output display aspect ratio to be the
  1079. same of the input, by changing the output sample aspect ratio.
  1080. The @var{out_w}, @var{out_h}, @var{x}, @var{y} parameters are
  1081. expressions containing the following constants:
  1082. @table @option
  1083. @item x, y
  1084. the computed values for @var{x} and @var{y}. They are evaluated for
  1085. each new frame.
  1086. @item in_w, in_h
  1087. the input width and height
  1088. @item iw, ih
  1089. same as @var{in_w} and @var{in_h}
  1090. @item out_w, out_h
  1091. the output (cropped) width and height
  1092. @item ow, oh
  1093. same as @var{out_w} and @var{out_h}
  1094. @item a
  1095. same as @var{iw} / @var{ih}
  1096. @item sar
  1097. input sample aspect ratio
  1098. @item dar
  1099. input display aspect ratio, it is the same as (@var{iw} / @var{ih}) * @var{sar}
  1100. @item hsub, vsub
  1101. horizontal and vertical chroma subsample values. For example for the
  1102. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  1103. @item n
  1104. the number of input frame, starting from 0
  1105. @item pos
  1106. the position in the file of the input frame, NAN if unknown
  1107. @item t
  1108. timestamp expressed in seconds, NAN if the input timestamp is unknown
  1109. @end table
  1110. The @var{out_w} and @var{out_h} parameters specify the expressions for
  1111. the width and height of the output (cropped) video. They are
  1112. evaluated just at the configuration of the filter.
  1113. The default value of @var{out_w} is "in_w", and the default value of
  1114. @var{out_h} is "in_h".
  1115. The expression for @var{out_w} may depend on the value of @var{out_h},
  1116. and the expression for @var{out_h} may depend on @var{out_w}, but they
  1117. cannot depend on @var{x} and @var{y}, as @var{x} and @var{y} are
  1118. evaluated after @var{out_w} and @var{out_h}.
  1119. The @var{x} and @var{y} parameters specify the expressions for the
  1120. position of the top-left corner of the output (non-cropped) area. They
  1121. are evaluated for each frame. If the evaluated value is not valid, it
  1122. is approximated to the nearest valid value.
  1123. The default value of @var{x} is "(in_w-out_w)/2", and the default
  1124. value for @var{y} is "(in_h-out_h)/2", which set the cropped area at
  1125. the center of the input image.
  1126. The expression for @var{x} may depend on @var{y}, and the expression
  1127. for @var{y} may depend on @var{x}.
  1128. Follow some examples:
  1129. @example
  1130. # crop the central input area with size 100x100
  1131. crop=100:100
  1132. # crop the central input area with size 2/3 of the input video
  1133. "crop=2/3*in_w:2/3*in_h"
  1134. # crop the input video central square
  1135. crop=in_h
  1136. # delimit the rectangle with the top-left corner placed at position
  1137. # 100:100 and the right-bottom corner corresponding to the right-bottom
  1138. # corner of the input image.
  1139. crop=in_w-100:in_h-100:100:100
  1140. # crop 10 pixels from the left and right borders, and 20 pixels from
  1141. # the top and bottom borders
  1142. "crop=in_w-2*10:in_h-2*20"
  1143. # keep only the bottom right quarter of the input image
  1144. "crop=in_w/2:in_h/2:in_w/2:in_h/2"
  1145. # crop height for getting Greek harmony
  1146. "crop=in_w:1/PHI*in_w"
  1147. # trembling effect
  1148. "crop=in_w/2:in_h/2:(in_w-out_w)/2+((in_w-out_w)/2)*sin(n/10):(in_h-out_h)/2 +((in_h-out_h)/2)*sin(n/7)"
  1149. # erratic camera effect depending on timestamp
  1150. "crop=in_w/2:in_h/2:(in_w-out_w)/2+((in_w-out_w)/2)*sin(t*10):(in_h-out_h)/2 +((in_h-out_h)/2)*sin(t*13)"
  1151. # set x depending on the value of y
  1152. "crop=in_w/2:in_h/2:y:10+10*sin(n/10)"
  1153. @end example
  1154. @section cropdetect
  1155. Auto-detect crop size.
  1156. Calculate necessary cropping parameters and prints the recommended
  1157. parameters through the logging system. The detected dimensions
  1158. correspond to the non-black area of the input video.
  1159. It accepts the syntax:
  1160. @example
  1161. cropdetect[=@var{limit}[:@var{round}[:@var{reset}]]]
  1162. @end example
  1163. @table @option
  1164. @item limit
  1165. Threshold, which can be optionally specified from nothing (0) to
  1166. everything (255), defaults to 24.
  1167. @item round
  1168. Value which the width/height should be divisible by, defaults to
  1169. 16. The offset is automatically adjusted to center the video. Use 2 to
  1170. get only even dimensions (needed for 4:2:2 video). 16 is best when
  1171. encoding to most video codecs.
  1172. @item reset
  1173. Counter that determines after how many frames cropdetect will reset
  1174. the previously detected largest video area and start over to detect
  1175. the current optimal crop area. Defaults to 0.
  1176. This can be useful when channel logos distort the video area. 0
  1177. indicates never reset and return the largest area encountered during
  1178. playback.
  1179. @end table
  1180. @section decimate
  1181. This filter drops frames that do not differ greatly from the previous
  1182. frame in order to reduce framerate. The main use of this filter is
  1183. for very-low-bitrate encoding (e.g. streaming over dialup modem), but
  1184. it could in theory be used for fixing movies that were
  1185. inverse-telecined incorrectly.
  1186. It accepts the following parameters:
  1187. @var{max}:@var{hi}:@var{lo}:@var{frac}.
  1188. @table @option
  1189. @item max
  1190. Set the maximum number of consecutive frames which can be dropped (if
  1191. positive), or the minimum interval between dropped frames (if
  1192. negative). If the value is 0, the frame is dropped unregarding the
  1193. number of previous sequentially dropped frames.
  1194. Default value is 0.
  1195. @item hi, lo, frac
  1196. Set the dropping threshold values.
  1197. Values for @var{hi} and @var{lo} are for 8x8 pixel blocks and
  1198. represent actual pixel value differences, so a threshold of 64
  1199. corresponds to 1 unit of difference for each pixel, or the same spread
  1200. out differently over the block.
  1201. A frame is a candidate for dropping if no 8x8 blocks differ by more
  1202. than a threshold of @var{hi}, and if no more than @var{frac} blocks (1
  1203. meaning the whole image) differ by more than a threshold of @var{lo}.
  1204. Default value for @var{hi} is 64*12, default value for @var{lo} is
  1205. 64*5, and default value for @var{frac} is 0.33.
  1206. @end table
  1207. @section delogo
  1208. Suppress a TV station logo by a simple interpolation of the surrounding
  1209. pixels. Just set a rectangle covering the logo and watch it disappear
  1210. (and sometimes something even uglier appear - your mileage may vary).
  1211. The filter accepts parameters as a string of the form
  1212. "@var{x}:@var{y}:@var{w}:@var{h}:@var{band}", or as a list of
  1213. @var{key}=@var{value} pairs, separated by ":".
  1214. The description of the accepted parameters follows.
  1215. @table @option
  1216. @item x, y
  1217. Specify the top left corner coordinates of the logo. They must be
  1218. specified.
  1219. @item w, h
  1220. Specify the width and height of the logo to clear. They must be
  1221. specified.
  1222. @item band, t
  1223. Specify the thickness of the fuzzy edge of the rectangle (added to
  1224. @var{w} and @var{h}). The default value is 4.
  1225. @item show
  1226. When set to 1, a green rectangle is drawn on the screen to simplify
  1227. finding the right @var{x}, @var{y}, @var{w}, @var{h} parameters, and
  1228. @var{band} is set to 4. The default value is 0.
  1229. @end table
  1230. Some examples follow.
  1231. @itemize
  1232. @item
  1233. Set a rectangle covering the area with top left corner coordinates 0,0
  1234. and size 100x77, setting a band of size 10:
  1235. @example
  1236. delogo=0:0:100:77:10
  1237. @end example
  1238. @item
  1239. As the previous example, but use named options:
  1240. @example
  1241. delogo=x=0:y=0:w=100:h=77:band=10
  1242. @end example
  1243. @end itemize
  1244. @section deshake
  1245. Attempt to fix small changes in horizontal and/or vertical shift. This
  1246. filter helps remove camera shake from hand-holding a camera, bumping a
  1247. tripod, moving on a vehicle, etc.
  1248. The filter accepts parameters as a string of the form
  1249. "@var{x}:@var{y}:@var{w}:@var{h}:@var{rx}:@var{ry}:@var{edge}:@var{blocksize}:@var{contrast}:@var{search}:@var{filename}"
  1250. A description of the accepted parameters follows.
  1251. @table @option
  1252. @item x, y, w, h
  1253. Specify a rectangular area where to limit the search for motion
  1254. vectors.
  1255. If desired the search for motion vectors can be limited to a
  1256. rectangular area of the frame defined by its top left corner, width
  1257. and height. These parameters have the same meaning as the drawbox
  1258. filter which can be used to visualise the position of the bounding
  1259. box.
  1260. This is useful when simultaneous movement of subjects within the frame
  1261. might be confused for camera motion by the motion vector search.
  1262. If any or all of @var{x}, @var{y}, @var{w} and @var{h} are set to -1
  1263. then the full frame is used. This allows later options to be set
  1264. without specifying the bounding box for the motion vector search.
  1265. Default - search the whole frame.
  1266. @item rx, ry
  1267. Specify the maximum extent of movement in x and y directions in the
  1268. range 0-64 pixels. Default 16.
  1269. @item edge
  1270. Specify how to generate pixels to fill blanks at the edge of the
  1271. frame. An integer from 0 to 3 as follows:
  1272. @table @option
  1273. @item 0
  1274. Fill zeroes at blank locations
  1275. @item 1
  1276. Original image at blank locations
  1277. @item 2
  1278. Extruded edge value at blank locations
  1279. @item 3
  1280. Mirrored edge at blank locations
  1281. @end table
  1282. The default setting is mirror edge at blank locations.
  1283. @item blocksize
  1284. Specify the blocksize to use for motion search. Range 4-128 pixels,
  1285. default 8.
  1286. @item contrast
  1287. Specify the contrast threshold for blocks. Only blocks with more than
  1288. the specified contrast (difference between darkest and lightest
  1289. pixels) will be considered. Range 1-255, default 125.
  1290. @item search
  1291. Specify the search strategy 0 = exhaustive search, 1 = less exhaustive
  1292. search. Default - exhaustive search.
  1293. @item filename
  1294. If set then a detailed log of the motion search is written to the
  1295. specified file.
  1296. @end table
  1297. @section drawbox
  1298. Draw a colored box on the input image.
  1299. The filter accepts parameters as a list of @var{key}=@var{value} pairs,
  1300. separated by ":".
  1301. The description of the accepted parameters follows.
  1302. @table @option
  1303. @item x, y
  1304. Specify the top left corner coordinates of the box. Default to 0.
  1305. @item width, w
  1306. @item height, h
  1307. Specify the width and height of the box, if 0 they are interpreted as
  1308. the input width and height. Default to 0.
  1309. @item color, c
  1310. Specify the color of the box to write, it can be the name of a color
  1311. (case insensitive match) or a 0xRRGGBB[AA] sequence. If the special
  1312. value @code{invert} is used, the box edge color is the same as the
  1313. video with inverted luma.
  1314. @item thickness, t
  1315. Set the thickness of the box edge. Default value is @code{4}.
  1316. @end table
  1317. If the key of the first options is omitted, the arguments are
  1318. interpreted according to the following syntax:
  1319. @example
  1320. drawbox=@var{x}:@var{y}:@var{width}:@var{height}:@var{color}:@var{thickness}
  1321. @end example
  1322. Some examples follow:
  1323. @itemize
  1324. @item
  1325. Draw a black box around the edge of the input image:
  1326. @example
  1327. drawbox
  1328. @end example
  1329. @item
  1330. Draw a box with color red and an opacity of 50%:
  1331. @example
  1332. drawbox=10:20:200:60:red@@0.5
  1333. @end example
  1334. The previous example can be specified as:
  1335. @example
  1336. drawbox=x=10:y=20:w=200:h=60:color=red@@0.5
  1337. @end example
  1338. @item
  1339. Fill the box with pink color:
  1340. @example
  1341. drawbox=x=10:y=10:w=100:h=100:color=pink@@0.5:t=max
  1342. @end example
  1343. @end itemize
  1344. @anchor{drawtext}
  1345. @section drawtext
  1346. Draw text string or text from specified file on top of video using the
  1347. libfreetype library.
  1348. To enable compilation of this filter you need to configure FFmpeg with
  1349. @code{--enable-libfreetype}.
  1350. @subsection Syntax
  1351. The filter accepts parameters as a list of @var{key}=@var{value} pairs,
  1352. separated by ":".
  1353. The description of the accepted parameters follows.
  1354. @table @option
  1355. @item box
  1356. Used to draw a box around text using background color.
  1357. Value should be either 1 (enable) or 0 (disable).
  1358. The default value of @var{box} is 0.
  1359. @item boxcolor
  1360. The color to be used for drawing box around text.
  1361. Either a string (e.g. "yellow") or in 0xRRGGBB[AA] format
  1362. (e.g. "0xff00ff"), possibly followed by an alpha specifier.
  1363. The default value of @var{boxcolor} is "white".
  1364. @item draw
  1365. Set an expression which specifies if the text should be drawn. If the
  1366. expression evaluates to 0, the text is not drawn. This is useful for
  1367. specifying that the text should be drawn only when specific conditions
  1368. are met.
  1369. Default value is "1".
  1370. See below for the list of accepted constants and functions.
  1371. @item expansion
  1372. Select how the @var{text} is expanded. Can be either @code{none},
  1373. @code{strftime} (default for compatibity reasons but deprecated) or
  1374. @code{normal}. See the @ref{drawtext_expansion, Text expansion} section
  1375. below for details.
  1376. @item fix_bounds
  1377. If true, check and fix text coords to avoid clipping.
  1378. @item fontcolor
  1379. The color to be used for drawing fonts.
  1380. Either a string (e.g. "red") or in 0xRRGGBB[AA] format
  1381. (e.g. "0xff000033"), possibly followed by an alpha specifier.
  1382. The default value of @var{fontcolor} is "black".
  1383. @item fontfile
  1384. The font file to be used for drawing text. Path must be included.
  1385. This parameter is mandatory.
  1386. @item fontsize
  1387. The font size to be used for drawing text.
  1388. The default value of @var{fontsize} is 16.
  1389. @item ft_load_flags
  1390. Flags to be used for loading the fonts.
  1391. The flags map the corresponding flags supported by libfreetype, and are
  1392. a combination of the following values:
  1393. @table @var
  1394. @item default
  1395. @item no_scale
  1396. @item no_hinting
  1397. @item render
  1398. @item no_bitmap
  1399. @item vertical_layout
  1400. @item force_autohint
  1401. @item crop_bitmap
  1402. @item pedantic
  1403. @item ignore_global_advance_width
  1404. @item no_recurse
  1405. @item ignore_transform
  1406. @item monochrome
  1407. @item linear_design
  1408. @item no_autohint
  1409. @item end table
  1410. @end table
  1411. Default value is "render".
  1412. For more information consult the documentation for the FT_LOAD_*
  1413. libfreetype flags.
  1414. @item shadowcolor
  1415. The color to be used for drawing a shadow behind the drawn text. It
  1416. can be a color name (e.g. "yellow") or a string in the 0xRRGGBB[AA]
  1417. form (e.g. "0xff00ff"), possibly followed by an alpha specifier.
  1418. The default value of @var{shadowcolor} is "black".
  1419. @item shadowx, shadowy
  1420. The x and y offsets for the text shadow position with respect to the
  1421. position of the text. They can be either positive or negative
  1422. values. Default value for both is "0".
  1423. @item tabsize
  1424. The size in number of spaces to use for rendering the tab.
  1425. Default value is 4.
  1426. @item timecode
  1427. Set the initial timecode representation in "hh:mm:ss[:;.]ff"
  1428. format. It can be used with or without text parameter. @var{timecode_rate}
  1429. option must be specified.
  1430. @item timecode_rate, rate, r
  1431. Set the timecode frame rate (timecode only).
  1432. @item text
  1433. The text string to be drawn. The text must be a sequence of UTF-8
  1434. encoded characters.
  1435. This parameter is mandatory if no file is specified with the parameter
  1436. @var{textfile}.
  1437. @item textfile
  1438. A text file containing text to be drawn. The text must be a sequence
  1439. of UTF-8 encoded characters.
  1440. This parameter is mandatory if no text string is specified with the
  1441. parameter @var{text}.
  1442. If both @var{text} and @var{textfile} are specified, an error is thrown.
  1443. @item reload
  1444. If set to 1, the @var{textfile} will be reloaded before each frame.
  1445. Be sure to update it atomically, or it may be read partially, or even fail.
  1446. @item x, y
  1447. The expressions which specify the offsets where text will be drawn
  1448. within the video frame. They are relative to the top/left border of the
  1449. output image.
  1450. The default value of @var{x} and @var{y} is "0".
  1451. See below for the list of accepted constants and functions.
  1452. @end table
  1453. The parameters for @var{x} and @var{y} are expressions containing the
  1454. following constants and functions:
  1455. @table @option
  1456. @item dar
  1457. input display aspect ratio, it is the same as (@var{w} / @var{h}) * @var{sar}
  1458. @item hsub, vsub
  1459. horizontal and vertical chroma subsample values. For example for the
  1460. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  1461. @item line_h, lh
  1462. the height of each text line
  1463. @item main_h, h, H
  1464. the input height
  1465. @item main_w, w, W
  1466. the input width
  1467. @item max_glyph_a, ascent
  1468. the maximum distance from the baseline to the highest/upper grid
  1469. coordinate used to place a glyph outline point, for all the rendered
  1470. glyphs.
  1471. It is a positive value, due to the grid's orientation with the Y axis
  1472. upwards.
  1473. @item max_glyph_d, descent
  1474. the maximum distance from the baseline to the lowest grid coordinate
  1475. used to place a glyph outline point, for all the rendered glyphs.
  1476. This is a negative value, due to the grid's orientation, with the Y axis
  1477. upwards.
  1478. @item max_glyph_h
  1479. maximum glyph height, that is the maximum height for all the glyphs
  1480. contained in the rendered text, it is equivalent to @var{ascent} -
  1481. @var{descent}.
  1482. @item max_glyph_w
  1483. maximum glyph width, that is the maximum width for all the glyphs
  1484. contained in the rendered text
  1485. @item n
  1486. the number of input frame, starting from 0
  1487. @item rand(min, max)
  1488. return a random number included between @var{min} and @var{max}
  1489. @item sar
  1490. input sample aspect ratio
  1491. @item t
  1492. timestamp expressed in seconds, NAN if the input timestamp is unknown
  1493. @item text_h, th
  1494. the height of the rendered text
  1495. @item text_w, tw
  1496. the width of the rendered text
  1497. @item x, y
  1498. the x and y offset coordinates where the text is drawn.
  1499. These parameters allow the @var{x} and @var{y} expressions to refer
  1500. each other, so you can for example specify @code{y=x/dar}.
  1501. @end table
  1502. If libavfilter was built with @code{--enable-fontconfig}, then
  1503. @option{fontfile} can be a fontconfig pattern or omitted.
  1504. @anchor{drawtext_expansion}
  1505. @subsection Text expansion
  1506. If @option{expansion} is set to @code{strftime} (which is the default for
  1507. now), the filter recognizes strftime() sequences in the provided text and
  1508. expands them accordingly. Check the documentation of strftime(). This
  1509. feature is deprecated.
  1510. If @option{expansion} is set to @code{none}, the text is printed verbatim.
  1511. If @option{expansion} is set to @code{normal} (which will be the default),
  1512. the following expansion mechanism is used.
  1513. The backslash character '\', followed by any character, always expands to
  1514. the second character.
  1515. Sequence of the form @code{%@{...@}} are expanded. The text between the
  1516. braces is a function name, possibly followed by arguments separated by ':'.
  1517. If the arguments contain special characters or delimiters (':' or '@}'),
  1518. they should be escaped.
  1519. Note that they probably must also be escaped as the value for the
  1520. @option{text} option in the filter argument string and as the filter
  1521. argument in the filter graph description, and possibly also for the shell,
  1522. that makes up to four levels of escaping; using a text file avoids these
  1523. problems.
  1524. The following functions are available:
  1525. @table @command
  1526. @item expr, e
  1527. The expression evaluation result.
  1528. It must take one argument specifying the expression to be evaluated,
  1529. which accepts the same constants and functions as the @var{x} and
  1530. @var{y} values. Note that not all constants should be used, for
  1531. example the text size is not known when evaluating the expression, so
  1532. the constants @var{text_w} and @var{text_h} will have an undefined
  1533. value.
  1534. @item gmtime
  1535. The time at which the filter is running, expressed in UTC.
  1536. It can accept an argument: a strftime() format string.
  1537. @item localtime
  1538. The time at which the filter is running, expressed in the local time zone.
  1539. It can accept an argument: a strftime() format string.
  1540. @item n, frame_num
  1541. The frame number, starting from 0.
  1542. @item pts
  1543. The timestamp of the current frame, in seconds, with microsecond accuracy.
  1544. @end table
  1545. @subsection Examples
  1546. Some examples follow.
  1547. @itemize
  1548. @item
  1549. Draw "Test Text" with font FreeSerif, using the default values for the
  1550. optional parameters.
  1551. @example
  1552. drawtext="fontfile=/usr/share/fonts/truetype/freefont/FreeSerif.ttf: text='Test Text'"
  1553. @end example
  1554. @item
  1555. Draw 'Test Text' with font FreeSerif of size 24 at position x=100
  1556. and y=50 (counting from the top-left corner of the screen), text is
  1557. yellow with a red box around it. Both the text and the box have an
  1558. opacity of 20%.
  1559. @example
  1560. drawtext="fontfile=/usr/share/fonts/truetype/freefont/FreeSerif.ttf: text='Test Text':\
  1561. x=100: y=50: fontsize=24: fontcolor=yellow@@0.2: box=1: boxcolor=red@@0.2"
  1562. @end example
  1563. Note that the double quotes are not necessary if spaces are not used
  1564. within the parameter list.
  1565. @item
  1566. Show the text at the center of the video frame:
  1567. @example
  1568. drawtext="fontsize=30:fontfile=FreeSerif.ttf:text='hello world':x=(w-text_w)/2:y=(h-text_h-line_h)/2"
  1569. @end example
  1570. @item
  1571. Show a text line sliding from right to left in the last row of the video
  1572. frame. The file @file{LONG_LINE} is assumed to contain a single line
  1573. with no newlines.
  1574. @example
  1575. drawtext="fontsize=15:fontfile=FreeSerif.ttf:text=LONG_LINE:y=h-line_h:x=-50*t"
  1576. @end example
  1577. @item
  1578. Show the content of file @file{CREDITS} off the bottom of the frame and scroll up.
  1579. @example
  1580. drawtext="fontsize=20:fontfile=FreeSerif.ttf:textfile=CREDITS:y=h-20*t"
  1581. @end example
  1582. @item
  1583. Draw a single green letter "g", at the center of the input video.
  1584. The glyph baseline is placed at half screen height.
  1585. @example
  1586. drawtext="fontsize=60:fontfile=FreeSerif.ttf:fontcolor=green:text=g:x=(w-max_glyph_w)/2:y=h/2-ascent"
  1587. @end example
  1588. @item
  1589. Show text for 1 second every 3 seconds:
  1590. @example
  1591. drawtext="fontfile=FreeSerif.ttf:fontcolor=white:x=100:y=x/dar:draw=lt(mod(t\,3)\,1):text='blink'"
  1592. @end example
  1593. @item
  1594. Use fontconfig to set the font. Note that the colons need to be escaped.
  1595. @example
  1596. drawtext='fontfile=Linux Libertine O-40\:style=Semibold:text=FFmpeg'
  1597. @end example
  1598. @item
  1599. Print the date of a real-time encoding (see strftime(3)):
  1600. @example
  1601. drawtext='fontfile=FreeSans.ttf:expansion=normal:text=%@{localtime:%a %b %d %Y@}'
  1602. @end example
  1603. @end itemize
  1604. For more information about libfreetype, check:
  1605. @url{http://www.freetype.org/}.
  1606. For more information about fontconfig, check:
  1607. @url{http://freedesktop.org/software/fontconfig/fontconfig-user.html}.
  1608. @section edgedetect
  1609. Detect and draw edges. The filter uses the Canny Edge Detection algorithm.
  1610. This filter accepts the following optional named parameters:
  1611. @table @option
  1612. @item low, high
  1613. Set low and high threshold values used by the Canny thresholding
  1614. algorithm.
  1615. The high threshold selects the "strong" edge pixels, which are then
  1616. connected through 8-connectivity with the "weak" edge pixels selected
  1617. by the low threshold.
  1618. @var{low} and @var{high} threshold values must be choosen in the range
  1619. [0,1], and @var{low} should be lesser or equal to @var{high}.
  1620. Default value for @var{low} is @code{20/255}, and default value for @var{high}
  1621. is @code{50/255}.
  1622. @end table
  1623. Example:
  1624. @example
  1625. edgedetect=low=0.1:high=0.4
  1626. @end example
  1627. @section fade
  1628. Apply fade-in/out effect to input video.
  1629. It accepts the parameters:
  1630. @var{type}:@var{start_frame}:@var{nb_frames}[:@var{options}]
  1631. @var{type} specifies if the effect type, can be either "in" for
  1632. fade-in, or "out" for a fade-out effect.
  1633. @var{start_frame} specifies the number of the start frame for starting
  1634. to apply the fade effect.
  1635. @var{nb_frames} specifies the number of frames for which the fade
  1636. effect has to last. At the end of the fade-in effect the output video
  1637. will have the same intensity as the input video, at the end of the
  1638. fade-out transition the output video will be completely black.
  1639. @var{options} is an optional sequence of @var{key}=@var{value} pairs,
  1640. separated by ":". The description of the accepted options follows.
  1641. @table @option
  1642. @item type, t
  1643. See @var{type}.
  1644. @item start_frame, s
  1645. See @var{start_frame}.
  1646. @item nb_frames, n
  1647. See @var{nb_frames}.
  1648. @item alpha
  1649. If set to 1, fade only alpha channel, if one exists on the input.
  1650. Default value is 0.
  1651. @end table
  1652. A few usage examples follow, usable too as test scenarios.
  1653. @example
  1654. # fade in first 30 frames of video
  1655. fade=in:0:30
  1656. # fade out last 45 frames of a 200-frame video
  1657. fade=out:155:45
  1658. # fade in first 25 frames and fade out last 25 frames of a 1000-frame video
  1659. fade=in:0:25, fade=out:975:25
  1660. # make first 5 frames black, then fade in from frame 5-24
  1661. fade=in:5:20
  1662. # fade in alpha over first 25 frames of video
  1663. fade=in:0:25:alpha=1
  1664. @end example
  1665. @section field
  1666. Extract a single field from an interlaced image using stride
  1667. arithmetic to avoid wasting CPU time. The output frames are marked as
  1668. non-interlaced.
  1669. This filter accepts the following named options:
  1670. @table @option
  1671. @item type
  1672. Specify whether to extract the top (if the value is @code{0} or
  1673. @code{top}) or the bottom field (if the value is @code{1} or
  1674. @code{bottom}).
  1675. @end table
  1676. If the option key is not specified, the first value sets the @var{type}
  1677. option. For example:
  1678. @example
  1679. field=bottom
  1680. @end example
  1681. is equivalent to:
  1682. @example
  1683. field=type=bottom
  1684. @end example
  1685. @section fieldorder
  1686. Transform the field order of the input video.
  1687. It accepts one parameter which specifies the required field order that
  1688. the input interlaced video will be transformed to. The parameter can
  1689. assume one of the following values:
  1690. @table @option
  1691. @item 0 or bff
  1692. output bottom field first
  1693. @item 1 or tff
  1694. output top field first
  1695. @end table
  1696. Default value is "tff".
  1697. Transformation is achieved by shifting the picture content up or down
  1698. by one line, and filling the remaining line with appropriate picture content.
  1699. This method is consistent with most broadcast field order converters.
  1700. If the input video is not flagged as being interlaced, or it is already
  1701. flagged as being of the required output field order then this filter does
  1702. not alter the incoming video.
  1703. This filter is very useful when converting to or from PAL DV material,
  1704. which is bottom field first.
  1705. For example:
  1706. @example
  1707. ffmpeg -i in.vob -vf "fieldorder=bff" out.dv
  1708. @end example
  1709. @section fifo
  1710. Buffer input images and send them when they are requested.
  1711. This filter is mainly useful when auto-inserted by the libavfilter
  1712. framework.
  1713. The filter does not take parameters.
  1714. @section format
  1715. Convert the input video to one of the specified pixel formats.
  1716. Libavfilter will try to pick one that is supported for the input to
  1717. the next filter.
  1718. The filter accepts a list of pixel format names, separated by ":",
  1719. for example "yuv420p:monow:rgb24".
  1720. Some examples follow:
  1721. @example
  1722. # convert the input video to the format "yuv420p"
  1723. format=yuv420p
  1724. # convert the input video to any of the formats in the list
  1725. format=yuv420p:yuv444p:yuv410p
  1726. @end example
  1727. @section fps
  1728. Convert the video to specified constant framerate by duplicating or dropping
  1729. frames as necessary.
  1730. This filter accepts the following named parameters:
  1731. @table @option
  1732. @item fps
  1733. Desired output framerate.
  1734. @item round
  1735. Rounding method. The default is @code{near}.
  1736. @end table
  1737. @section framestep
  1738. Select one frame every N.
  1739. This filter accepts in input a string representing a positive
  1740. integer. Default argument is @code{1}.
  1741. @anchor{frei0r}
  1742. @section frei0r
  1743. Apply a frei0r effect to the input video.
  1744. To enable compilation of this filter you need to install the frei0r
  1745. header and configure FFmpeg with @code{--enable-frei0r}.
  1746. The filter supports the syntax:
  1747. @example
  1748. @var{filter_name}[@{:|=@}@var{param1}:@var{param2}:...:@var{paramN}]
  1749. @end example
  1750. @var{filter_name} is the name of the frei0r effect to load. If the
  1751. environment variable @env{FREI0R_PATH} is defined, the frei0r effect
  1752. is searched in each one of the directories specified by the colon (or
  1753. semicolon on Windows platforms) separated list in @env{FREIOR_PATH},
  1754. otherwise in the standard frei0r paths, which are in this order:
  1755. @file{HOME/.frei0r-1/lib/}, @file{/usr/local/lib/frei0r-1/},
  1756. @file{/usr/lib/frei0r-1/}.
  1757. @var{param1}, @var{param2}, ... , @var{paramN} specify the parameters
  1758. for the frei0r effect.
  1759. A frei0r effect parameter can be a boolean (whose values are specified
  1760. with "y" and "n"), a double, a color (specified by the syntax
  1761. @var{R}/@var{G}/@var{B}, @var{R}, @var{G}, and @var{B} being float
  1762. numbers from 0.0 to 1.0) or by an @code{av_parse_color()} color
  1763. description), a position (specified by the syntax @var{X}/@var{Y},
  1764. @var{X} and @var{Y} being float numbers) and a string.
  1765. The number and kind of parameters depend on the loaded effect. If an
  1766. effect parameter is not specified the default value is set.
  1767. Some examples follow:
  1768. @itemize
  1769. @item
  1770. Apply the distort0r effect, set the first two double parameters:
  1771. @example
  1772. frei0r=distort0r:0.5:0.01
  1773. @end example
  1774. @item
  1775. Apply the colordistance effect, take a color as first parameter:
  1776. @example
  1777. frei0r=colordistance:0.2/0.3/0.4
  1778. frei0r=colordistance:violet
  1779. frei0r=colordistance:0x112233
  1780. @end example
  1781. @item
  1782. Apply the perspective effect, specify the top left and top right image
  1783. positions:
  1784. @example
  1785. frei0r=perspective:0.2/0.2:0.8/0.2
  1786. @end example
  1787. @end itemize
  1788. For more information see:
  1789. @url{http://frei0r.dyne.org}
  1790. @section geq
  1791. The filter takes one, two or three equations as parameter, separated by ':'.
  1792. The first equation is mandatory and applies to the luma plane. The two
  1793. following are respectively for chroma blue and chroma red planes.
  1794. The filter syntax allows named parameters:
  1795. @table @option
  1796. @item lum_expr
  1797. the luminance expression
  1798. @item cb_expr
  1799. the chrominance blue expression
  1800. @item cr_expr
  1801. the chrominance red expression
  1802. @end table
  1803. If one of the chrominance expression is not defined, it falls back on the other
  1804. one. If none of them are specified, they will evaluate the luminance
  1805. expression.
  1806. The expressions can use the following variables and functions:
  1807. @table @option
  1808. @item N
  1809. The sequential number of the filtered frame, starting from @code{0}.
  1810. @item X, Y
  1811. The coordinates of the current sample.
  1812. @item W, H
  1813. The width and height of the image.
  1814. @item SW, SH
  1815. Width and height scale depending on the currently filtered plane. It is the
  1816. ratio between the corresponding luma plane number of pixels and the current
  1817. plane ones. E.g. for YUV4:2:0 the values are @code{1,1} for the luma plane, and
  1818. @code{0.5,0.5} for chroma planes.
  1819. @item T
  1820. Time of the current frame, expressed in seconds.
  1821. @item p(x, y)
  1822. Return the value of the pixel at location (@var{x},@var{y}) of the current
  1823. plane.
  1824. @item lum(x, y)
  1825. Return the value of the pixel at location (@var{x},@var{y}) of the luminance
  1826. plane.
  1827. @item cb(x, y)
  1828. Return the value of the pixel at location (@var{x},@var{y}) of the
  1829. blue-difference chroma plane.
  1830. @item cr(x, y)
  1831. Return the value of the pixel at location (@var{x},@var{y}) of the
  1832. red-difference chroma plane.
  1833. @end table
  1834. For functions, if @var{x} and @var{y} are outside the area, the value will be
  1835. automatically clipped to the closer edge.
  1836. Some examples follow:
  1837. @itemize
  1838. @item
  1839. Flip the image horizontally:
  1840. @example
  1841. geq=p(W-X\,Y)
  1842. @end example
  1843. @item
  1844. Generate a bidimensional sine wave, with angle @code{PI/3} and a
  1845. wavelength of 100 pixels:
  1846. @example
  1847. geq=128 + 100*sin(2*(PI/100)*(cos(PI/3)*(X-50*T) + sin(PI/3)*Y)):128:128
  1848. @end example
  1849. @item
  1850. Generate a fancy enigmatic moving light:
  1851. @example
  1852. nullsrc=s=256x256,geq=random(1)/hypot(X-cos(N*0.07)*W/2-W/2\,Y-sin(N*0.09)*H/2-H/2)^2*1000000*sin(N*0.02):128:128
  1853. @end example
  1854. @end itemize
  1855. @section gradfun
  1856. Fix the banding artifacts that are sometimes introduced into nearly flat
  1857. regions by truncation to 8bit color depth.
  1858. Interpolate the gradients that should go where the bands are, and
  1859. dither them.
  1860. This filter is designed for playback only. Do not use it prior to
  1861. lossy compression, because compression tends to lose the dither and
  1862. bring back the bands.
  1863. The filter takes two optional parameters, separated by ':':
  1864. @var{strength}:@var{radius}
  1865. @var{strength} is the maximum amount by which the filter will change
  1866. any one pixel. Also the threshold for detecting nearly flat
  1867. regions. Acceptable values range from .51 to 255, default value is
  1868. 1.2, out-of-range values will be clipped to the valid range.
  1869. @var{radius} is the neighborhood to fit the gradient to. A larger
  1870. radius makes for smoother gradients, but also prevents the filter from
  1871. modifying the pixels near detailed regions. Acceptable values are
  1872. 8-32, default value is 16, out-of-range values will be clipped to the
  1873. valid range.
  1874. @example
  1875. # default parameters
  1876. gradfun=1.2:16
  1877. # omitting radius
  1878. gradfun=1.2
  1879. @end example
  1880. @section hflip
  1881. Flip the input video horizontally.
  1882. For example to horizontally flip the input video with @command{ffmpeg}:
  1883. @example
  1884. ffmpeg -i in.avi -vf "hflip" out.avi
  1885. @end example
  1886. @section hqdn3d
  1887. High precision/quality 3d denoise filter. This filter aims to reduce
  1888. image noise producing smooth images and making still images really
  1889. still. It should enhance compressibility.
  1890. It accepts the following optional parameters:
  1891. @var{luma_spatial}:@var{chroma_spatial}:@var{luma_tmp}:@var{chroma_tmp}
  1892. @table @option
  1893. @item luma_spatial
  1894. a non-negative float number which specifies spatial luma strength,
  1895. defaults to 4.0
  1896. @item chroma_spatial
  1897. a non-negative float number which specifies spatial chroma strength,
  1898. defaults to 3.0*@var{luma_spatial}/4.0
  1899. @item luma_tmp
  1900. a float number which specifies luma temporal strength, defaults to
  1901. 6.0*@var{luma_spatial}/4.0
  1902. @item chroma_tmp
  1903. a float number which specifies chroma temporal strength, defaults to
  1904. @var{luma_tmp}*@var{chroma_spatial}/@var{luma_spatial}
  1905. @end table
  1906. @section hue
  1907. Modify the hue and/or the saturation of the input.
  1908. This filter accepts the following optional named options:
  1909. @table @option
  1910. @item h
  1911. Specify the hue angle as a number of degrees. It accepts a float
  1912. number or an expression, and defaults to 0.0.
  1913. @item H
  1914. Specify the hue angle as a number of degrees. It accepts a float
  1915. number or an expression, and defaults to 0.0.
  1916. @item s
  1917. Specify the saturation in the [-10,10] range. It accepts a float number and
  1918. defaults to 1.0.
  1919. @end table
  1920. The @var{h}, @var{H} and @var{s} parameters are expressions containing the
  1921. following constants:
  1922. @table @option
  1923. @item n
  1924. frame count of the input frame starting from 0
  1925. @item pts
  1926. presentation timestamp of the input frame expressed in time base units
  1927. @item r
  1928. frame rate of the input video, NAN if the input frame rate is unknown
  1929. @item t
  1930. timestamp expressed in seconds, NAN if the input timestamp is unknown
  1931. @item tb
  1932. time base of the input video
  1933. @end table
  1934. The options can also be set using the syntax: @var{hue}:@var{saturation}
  1935. In this case @var{hue} is expressed in degrees.
  1936. Some examples follow:
  1937. @itemize
  1938. @item
  1939. Set the hue to 90 degrees and the saturation to 1.0:
  1940. @example
  1941. hue=h=90:s=1
  1942. @end example
  1943. @item
  1944. Same command but expressing the hue in radians:
  1945. @example
  1946. hue=H=PI/2:s=1
  1947. @end example
  1948. @item
  1949. Same command without named options, hue must be expressed in degrees:
  1950. @example
  1951. hue=90:1
  1952. @end example
  1953. @item
  1954. Note that "h:s" syntax does not support expressions for the values of
  1955. h and s, so the following example will issue an error:
  1956. @example
  1957. hue=PI/2:1
  1958. @end example
  1959. @item
  1960. Rotate hue and make the saturation swing between 0
  1961. and 2 over a period of 1 second:
  1962. @example
  1963. hue="H=2*PI*t: s=sin(2*PI*t)+1"
  1964. @end example
  1965. @item
  1966. Apply a 3 seconds saturation fade-in effect starting at 0:
  1967. @example
  1968. hue="s=min(t/3\,1)"
  1969. @end example
  1970. The general fade-in expression can be written as:
  1971. @example
  1972. hue="s=min(0\, max((t-START)/DURATION\, 1))"
  1973. @end example
  1974. @item
  1975. Apply a 3 seconds saturation fade-out effect starting at 5 seconds:
  1976. @example
  1977. hue="s=max(0\, min(1\, (8-t)/3))"
  1978. @end example
  1979. The general fade-out expression can be written as:
  1980. @example
  1981. hue="s=max(0\, min(1\, (START+DURATION-t)/DURATION))"
  1982. @end example
  1983. @end itemize
  1984. @subsection Commands
  1985. This filter supports the following command:
  1986. @table @option
  1987. @item reinit
  1988. Modify the hue and/or the saturation of the input video.
  1989. The command accepts the same named options and syntax than when calling the
  1990. filter from the command-line.
  1991. If a parameter is omitted, it is kept at its current value.
  1992. @end table
  1993. @section idet
  1994. Interlaceing detect filter. This filter tries to detect if the input is
  1995. interlaced or progressive. Top or bottom field first.
  1996. @section lut, lutrgb, lutyuv
  1997. Compute a look-up table for binding each pixel component input value
  1998. to an output value, and apply it to input video.
  1999. @var{lutyuv} applies a lookup table to a YUV input video, @var{lutrgb}
  2000. to an RGB input video.
  2001. These filters accept in input a ":"-separated list of options, which
  2002. specify the expressions used for computing the lookup table for the
  2003. corresponding pixel component values.
  2004. The @var{lut} filter requires either YUV or RGB pixel formats in
  2005. input, and accepts the options:
  2006. @table @option
  2007. @item @var{c0} (first pixel component)
  2008. @item @var{c1} (second pixel component)
  2009. @item @var{c2} (third pixel component)
  2010. @item @var{c3} (fourth pixel component, corresponds to the alpha component)
  2011. @end table
  2012. The exact component associated to each option depends on the format in
  2013. input.
  2014. The @var{lutrgb} filter requires RGB pixel formats in input, and
  2015. accepts the options:
  2016. @table @option
  2017. @item @var{r} (red component)
  2018. @item @var{g} (green component)
  2019. @item @var{b} (blue component)
  2020. @item @var{a} (alpha component)
  2021. @end table
  2022. The @var{lutyuv} filter requires YUV pixel formats in input, and
  2023. accepts the options:
  2024. @table @option
  2025. @item @var{y} (Y/luminance component)
  2026. @item @var{u} (U/Cb component)
  2027. @item @var{v} (V/Cr component)
  2028. @item @var{a} (alpha component)
  2029. @end table
  2030. The expressions can contain the following constants and functions:
  2031. @table @option
  2032. @item w, h
  2033. the input width and height
  2034. @item val
  2035. input value for the pixel component
  2036. @item clipval
  2037. the input value clipped in the @var{minval}-@var{maxval} range
  2038. @item maxval
  2039. maximum value for the pixel component
  2040. @item minval
  2041. minimum value for the pixel component
  2042. @item negval
  2043. the negated value for the pixel component value clipped in the
  2044. @var{minval}-@var{maxval} range , it corresponds to the expression
  2045. "maxval-clipval+minval"
  2046. @item clip(val)
  2047. the computed value in @var{val} clipped in the
  2048. @var{minval}-@var{maxval} range
  2049. @item gammaval(gamma)
  2050. the computed gamma correction value of the pixel component value
  2051. clipped in the @var{minval}-@var{maxval} range, corresponds to the
  2052. expression
  2053. "pow((clipval-minval)/(maxval-minval)\,@var{gamma})*(maxval-minval)+minval"
  2054. @end table
  2055. All expressions default to "val".
  2056. Some examples follow:
  2057. @example
  2058. # negate input video
  2059. lutrgb="r=maxval+minval-val:g=maxval+minval-val:b=maxval+minval-val"
  2060. lutyuv="y=maxval+minval-val:u=maxval+minval-val:v=maxval+minval-val"
  2061. # the above is the same as
  2062. lutrgb="r=negval:g=negval:b=negval"
  2063. lutyuv="y=negval:u=negval:v=negval"
  2064. # negate luminance
  2065. lutyuv=y=negval
  2066. # remove chroma components, turns the video into a graytone image
  2067. lutyuv="u=128:v=128"
  2068. # apply a luma burning effect
  2069. lutyuv="y=2*val"
  2070. # remove green and blue components
  2071. lutrgb="g=0:b=0"
  2072. # set a constant alpha channel value on input
  2073. format=rgba,lutrgb=a="maxval-minval/2"
  2074. # correct luminance gamma by a 0.5 factor
  2075. lutyuv=y=gammaval(0.5)
  2076. @end example
  2077. @section mp
  2078. Apply an MPlayer filter to the input video.
  2079. This filter provides a wrapper around most of the filters of
  2080. MPlayer/MEncoder.
  2081. This wrapper is considered experimental. Some of the wrapped filters
  2082. may not work properly and we may drop support for them, as they will
  2083. be implemented natively into FFmpeg. Thus you should avoid
  2084. depending on them when writing portable scripts.
  2085. The filters accepts the parameters:
  2086. @var{filter_name}[:=]@var{filter_params}
  2087. @var{filter_name} is the name of a supported MPlayer filter,
  2088. @var{filter_params} is a string containing the parameters accepted by
  2089. the named filter.
  2090. The list of the currently supported filters follows:
  2091. @table @var
  2092. @item denoise3d
  2093. @item detc
  2094. @item dint
  2095. @item divtc
  2096. @item down3dright
  2097. @item dsize
  2098. @item eq2
  2099. @item eq
  2100. @item fil
  2101. @item fspp
  2102. @item harddup
  2103. @item il
  2104. @item ilpack
  2105. @item ivtc
  2106. @item kerndeint
  2107. @item mcdeint
  2108. @item noise
  2109. @item ow
  2110. @item perspective
  2111. @item phase
  2112. @item pp7
  2113. @item pullup
  2114. @item qp
  2115. @item sab
  2116. @item softpulldown
  2117. @item softskip
  2118. @item spp
  2119. @item telecine
  2120. @item tinterlace
  2121. @item unsharp
  2122. @item uspp
  2123. @end table
  2124. The parameter syntax and behavior for the listed filters are the same
  2125. of the corresponding MPlayer filters. For detailed instructions check
  2126. the "VIDEO FILTERS" section in the MPlayer manual.
  2127. Some examples follow:
  2128. @itemize
  2129. @item
  2130. Adjust gamma, brightness, contrast:
  2131. @example
  2132. mp=eq2=1.0:2:0.5
  2133. @end example
  2134. @item
  2135. Add temporal noise to input video:
  2136. @example
  2137. mp=noise=20t
  2138. @end example
  2139. @end itemize
  2140. See also mplayer(1), @url{http://www.mplayerhq.hu/}.
  2141. @section negate
  2142. Negate input video.
  2143. This filter accepts an integer in input, if non-zero it negates the
  2144. alpha component (if available). The default value in input is 0.
  2145. @section noformat
  2146. Force libavfilter not to use any of the specified pixel formats for the
  2147. input to the next filter.
  2148. The filter accepts a list of pixel format names, separated by ":",
  2149. for example "yuv420p:monow:rgb24".
  2150. Some examples follow:
  2151. @example
  2152. # force libavfilter to use a format different from "yuv420p" for the
  2153. # input to the vflip filter
  2154. noformat=yuv420p,vflip
  2155. # convert the input video to any of the formats not contained in the list
  2156. noformat=yuv420p:yuv444p:yuv410p
  2157. @end example
  2158. @section null
  2159. Pass the video source unchanged to the output.
  2160. @section ocv
  2161. Apply video transform using libopencv.
  2162. To enable this filter install libopencv library and headers and
  2163. configure FFmpeg with @code{--enable-libopencv}.
  2164. The filter takes the parameters: @var{filter_name}@{:=@}@var{filter_params}.
  2165. @var{filter_name} is the name of the libopencv filter to apply.
  2166. @var{filter_params} specifies the parameters to pass to the libopencv
  2167. filter. If not specified the default values are assumed.
  2168. Refer to the official libopencv documentation for more precise
  2169. information:
  2170. @url{http://opencv.willowgarage.com/documentation/c/image_filtering.html}
  2171. Follows the list of supported libopencv filters.
  2172. @anchor{dilate}
  2173. @subsection dilate
  2174. Dilate an image by using a specific structuring element.
  2175. This filter corresponds to the libopencv function @code{cvDilate}.
  2176. It accepts the parameters: @var{struct_el}:@var{nb_iterations}.
  2177. @var{struct_el} represents a structuring element, and has the syntax:
  2178. @var{cols}x@var{rows}+@var{anchor_x}x@var{anchor_y}/@var{shape}
  2179. @var{cols} and @var{rows} represent the number of columns and rows of
  2180. the structuring element, @var{anchor_x} and @var{anchor_y} the anchor
  2181. point, and @var{shape} the shape for the structuring element, and
  2182. can be one of the values "rect", "cross", "ellipse", "custom".
  2183. If the value for @var{shape} is "custom", it must be followed by a
  2184. string of the form "=@var{filename}". The file with name
  2185. @var{filename} is assumed to represent a binary image, with each
  2186. printable character corresponding to a bright pixel. When a custom
  2187. @var{shape} is used, @var{cols} and @var{rows} are ignored, the number
  2188. or columns and rows of the read file are assumed instead.
  2189. The default value for @var{struct_el} is "3x3+0x0/rect".
  2190. @var{nb_iterations} specifies the number of times the transform is
  2191. applied to the image, and defaults to 1.
  2192. Follow some example:
  2193. @example
  2194. # use the default values
  2195. ocv=dilate
  2196. # dilate using a structuring element with a 5x5 cross, iterate two times
  2197. ocv=dilate=5x5+2x2/cross:2
  2198. # read the shape from the file diamond.shape, iterate two times
  2199. # the file diamond.shape may contain a pattern of characters like this:
  2200. # *
  2201. # ***
  2202. # *****
  2203. # ***
  2204. # *
  2205. # the specified cols and rows are ignored (but not the anchor point coordinates)
  2206. ocv=0x0+2x2/custom=diamond.shape:2
  2207. @end example
  2208. @subsection erode
  2209. Erode an image by using a specific structuring element.
  2210. This filter corresponds to the libopencv function @code{cvErode}.
  2211. The filter accepts the parameters: @var{struct_el}:@var{nb_iterations},
  2212. with the same syntax and semantics as the @ref{dilate} filter.
  2213. @subsection smooth
  2214. Smooth the input video.
  2215. The filter takes the following parameters:
  2216. @var{type}:@var{param1}:@var{param2}:@var{param3}:@var{param4}.
  2217. @var{type} is the type of smooth filter to apply, and can be one of
  2218. the following values: "blur", "blur_no_scale", "median", "gaussian",
  2219. "bilateral". The default value is "gaussian".
  2220. @var{param1}, @var{param2}, @var{param3}, and @var{param4} are
  2221. parameters whose meanings depend on smooth type. @var{param1} and
  2222. @var{param2} accept integer positive values or 0, @var{param3} and
  2223. @var{param4} accept float values.
  2224. The default value for @var{param1} is 3, the default value for the
  2225. other parameters is 0.
  2226. These parameters correspond to the parameters assigned to the
  2227. libopencv function @code{cvSmooth}.
  2228. @anchor{overlay}
  2229. @section overlay
  2230. Overlay one video on top of another.
  2231. It takes two inputs and one output, the first input is the "main"
  2232. video on which the second input is overlayed.
  2233. It accepts the parameters: @var{x}:@var{y}[:@var{options}].
  2234. @var{x} is the x coordinate of the overlayed video on the main video,
  2235. @var{y} is the y coordinate. @var{x} and @var{y} are expressions containing
  2236. the following parameters:
  2237. @table @option
  2238. @item main_w, main_h
  2239. main input width and height
  2240. @item W, H
  2241. same as @var{main_w} and @var{main_h}
  2242. @item overlay_w, overlay_h
  2243. overlay input width and height
  2244. @item w, h
  2245. same as @var{overlay_w} and @var{overlay_h}
  2246. @end table
  2247. @var{options} is an optional list of @var{key}=@var{value} pairs,
  2248. separated by ":".
  2249. The description of the accepted options follows.
  2250. @table @option
  2251. @item rgb
  2252. If set to 1, force the filter to accept inputs in the RGB
  2253. color space. Default value is 0.
  2254. @end table
  2255. Be aware that frames are taken from each input video in timestamp
  2256. order, hence, if their initial timestamps differ, it is a a good idea
  2257. to pass the two inputs through a @var{setpts=PTS-STARTPTS} filter to
  2258. have them begin in the same zero timestamp, as it does the example for
  2259. the @var{movie} filter.
  2260. Follow some examples:
  2261. @example
  2262. # draw the overlay at 10 pixels from the bottom right
  2263. # corner of the main video.
  2264. overlay=main_w-overlay_w-10:main_h-overlay_h-10
  2265. # insert a transparent PNG logo in the bottom left corner of the input
  2266. ffmpeg -i input -i logo -filter_complex 'overlay=10:main_h-overlay_h-10' output
  2267. # insert 2 different transparent PNG logos (second logo on bottom
  2268. # right corner):
  2269. ffmpeg -i input -i logo1 -i logo2 -filter_complex
  2270. 'overlay=10:H-h-10,overlay=W-w-10:H-h-10' output
  2271. # add a transparent color layer on top of the main video,
  2272. # WxH specifies the size of the main input to the overlay filter
  2273. color=red@@.3:WxH [over]; [in][over] overlay [out]
  2274. # play an original video and a filtered version (here with the deshake filter)
  2275. # side by side
  2276. ffplay input.avi -vf 'split[a][b]; [a]pad=iw*2:ih[src]; [b]deshake[filt]; [src][filt]overlay=w'
  2277. # the previous example is the same as:
  2278. ffplay input.avi -vf 'split[b], pad=iw*2[src], [b]deshake, [src]overlay=w'
  2279. @end example
  2280. You can chain together more overlays but the efficiency of such
  2281. approach is yet to be tested.
  2282. @section pad
  2283. Add paddings to the input image, and places the original input at the
  2284. given coordinates @var{x}, @var{y}.
  2285. It accepts the following parameters:
  2286. @var{width}:@var{height}:@var{x}:@var{y}:@var{color}.
  2287. The parameters @var{width}, @var{height}, @var{x}, and @var{y} are
  2288. expressions containing the following constants:
  2289. @table @option
  2290. @item in_w, in_h
  2291. the input video width and height
  2292. @item iw, ih
  2293. same as @var{in_w} and @var{in_h}
  2294. @item out_w, out_h
  2295. the output width and height, that is the size of the padded area as
  2296. specified by the @var{width} and @var{height} expressions
  2297. @item ow, oh
  2298. same as @var{out_w} and @var{out_h}
  2299. @item x, y
  2300. x and y offsets as specified by the @var{x} and @var{y}
  2301. expressions, or NAN if not yet specified
  2302. @item a
  2303. same as @var{iw} / @var{ih}
  2304. @item sar
  2305. input sample aspect ratio
  2306. @item dar
  2307. input display aspect ratio, it is the same as (@var{iw} / @var{ih}) * @var{sar}
  2308. @item hsub, vsub
  2309. horizontal and vertical chroma subsample values. For example for the
  2310. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  2311. @end table
  2312. Follows the description of the accepted parameters.
  2313. @table @option
  2314. @item width, height
  2315. Specify the size of the output image with the paddings added. If the
  2316. value for @var{width} or @var{height} is 0, the corresponding input size
  2317. is used for the output.
  2318. The @var{width} expression can reference the value set by the
  2319. @var{height} expression, and vice versa.
  2320. The default value of @var{width} and @var{height} is 0.
  2321. @item x, y
  2322. Specify the offsets where to place the input image in the padded area
  2323. with respect to the top/left border of the output image.
  2324. The @var{x} expression can reference the value set by the @var{y}
  2325. expression, and vice versa.
  2326. The default value of @var{x} and @var{y} is 0.
  2327. @item color
  2328. Specify the color of the padded area, it can be the name of a color
  2329. (case insensitive match) or a 0xRRGGBB[AA] sequence.
  2330. The default value of @var{color} is "black".
  2331. @end table
  2332. @subsection Examples
  2333. @itemize
  2334. @item
  2335. Add paddings with color "violet" to the input video. Output video
  2336. size is 640x480, the top-left corner of the input video is placed at
  2337. column 0, row 40:
  2338. @example
  2339. pad=640:480:0:40:violet
  2340. @end example
  2341. @item
  2342. Pad the input to get an output with dimensions increased by 3/2,
  2343. and put the input video at the center of the padded area:
  2344. @example
  2345. pad="3/2*iw:3/2*ih:(ow-iw)/2:(oh-ih)/2"
  2346. @end example
  2347. @item
  2348. Pad the input to get a squared output with size equal to the maximum
  2349. value between the input width and height, and put the input video at
  2350. the center of the padded area:
  2351. @example
  2352. pad="max(iw\,ih):ow:(ow-iw)/2:(oh-ih)/2"
  2353. @end example
  2354. @item
  2355. Pad the input to get a final w/h ratio of 16:9:
  2356. @example
  2357. pad="ih*16/9:ih:(ow-iw)/2:(oh-ih)/2"
  2358. @end example
  2359. @item
  2360. In case of anamorphic video, in order to set the output display aspect
  2361. correctly, it is necessary to use @var{sar} in the expression,
  2362. according to the relation:
  2363. @example
  2364. (ih * X / ih) * sar = output_dar
  2365. X = output_dar / sar
  2366. @end example
  2367. Thus the previous example needs to be modified to:
  2368. @example
  2369. pad="ih*16/9/sar:ih:(ow-iw)/2:(oh-ih)/2"
  2370. @end example
  2371. @item
  2372. Double output size and put the input video in the bottom-right
  2373. corner of the output padded area:
  2374. @example
  2375. pad="2*iw:2*ih:ow-iw:oh-ih"
  2376. @end example
  2377. @end itemize
  2378. @section pixdesctest
  2379. Pixel format descriptor test filter, mainly useful for internal
  2380. testing. The output video should be equal to the input video.
  2381. For example:
  2382. @example
  2383. format=monow, pixdesctest
  2384. @end example
  2385. can be used to test the monowhite pixel format descriptor definition.
  2386. @section removelogo
  2387. Suppress a TV station logo, using an image file to determine which
  2388. pixels comprise the logo. It works by filling in the pixels that
  2389. comprise the logo with neighboring pixels.
  2390. This filter requires one argument which specifies the filter bitmap
  2391. file, which can be any image format supported by libavformat. The
  2392. width and height of the image file must match those of the video
  2393. stream being processed.
  2394. Pixels in the provided bitmap image with a value of zero are not
  2395. considered part of the logo, non-zero pixels are considered part of
  2396. the logo. If you use white (255) for the logo and black (0) for the
  2397. rest, you will be safe. For making the filter bitmap, it is
  2398. recommended to take a screen capture of a black frame with the logo
  2399. visible, and then using a threshold filter followed by the erode
  2400. filter once or twice.
  2401. If needed, little splotches can be fixed manually. Remember that if
  2402. logo pixels are not covered, the filter quality will be much
  2403. reduced. Marking too many pixels as part of the logo does not hurt as
  2404. much, but it will increase the amount of blurring needed to cover over
  2405. the image and will destroy more information than necessary, and extra
  2406. pixels will slow things down on a large logo.
  2407. @section scale
  2408. Scale (resize) the input video, using the libswscale library.
  2409. The scale filter forces the output display aspect ratio to be the same
  2410. of the input, by changing the output sample aspect ratio.
  2411. This filter accepts a list of named options in the form of
  2412. @var{key}=@var{value} pairs separated by ":". If the key for the first
  2413. two options is not specified, the assumed keys for the first two
  2414. values are @code{w} and @code{h}. If the first option has no key and
  2415. can be interpreted like a video size specification, it will be used
  2416. to set the video size.
  2417. A description of the accepted options follows.
  2418. @table @option
  2419. @item width, w
  2420. Set the video width expression, default value is @code{iw}. See below
  2421. for the list of accepted constants.
  2422. @item height, h
  2423. Set the video heiht expression, default value is @code{ih}.
  2424. See below for the list of accepted constants.
  2425. @item interl
  2426. Set the interlacing. It accepts the following values:
  2427. @table @option
  2428. @item 1
  2429. force interlaced aware scaling
  2430. @item 0
  2431. do not apply interlaced scaling
  2432. @item -1
  2433. select interlaced aware scaling depending on whether the source frames
  2434. are flagged as interlaced or not
  2435. @end table
  2436. Default value is @code{0}.
  2437. @item flags
  2438. Set libswscale scaling flags. If not explictly specified the filter
  2439. applies a bilinear scaling algorithm.
  2440. @item size, s
  2441. Set the video size, the value must be a valid abbreviation or in the
  2442. form @var{width}x@var{height}.
  2443. @end table
  2444. The values of the @var{w} and @var{h} options are expressions
  2445. containing the following constants:
  2446. @table @option
  2447. @item in_w, in_h
  2448. the input width and height
  2449. @item iw, ih
  2450. same as @var{in_w} and @var{in_h}
  2451. @item out_w, out_h
  2452. the output (cropped) width and height
  2453. @item ow, oh
  2454. same as @var{out_w} and @var{out_h}
  2455. @item a
  2456. same as @var{iw} / @var{ih}
  2457. @item sar
  2458. input sample aspect ratio
  2459. @item dar
  2460. input display aspect ratio, it is the same as (@var{iw} / @var{ih}) * @var{sar}
  2461. @item hsub, vsub
  2462. horizontal and vertical chroma subsample values. For example for the
  2463. pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
  2464. @end table
  2465. If the input image format is different from the format requested by
  2466. the next filter, the scale filter will convert the input to the
  2467. requested format.
  2468. If the value for @var{width} or @var{height} is 0, the respective input
  2469. size is used for the output.
  2470. If the value for @var{width} or @var{height} is -1, the scale filter will
  2471. use, for the respective output size, a value that maintains the aspect
  2472. ratio of the input image.
  2473. @subsection Examples
  2474. @itemize
  2475. @item
  2476. Scale the input video to a size of 200x100:
  2477. @example
  2478. scale=200:100
  2479. @end example
  2480. This is equivalent to:
  2481. @example
  2482. scale=w=200:h=100
  2483. @end example
  2484. or:
  2485. @example
  2486. scale=200x100
  2487. @end example
  2488. @item
  2489. Specify a size abbreviation for the output size:
  2490. @example
  2491. scale=qcif
  2492. @end example
  2493. which can also be written as:
  2494. @example
  2495. scale=size=qcif
  2496. @end example
  2497. @item
  2498. Scale the input to 2x:
  2499. @example
  2500. scale=2*iw:2*ih
  2501. @end example
  2502. @item
  2503. The above is the same as:
  2504. @example
  2505. scale=2*in_w:2*in_h
  2506. @end example
  2507. @item
  2508. Scale the input to 2x with forced interlaced scaling:
  2509. @example
  2510. scale=2*iw:2*ih:interl=1
  2511. @end example
  2512. @item
  2513. Scale the input to half size:
  2514. @example
  2515. scale=iw/2:ih/2
  2516. @end example
  2517. @item
  2518. Increase the width, and set the height to the same size:
  2519. @example
  2520. scale=3/2*iw:ow
  2521. @end example
  2522. @item
  2523. Seek for Greek harmony:
  2524. @example
  2525. scale=iw:1/PHI*iw
  2526. scale=ih*PHI:ih
  2527. @end example
  2528. @item
  2529. Increase the height, and set the width to 3/2 of the height:
  2530. @example
  2531. scale=3/2*oh:3/5*ih
  2532. @end example
  2533. @item
  2534. Increase the size, but make the size a multiple of the chroma:
  2535. @example
  2536. scale="trunc(3/2*iw/hsub)*hsub:trunc(3/2*ih/vsub)*vsub"
  2537. @end example
  2538. @item
  2539. Increase the width to a maximum of 500 pixels, keep the same input
  2540. aspect ratio:
  2541. @example
  2542. scale='min(500\, iw*3/2):-1'
  2543. @end example
  2544. @end itemize
  2545. @section select
  2546. Select frames to pass in output.
  2547. It accepts in input an expression, which is evaluated for each input
  2548. frame. If the expression is evaluated to a non-zero value, the frame
  2549. is selected and passed to the output, otherwise it is discarded.
  2550. The expression can contain the following constants:
  2551. @table @option
  2552. @item n
  2553. the sequential number of the filtered frame, starting from 0
  2554. @item selected_n
  2555. the sequential number of the selected frame, starting from 0
  2556. @item prev_selected_n
  2557. the sequential number of the last selected frame, NAN if undefined
  2558. @item TB
  2559. timebase of the input timestamps
  2560. @item pts
  2561. the PTS (Presentation TimeStamp) of the filtered video frame,
  2562. expressed in @var{TB} units, NAN if undefined
  2563. @item t
  2564. the PTS (Presentation TimeStamp) of the filtered video frame,
  2565. expressed in seconds, NAN if undefined
  2566. @item prev_pts
  2567. the PTS of the previously filtered video frame, NAN if undefined
  2568. @item prev_selected_pts
  2569. the PTS of the last previously filtered video frame, NAN if undefined
  2570. @item prev_selected_t
  2571. the PTS of the last previously selected video frame, NAN if undefined
  2572. @item start_pts
  2573. the PTS of the first video frame in the video, NAN if undefined
  2574. @item start_t
  2575. the time of the first video frame in the video, NAN if undefined
  2576. @item pict_type
  2577. the type of the filtered frame, can assume one of the following
  2578. values:
  2579. @table @option
  2580. @item I
  2581. @item P
  2582. @item B
  2583. @item S
  2584. @item SI
  2585. @item SP
  2586. @item BI
  2587. @end table
  2588. @item interlace_type
  2589. the frame interlace type, can assume one of the following values:
  2590. @table @option
  2591. @item PROGRESSIVE
  2592. the frame is progressive (not interlaced)
  2593. @item TOPFIRST
  2594. the frame is top-field-first
  2595. @item BOTTOMFIRST
  2596. the frame is bottom-field-first
  2597. @end table
  2598. @item key
  2599. 1 if the filtered frame is a key-frame, 0 otherwise
  2600. @item pos
  2601. the position in the file of the filtered frame, -1 if the information
  2602. is not available (e.g. for synthetic video)
  2603. @item scene
  2604. value between 0 and 1 to indicate a new scene; a low value reflects a low
  2605. probability for the current frame to introduce a new scene, while a higher
  2606. value means the current frame is more likely to be one (see the example below)
  2607. @end table
  2608. The default value of the select expression is "1".
  2609. Some examples follow:
  2610. @example
  2611. # select all frames in input
  2612. select
  2613. # the above is the same as:
  2614. select=1
  2615. # skip all frames:
  2616. select=0
  2617. # select only I-frames
  2618. select='eq(pict_type\,I)'
  2619. # select one frame every 100
  2620. select='not(mod(n\,100))'
  2621. # select only frames contained in the 10-20 time interval
  2622. select='gte(t\,10)*lte(t\,20)'
  2623. # select only I frames contained in the 10-20 time interval
  2624. select='gte(t\,10)*lte(t\,20)*eq(pict_type\,I)'
  2625. # select frames with a minimum distance of 10 seconds
  2626. select='isnan(prev_selected_t)+gte(t-prev_selected_t\,10)'
  2627. @end example
  2628. Complete example to create a mosaic of the first scenes:
  2629. @example
  2630. ffmpeg -i video.avi -vf select='gt(scene\,0.4)',scale=160:120,tile -frames:v 1 preview.png
  2631. @end example
  2632. Comparing @var{scene} against a value between 0.3 and 0.5 is generally a sane
  2633. choice.
  2634. @section setdar, setsar
  2635. The @code{setdar} filter sets the Display Aspect Ratio for the filter
  2636. output video.
  2637. This is done by changing the specified Sample (aka Pixel) Aspect
  2638. Ratio, according to the following equation:
  2639. @example
  2640. @var{DAR} = @var{HORIZONTAL_RESOLUTION} / @var{VERTICAL_RESOLUTION} * @var{SAR}
  2641. @end example
  2642. Keep in mind that the @code{setdar} filter does not modify the pixel
  2643. dimensions of the video frame. Also the display aspect ratio set by
  2644. this filter may be changed by later filters in the filterchain,
  2645. e.g. in case of scaling or if another "setdar" or a "setsar" filter is
  2646. applied.
  2647. The @code{setsar} filter sets the Sample (aka Pixel) Aspect Ratio for
  2648. the filter output video.
  2649. Note that as a consequence of the application of this filter, the
  2650. output display aspect ratio will change according to the equation
  2651. above.
  2652. Keep in mind that the sample aspect ratio set by the @code{setsar}
  2653. filter may be changed by later filters in the filterchain, e.g. if
  2654. another "setsar" or a "setdar" filter is applied.
  2655. The @code{setdar} and @code{setsar} filters accept a string in the
  2656. form @var{num}:@var{den} expressing an aspect ratio, or the following
  2657. named options, expressed as a sequence of @var{key}=@var{value} pairs,
  2658. separated by ":".
  2659. @table @option
  2660. @item max
  2661. Set the maximum integer value to use for expressing numerator and
  2662. denominator when reducing the expressed aspect ratio to a rational.
  2663. Default value is @code{100}.
  2664. @item r, ratio:
  2665. Set the aspect ratio used by the filter.
  2666. The parameter can be a floating point number string, an expression, or
  2667. a string of the form @var{num}:@var{den}, where @var{num} and
  2668. @var{den} are the numerator and denominator of the aspect ratio. If
  2669. the parameter is not specified, it is assumed the value "0".
  2670. In case the form "@var{num}:@var{den}" the @code{:} character should
  2671. be escaped.
  2672. @end table
  2673. If the keys are omitted in the named options list, the specifed values
  2674. are assumed to be @var{ratio} and @var{max} in that order.
  2675. For example to change the display aspect ratio to 16:9, specify:
  2676. @example
  2677. setdar='16:9'
  2678. @end example
  2679. The example above is equivalent to:
  2680. @example
  2681. setdar=1.77777
  2682. @end example
  2683. To change the sample aspect ratio to 10:11, specify:
  2684. @example
  2685. setsar='10:11'
  2686. @end example
  2687. To set a display aspect ratio of 16:9, and specify a maximum integer value of
  2688. 1000 in the aspect ratio reduction, use the command:
  2689. @example
  2690. setdar=ratio='16:9':max=1000
  2691. @end example
  2692. @section setfield
  2693. Force field for the output video frame.
  2694. The @code{setfield} filter marks the interlace type field for the
  2695. output frames. It does not change the input frame, but only sets the
  2696. corresponding property, which affects how the frame is treated by
  2697. following filters (e.g. @code{fieldorder} or @code{yadif}).
  2698. It accepts a string parameter, which can assume the following values:
  2699. @table @samp
  2700. @item auto
  2701. Keep the same field property.
  2702. @item bff
  2703. Mark the frame as bottom-field-first.
  2704. @item tff
  2705. Mark the frame as top-field-first.
  2706. @item prog
  2707. Mark the frame as progressive.
  2708. @end table
  2709. @section showinfo
  2710. Show a line containing various information for each input video frame.
  2711. The input video is not modified.
  2712. The shown line contains a sequence of key/value pairs of the form
  2713. @var{key}:@var{value}.
  2714. A description of each shown parameter follows:
  2715. @table @option
  2716. @item n
  2717. sequential number of the input frame, starting from 0
  2718. @item pts
  2719. Presentation TimeStamp of the input frame, expressed as a number of
  2720. time base units. The time base unit depends on the filter input pad.
  2721. @item pts_time
  2722. Presentation TimeStamp of the input frame, expressed as a number of
  2723. seconds
  2724. @item pos
  2725. position of the frame in the input stream, -1 if this information in
  2726. unavailable and/or meaningless (for example in case of synthetic video)
  2727. @item fmt
  2728. pixel format name
  2729. @item sar
  2730. sample aspect ratio of the input frame, expressed in the form
  2731. @var{num}/@var{den}
  2732. @item s
  2733. size of the input frame, expressed in the form
  2734. @var{width}x@var{height}
  2735. @item i
  2736. interlaced mode ("P" for "progressive", "T" for top field first, "B"
  2737. for bottom field first)
  2738. @item iskey
  2739. 1 if the frame is a key frame, 0 otherwise
  2740. @item type
  2741. picture type of the input frame ("I" for an I-frame, "P" for a
  2742. P-frame, "B" for a B-frame, "?" for unknown type).
  2743. Check also the documentation of the @code{AVPictureType} enum and of
  2744. the @code{av_get_picture_type_char} function defined in
  2745. @file{libavutil/avutil.h}.
  2746. @item checksum
  2747. Adler-32 checksum (printed in hexadecimal) of all the planes of the input frame
  2748. @item plane_checksum
  2749. Adler-32 checksum (printed in hexadecimal) of each plane of the input frame,
  2750. expressed in the form "[@var{c0} @var{c1} @var{c2} @var{c3}]"
  2751. @end table
  2752. @section smartblur
  2753. Blur the input video without impacting the outlines.
  2754. The filter accepts the following parameters:
  2755. @var{luma_radius}:@var{luma_strength}:@var{luma_threshold}[:@var{chroma_radius}:@var{chroma_strength}:@var{chroma_threshold}]
  2756. Parameters prefixed by @var{luma} indicate that they work on the
  2757. luminance of the pixels whereas parameters prefixed by @var{chroma}
  2758. refer to the chrominance of the pixels.
  2759. If the chroma parameters are not set, the luma parameters are used for
  2760. either the luminance and the chrominance of the pixels.
  2761. @var{luma_radius} or @var{chroma_radius} must be a float number in the
  2762. range [0.1,5.0] that specifies the variance of the gaussian filter
  2763. used to blur the image (slower if larger).
  2764. @var{luma_strength} or @var{chroma_strength} must be a float number in
  2765. the range [-1.0,1.0] that configures the blurring. A value included in
  2766. [0.0,1.0] will blur the image whereas a value included in [-1.0,0.0]
  2767. will sharpen the image.
  2768. @var{luma_threshold} or @var{chroma_threshold} must be an integer in
  2769. the range [-30,30] that is used as a coefficient to determine whether
  2770. a pixel should be blurred or not. A value of 0 will filter all the
  2771. image, a value included in [0,30] will filter flat areas and a value
  2772. included in [-30,0] will filter edges.
  2773. @anchor{subtitles}
  2774. @section subtitles
  2775. Draw subtitles on top of input video using the libass library.
  2776. To enable compilation of this filter you need to configure FFmpeg with
  2777. @code{--enable-libass}. This filter also requires a build with libavcodec and
  2778. libavformat to convert the passed subtitles file to ASS (Advanced Substation
  2779. Alpha) subtitles format.
  2780. This filter accepts the following named options, expressed as a
  2781. sequence of @var{key}=@var{value} pairs, separated by ":".
  2782. @table @option
  2783. @item filename, f
  2784. Set the filename of the subtitle file to read. It must be specified.
  2785. @item original_size
  2786. Specify the size of the original video, the video for which the ASS file
  2787. was composed. Due to a misdesign in ASS aspect ratio arithmetic, this is
  2788. necessary to correctly scale the fonts if the aspect ratio has been changed.
  2789. @end table
  2790. If the first key is not specified, it is assumed that the first value
  2791. specifies the @option{filename}.
  2792. For example, to render the file @file{sub.srt} on top of the input
  2793. video, use the command:
  2794. @example
  2795. subtitles=sub.srt
  2796. @end example
  2797. which is equivalent to:
  2798. @example
  2799. subtitles=filename=sub.srt
  2800. @end example
  2801. @section split
  2802. Split input video into several identical outputs.
  2803. The filter accepts a single parameter which specifies the number of outputs. If
  2804. unspecified, it defaults to 2.
  2805. For example
  2806. @example
  2807. ffmpeg -i INPUT -filter_complex split=5 OUTPUT
  2808. @end example
  2809. will create 5 copies of the input video.
  2810. For example:
  2811. @example
  2812. [in] split [splitout1][splitout2];
  2813. [splitout1] crop=100:100:0:0 [cropout];
  2814. [splitout2] pad=200:200:100:100 [padout];
  2815. @end example
  2816. will create two separate outputs from the same input, one cropped and
  2817. one padded.
  2818. @section super2xsai
  2819. Scale the input by 2x and smooth using the Super2xSaI (Scale and
  2820. Interpolate) pixel art scaling algorithm.
  2821. Useful for enlarging pixel art images without reducing sharpness.
  2822. @section swapuv
  2823. Swap U & V plane.
  2824. @section thumbnail
  2825. Select the most representative frame in a given sequence of consecutive frames.
  2826. It accepts as argument the frames batch size to analyze (default @var{N}=100);
  2827. in a set of @var{N} frames, the filter will pick one of them, and then handle
  2828. the next batch of @var{N} frames until the end.
  2829. Since the filter keeps track of the whole frames sequence, a bigger @var{N}
  2830. value will result in a higher memory usage, so a high value is not recommended.
  2831. The following example extract one picture each 50 frames:
  2832. @example
  2833. thumbnail=50
  2834. @end example
  2835. Complete example of a thumbnail creation with @command{ffmpeg}:
  2836. @example
  2837. ffmpeg -i in.avi -vf thumbnail,scale=300:200 -frames:v 1 out.png
  2838. @end example
  2839. @section tile
  2840. Tile several successive frames together.
  2841. It accepts a list of options in the form of @var{key}=@var{value} pairs
  2842. separated by ":". A description of the accepted options follows.
  2843. @table @option
  2844. @item layout
  2845. Set the grid size (i.e. the number of lines and columns) in the form
  2846. "@var{w}x@var{h}".
  2847. @item margin
  2848. Set the outer border margin in pixels.
  2849. @item padding
  2850. Set the inner border thickness (i.e. the number of pixels between frames). For
  2851. more advanced padding options (such as having different values for the edges),
  2852. refer to the pad video filter.
  2853. @item nb_frames
  2854. Set the maximum number of frames to render in the given area. It must be less
  2855. than or equal to @var{w}x@var{h}. The default value is @code{0}, meaning all
  2856. the area will be used.
  2857. @end table
  2858. Alternatively, the options can be specified as a flat string:
  2859. @var{layout}[:@var{nb_frames}[:@var{margin}[:@var{padding}]]]
  2860. For example, produce 8×8 PNG tiles of all keyframes (@option{-skip_frame
  2861. nokey}) in a movie:
  2862. @example
  2863. ffmpeg -skip_frame nokey -i file.avi -vf 'scale=128:72,tile=8x8' -an -vsync 0 keyframes%03d.png
  2864. @end example
  2865. The @option{-vsync 0} is necessary to prevent @command{ffmpeg} from
  2866. duplicating each output frame to accomodate the originally detected frame
  2867. rate.
  2868. Another example to display @code{5} pictures in an area of @code{3x2} frames,
  2869. with @code{7} pixels between them, and @code{2} pixels of initial margin, using
  2870. mixed flat and named options:
  2871. @example
  2872. tile=3x2:nb_frames=5:padding=7:margin=2
  2873. @end example
  2874. @section tinterlace
  2875. Perform various types of temporal field interlacing.
  2876. Frames are counted starting from 1, so the first input frame is
  2877. considered odd.
  2878. This filter accepts a single parameter specifying the mode. Available
  2879. modes are:
  2880. @table @samp
  2881. @item merge, 0
  2882. Move odd frames into the upper field, even into the lower field,
  2883. generating a double height frame at half framerate.
  2884. @item drop_odd, 1
  2885. Only output even frames, odd frames are dropped, generating a frame with
  2886. unchanged height at half framerate.
  2887. @item drop_even, 2
  2888. Only output odd frames, even frames are dropped, generating a frame with
  2889. unchanged height at half framerate.
  2890. @item pad, 3
  2891. Expand each frame to full height, but pad alternate lines with black,
  2892. generating a frame with double height at the same input framerate.
  2893. @item interleave_top, 4
  2894. Interleave the upper field from odd frames with the lower field from
  2895. even frames, generating a frame with unchanged height at half framerate.
  2896. @item interleave_bottom, 5
  2897. Interleave the lower field from odd frames with the upper field from
  2898. even frames, generating a frame with unchanged height at half framerate.
  2899. @item interlacex2, 6
  2900. Double frame rate with unchanged height. Frames are inserted each
  2901. containing the second temporal field from the previous input frame and
  2902. the first temporal field from the next input frame. This mode relies on
  2903. the top_field_first flag. Useful for interlaced video displays with no
  2904. field synchronisation.
  2905. @end table
  2906. Numeric values are deprecated but are accepted for backward
  2907. compatibility reasons.
  2908. Default mode is @code{merge}.
  2909. @section transpose
  2910. Transpose rows with columns in the input video and optionally flip it.
  2911. This filter accepts the following named parameters:
  2912. @table @option
  2913. @item dir
  2914. Specify the transposition direction. Can assume the following values:
  2915. @table @samp
  2916. @item 0, 4
  2917. Rotate by 90 degrees counterclockwise and vertically flip (default), that is:
  2918. @example
  2919. L.R L.l
  2920. . . -> . .
  2921. l.r R.r
  2922. @end example
  2923. @item 1, 5
  2924. Rotate by 90 degrees clockwise, that is:
  2925. @example
  2926. L.R l.L
  2927. . . -> . .
  2928. l.r r.R
  2929. @end example
  2930. @item 2, 6
  2931. Rotate by 90 degrees counterclockwise, that is:
  2932. @example
  2933. L.R R.r
  2934. . . -> . .
  2935. l.r L.l
  2936. @end example
  2937. @item 3, 7
  2938. Rotate by 90 degrees clockwise and vertically flip, that is:
  2939. @example
  2940. L.R r.R
  2941. . . -> . .
  2942. l.r l.L
  2943. @end example
  2944. @end table
  2945. For values between 4-7, the transposition is only done if the input
  2946. video geometry is portrait and not landscape. These values are
  2947. deprecated, the @code{passthrough} option should be used instead.
  2948. @item passthrough
  2949. Do not apply the transposition if the input geometry matches the one
  2950. specified by the specified value. It accepts the following values:
  2951. @table @samp
  2952. @item none
  2953. Always apply transposition.
  2954. @item portrait
  2955. Preserve portrait geometry (when @var{height} >= @var{width}).
  2956. @item landscape
  2957. Preserve landscape geometry (when @var{width} >= @var{height}).
  2958. @end table
  2959. Default value is @code{none}.
  2960. @end table
  2961. @section unsharp
  2962. Sharpen or blur the input video.
  2963. It accepts the following parameters:
  2964. @var{luma_msize_x}:@var{luma_msize_y}:@var{luma_amount}:@var{chroma_msize_x}:@var{chroma_msize_y}:@var{chroma_amount}
  2965. Negative values for the amount will blur the input video, while positive
  2966. values will sharpen. All parameters are optional and default to the
  2967. equivalent of the string '5:5:1.0:5:5:0.0'.
  2968. @table @option
  2969. @item luma_msize_x
  2970. Set the luma matrix horizontal size. It can be an integer between 3
  2971. and 13, default value is 5.
  2972. @item luma_msize_y
  2973. Set the luma matrix vertical size. It can be an integer between 3
  2974. and 13, default value is 5.
  2975. @item luma_amount
  2976. Set the luma effect strength. It can be a float number between -2.0
  2977. and 5.0, default value is 1.0.
  2978. @item chroma_msize_x
  2979. Set the chroma matrix horizontal size. It can be an integer between 3
  2980. and 13, default value is 5.
  2981. @item chroma_msize_y
  2982. Set the chroma matrix vertical size. It can be an integer between 3
  2983. and 13, default value is 5.
  2984. @item chroma_amount
  2985. Set the chroma effect strength. It can be a float number between -2.0
  2986. and 5.0, default value is 0.0.
  2987. @end table
  2988. @example
  2989. # Strong luma sharpen effect parameters
  2990. unsharp=7:7:2.5
  2991. # Strong blur of both luma and chroma parameters
  2992. unsharp=7:7:-2:7:7:-2
  2993. # Use the default values with @command{ffmpeg}
  2994. ffmpeg -i in.avi -vf "unsharp" out.mp4
  2995. @end example
  2996. @section vflip
  2997. Flip the input video vertically.
  2998. @example
  2999. ffmpeg -i in.avi -vf "vflip" out.avi
  3000. @end example
  3001. @section yadif
  3002. Deinterlace the input video ("yadif" means "yet another deinterlacing
  3003. filter").
  3004. It accepts the optional parameters: @var{mode}:@var{parity}:@var{auto}.
  3005. @var{mode} specifies the interlacing mode to adopt, accepts one of the
  3006. following values:
  3007. @table @option
  3008. @item 0
  3009. output 1 frame for each frame
  3010. @item 1
  3011. output 1 frame for each field
  3012. @item 2
  3013. like 0 but skips spatial interlacing check
  3014. @item 3
  3015. like 1 but skips spatial interlacing check
  3016. @end table
  3017. Default value is 0.
  3018. @var{parity} specifies the picture field parity assumed for the input
  3019. interlaced video, accepts one of the following values:
  3020. @table @option
  3021. @item 0
  3022. assume top field first
  3023. @item 1
  3024. assume bottom field first
  3025. @item -1
  3026. enable automatic detection
  3027. @end table
  3028. Default value is -1.
  3029. If interlacing is unknown or decoder does not export this information,
  3030. top field first will be assumed.
  3031. @var{auto} specifies if deinterlacer should trust the interlaced flag
  3032. and only deinterlace frames marked as interlaced
  3033. @table @option
  3034. @item 0
  3035. deinterlace all frames
  3036. @item 1
  3037. only deinterlace frames marked as interlaced
  3038. @end table
  3039. Default value is 0.
  3040. @c man end VIDEO FILTERS
  3041. @chapter Video Sources
  3042. @c man begin VIDEO SOURCES
  3043. Below is a description of the currently available video sources.
  3044. @section buffer
  3045. Buffer video frames, and make them available to the filter chain.
  3046. This source is mainly intended for a programmatic use, in particular
  3047. through the interface defined in @file{libavfilter/vsrc_buffer.h}.
  3048. It accepts a list of options in the form of @var{key}=@var{value} pairs
  3049. separated by ":". A description of the accepted options follows.
  3050. @table @option
  3051. @item video_size
  3052. Specify the size (width and height) of the buffered video frames.
  3053. @item pix_fmt
  3054. A string representing the pixel format of the buffered video frames.
  3055. It may be a number corresponding to a pixel format, or a pixel format
  3056. name.
  3057. @item time_base
  3058. Specify the timebase assumed by the timestamps of the buffered frames.
  3059. @item time_base
  3060. Specify the frame rate expected for the video stream.
  3061. @item pixel_aspect
  3062. Specify the sample aspect ratio assumed by the video frames.
  3063. @item sws_param
  3064. Specify the optional parameters to be used for the scale filter which
  3065. is automatically inserted when an input change is detected in the
  3066. input size or format.
  3067. @end table
  3068. For example:
  3069. @example
  3070. buffer=size=320x240:pix_fmt=yuv410p:time_base=1/24:pixel_aspect=1/1
  3071. @end example
  3072. will instruct the source to accept video frames with size 320x240 and
  3073. with format "yuv410p", assuming 1/24 as the timestamps timebase and
  3074. square pixels (1:1 sample aspect ratio).
  3075. Since the pixel format with name "yuv410p" corresponds to the number 6
  3076. (check the enum AVPixelFormat definition in @file{libavutil/pixfmt.h}),
  3077. this example corresponds to:
  3078. @example
  3079. buffer=size=320x240:pixfmt=6:time_base=1/24:pixel_aspect=1/1
  3080. @end example
  3081. Alternatively, the options can be specified as a flat string, but this
  3082. syntax is deprecated:
  3083. @var{width}:@var{height}:@var{pix_fmt}:@var{time_base.num}:@var{time_base.den}:@var{pixel_aspect.num}:@var{pixel_aspect.den}[:@var{sws_param}]
  3084. @section cellauto
  3085. Create a pattern generated by an elementary cellular automaton.
  3086. The initial state of the cellular automaton can be defined through the
  3087. @option{filename}, and @option{pattern} options. If such options are
  3088. not specified an initial state is created randomly.
  3089. At each new frame a new row in the video is filled with the result of
  3090. the cellular automaton next generation. The behavior when the whole
  3091. frame is filled is defined by the @option{scroll} option.
  3092. This source accepts a list of options in the form of
  3093. @var{key}=@var{value} pairs separated by ":". A description of the
  3094. accepted options follows.
  3095. @table @option
  3096. @item filename, f
  3097. Read the initial cellular automaton state, i.e. the starting row, from
  3098. the specified file.
  3099. In the file, each non-whitespace character is considered an alive
  3100. cell, a newline will terminate the row, and further characters in the
  3101. file will be ignored.
  3102. @item pattern, p
  3103. Read the initial cellular automaton state, i.e. the starting row, from
  3104. the specified string.
  3105. Each non-whitespace character in the string is considered an alive
  3106. cell, a newline will terminate the row, and further characters in the
  3107. string will be ignored.
  3108. @item rate, r
  3109. Set the video rate, that is the number of frames generated per second.
  3110. Default is 25.
  3111. @item random_fill_ratio, ratio
  3112. Set the random fill ratio for the initial cellular automaton row. It
  3113. is a floating point number value ranging from 0 to 1, defaults to
  3114. 1/PHI.
  3115. This option is ignored when a file or a pattern is specified.
  3116. @item random_seed, seed
  3117. Set the seed for filling randomly the initial row, must be an integer
  3118. included between 0 and UINT32_MAX. If not specified, or if explicitly
  3119. set to -1, the filter will try to use a good random seed on a best
  3120. effort basis.
  3121. @item rule
  3122. Set the cellular automaton rule, it is a number ranging from 0 to 255.
  3123. Default value is 110.
  3124. @item size, s
  3125. Set the size of the output video.
  3126. If @option{filename} or @option{pattern} is specified, the size is set
  3127. by default to the width of the specified initial state row, and the
  3128. height is set to @var{width} * PHI.
  3129. If @option{size} is set, it must contain the width of the specified
  3130. pattern string, and the specified pattern will be centered in the
  3131. larger row.
  3132. If a filename or a pattern string is not specified, the size value
  3133. defaults to "320x518" (used for a randomly generated initial state).
  3134. @item scroll
  3135. If set to 1, scroll the output upward when all the rows in the output
  3136. have been already filled. If set to 0, the new generated row will be
  3137. written over the top row just after the bottom row is filled.
  3138. Defaults to 1.
  3139. @item start_full, full
  3140. If set to 1, completely fill the output with generated rows before
  3141. outputting the first frame.
  3142. This is the default behavior, for disabling set the value to 0.
  3143. @item stitch
  3144. If set to 1, stitch the left and right row edges together.
  3145. This is the default behavior, for disabling set the value to 0.
  3146. @end table
  3147. @subsection Examples
  3148. @itemize
  3149. @item
  3150. Read the initial state from @file{pattern}, and specify an output of
  3151. size 200x400.
  3152. @example
  3153. cellauto=f=pattern:s=200x400
  3154. @end example
  3155. @item
  3156. Generate a random initial row with a width of 200 cells, with a fill
  3157. ratio of 2/3:
  3158. @example
  3159. cellauto=ratio=2/3:s=200x200
  3160. @end example
  3161. @item
  3162. Create a pattern generated by rule 18 starting by a single alive cell
  3163. centered on an initial row with width 100:
  3164. @example
  3165. cellauto=p=@@:s=100x400:full=0:rule=18
  3166. @end example
  3167. @item
  3168. Specify a more elaborated initial pattern:
  3169. @example
  3170. cellauto=p='@@@@ @@ @@@@':s=100x400:full=0:rule=18
  3171. @end example
  3172. @end itemize
  3173. @section mandelbrot
  3174. Generate a Mandelbrot set fractal, and progressively zoom towards the
  3175. point specified with @var{start_x} and @var{start_y}.
  3176. This source accepts a list of options in the form of
  3177. @var{key}=@var{value} pairs separated by ":". A description of the
  3178. accepted options follows.
  3179. @table @option
  3180. @item end_pts
  3181. Set the terminal pts value. Default value is 400.
  3182. @item end_scale
  3183. Set the terminal scale value.
  3184. Must be a floating point value. Default value is 0.3.
  3185. @item inner
  3186. Set the inner coloring mode, that is the algorithm used to draw the
  3187. Mandelbrot fractal internal region.
  3188. It shall assume one of the following values:
  3189. @table @option
  3190. @item black
  3191. Set black mode.
  3192. @item convergence
  3193. Show time until convergence.
  3194. @item mincol
  3195. Set color based on point closest to the origin of the iterations.
  3196. @item period
  3197. Set period mode.
  3198. @end table
  3199. Default value is @var{mincol}.
  3200. @item bailout
  3201. Set the bailout value. Default value is 10.0.
  3202. @item maxiter
  3203. Set the maximum of iterations performed by the rendering
  3204. algorithm. Default value is 7189.
  3205. @item outer
  3206. Set outer coloring mode.
  3207. It shall assume one of following values:
  3208. @table @option
  3209. @item iteration_count
  3210. Set iteration cound mode.
  3211. @item normalized_iteration_count
  3212. set normalized iteration count mode.
  3213. @end table
  3214. Default value is @var{normalized_iteration_count}.
  3215. @item rate, r
  3216. Set frame rate, expressed as number of frames per second. Default
  3217. value is "25".
  3218. @item size, s
  3219. Set frame size. Default value is "640x480".
  3220. @item start_scale
  3221. Set the initial scale value. Default value is 3.0.
  3222. @item start_x
  3223. Set the initial x position. Must be a floating point value between
  3224. -100 and 100. Default value is -0.743643887037158704752191506114774.
  3225. @item start_y
  3226. Set the initial y position. Must be a floating point value between
  3227. -100 and 100. Default value is -0.131825904205311970493132056385139.
  3228. @end table
  3229. @section mptestsrc
  3230. Generate various test patterns, as generated by the MPlayer test filter.
  3231. The size of the generated video is fixed, and is 256x256.
  3232. This source is useful in particular for testing encoding features.
  3233. This source accepts an optional sequence of @var{key}=@var{value} pairs,
  3234. separated by ":". The description of the accepted options follows.
  3235. @table @option
  3236. @item rate, r
  3237. Specify the frame rate of the sourced video, as the number of frames
  3238. generated per second. It has to be a string in the format
  3239. @var{frame_rate_num}/@var{frame_rate_den}, an integer number, a float
  3240. number or a valid video frame rate abbreviation. The default value is
  3241. "25".
  3242. @item duration, d
  3243. Set the video duration of the sourced video. The accepted syntax is:
  3244. @example
  3245. [-]HH:MM:SS[.m...]
  3246. [-]S+[.m...]
  3247. @end example
  3248. See also the function @code{av_parse_time()}.
  3249. If not specified, or the expressed duration is negative, the video is
  3250. supposed to be generated forever.
  3251. @item test, t
  3252. Set the number or the name of the test to perform. Supported tests are:
  3253. @table @option
  3254. @item dc_luma
  3255. @item dc_chroma
  3256. @item freq_luma
  3257. @item freq_chroma
  3258. @item amp_luma
  3259. @item amp_chroma
  3260. @item cbp
  3261. @item mv
  3262. @item ring1
  3263. @item ring2
  3264. @item all
  3265. @end table
  3266. Default value is "all", which will cycle through the list of all tests.
  3267. @end table
  3268. For example the following:
  3269. @example
  3270. testsrc=t=dc_luma
  3271. @end example
  3272. will generate a "dc_luma" test pattern.
  3273. @section frei0r_src
  3274. Provide a frei0r source.
  3275. To enable compilation of this filter you need to install the frei0r
  3276. header and configure FFmpeg with @code{--enable-frei0r}.
  3277. The source supports the syntax:
  3278. @example
  3279. @var{size}:@var{rate}:@var{src_name}[@{=|:@}@var{param1}:@var{param2}:...:@var{paramN}]
  3280. @end example
  3281. @var{size} is the size of the video to generate, may be a string of the
  3282. form @var{width}x@var{height} or a frame size abbreviation.
  3283. @var{rate} is the rate of the video to generate, may be a string of
  3284. the form @var{num}/@var{den} or a frame rate abbreviation.
  3285. @var{src_name} is the name to the frei0r source to load. For more
  3286. information regarding frei0r and how to set the parameters read the
  3287. section @ref{frei0r} in the description of the video filters.
  3288. For example, to generate a frei0r partik0l source with size 200x200
  3289. and frame rate 10 which is overlayed on the overlay filter main input:
  3290. @example
  3291. frei0r_src=200x200:10:partik0l=1234 [overlay]; [in][overlay] overlay
  3292. @end example
  3293. @section life
  3294. Generate a life pattern.
  3295. This source is based on a generalization of John Conway's life game.
  3296. The sourced input represents a life grid, each pixel represents a cell
  3297. which can be in one of two possible states, alive or dead. Every cell
  3298. interacts with its eight neighbours, which are the cells that are
  3299. horizontally, vertically, or diagonally adjacent.
  3300. At each interaction the grid evolves according to the adopted rule,
  3301. which specifies the number of neighbor alive cells which will make a
  3302. cell stay alive or born. The @option{rule} option allows to specify
  3303. the rule to adopt.
  3304. This source accepts a list of options in the form of
  3305. @var{key}=@var{value} pairs separated by ":". A description of the
  3306. accepted options follows.
  3307. @table @option
  3308. @item filename, f
  3309. Set the file from which to read the initial grid state. In the file,
  3310. each non-whitespace character is considered an alive cell, and newline
  3311. is used to delimit the end of each row.
  3312. If this option is not specified, the initial grid is generated
  3313. randomly.
  3314. @item rate, r
  3315. Set the video rate, that is the number of frames generated per second.
  3316. Default is 25.
  3317. @item random_fill_ratio, ratio
  3318. Set the random fill ratio for the initial random grid. It is a
  3319. floating point number value ranging from 0 to 1, defaults to 1/PHI.
  3320. It is ignored when a file is specified.
  3321. @item random_seed, seed
  3322. Set the seed for filling the initial random grid, must be an integer
  3323. included between 0 and UINT32_MAX. If not specified, or if explicitly
  3324. set to -1, the filter will try to use a good random seed on a best
  3325. effort basis.
  3326. @item rule
  3327. Set the life rule.
  3328. A rule can be specified with a code of the kind "S@var{NS}/B@var{NB}",
  3329. where @var{NS} and @var{NB} are sequences of numbers in the range 0-8,
  3330. @var{NS} specifies the number of alive neighbor cells which make a
  3331. live cell stay alive, and @var{NB} the number of alive neighbor cells
  3332. which make a dead cell to become alive (i.e. to "born").
  3333. "s" and "b" can be used in place of "S" and "B", respectively.
  3334. Alternatively a rule can be specified by an 18-bits integer. The 9
  3335. high order bits are used to encode the next cell state if it is alive
  3336. for each number of neighbor alive cells, the low order bits specify
  3337. the rule for "borning" new cells. Higher order bits encode for an
  3338. higher number of neighbor cells.
  3339. For example the number 6153 = @code{(12<<9)+9} specifies a stay alive
  3340. rule of 12 and a born rule of 9, which corresponds to "S23/B03".
  3341. Default value is "S23/B3", which is the original Conway's game of life
  3342. rule, and will keep a cell alive if it has 2 or 3 neighbor alive
  3343. cells, and will born a new cell if there are three alive cells around
  3344. a dead cell.
  3345. @item size, s
  3346. Set the size of the output video.
  3347. If @option{filename} is specified, the size is set by default to the
  3348. same size of the input file. If @option{size} is set, it must contain
  3349. the size specified in the input file, and the initial grid defined in
  3350. that file is centered in the larger resulting area.
  3351. If a filename is not specified, the size value defaults to "320x240"
  3352. (used for a randomly generated initial grid).
  3353. @item stitch
  3354. If set to 1, stitch the left and right grid edges together, and the
  3355. top and bottom edges also. Defaults to 1.
  3356. @item mold
  3357. Set cell mold speed. If set, a dead cell will go from @option{death_color} to
  3358. @option{mold_color} with a step of @option{mold}. @option{mold} can have a
  3359. value from 0 to 255.
  3360. @item life_color
  3361. Set the color of living (or new born) cells.
  3362. @item death_color
  3363. Set the color of dead cells. If @option{mold} is set, this is the first color
  3364. used to represent a dead cell.
  3365. @item mold_color
  3366. Set mold color, for definitely dead and moldy cells.
  3367. @end table
  3368. @subsection Examples
  3369. @itemize
  3370. @item
  3371. Read a grid from @file{pattern}, and center it on a grid of size
  3372. 300x300 pixels:
  3373. @example
  3374. life=f=pattern:s=300x300
  3375. @end example
  3376. @item
  3377. Generate a random grid of size 200x200, with a fill ratio of 2/3:
  3378. @example
  3379. life=ratio=2/3:s=200x200
  3380. @end example
  3381. @item
  3382. Specify a custom rule for evolving a randomly generated grid:
  3383. @example
  3384. life=rule=S14/B34
  3385. @end example
  3386. @item
  3387. Full example with slow death effect (mold) using @command{ffplay}:
  3388. @example
  3389. ffplay -f lavfi life=s=300x200:mold=10:r=60:ratio=0.1:death_color=#C83232:life_color=#00ff00,scale=1200:800:flags=16
  3390. @end example
  3391. @end itemize
  3392. @section color, nullsrc, rgbtestsrc, smptebars, testsrc
  3393. The @code{color} source provides an uniformly colored input.
  3394. The @code{nullsrc} source returns unprocessed video frames. It is
  3395. mainly useful to be employed in analysis / debugging tools, or as the
  3396. source for filters which ignore the input data.
  3397. The @code{rgbtestsrc} source generates an RGB test pattern useful for
  3398. detecting RGB vs BGR issues. You should see a red, green and blue
  3399. stripe from top to bottom.
  3400. The @code{smptebars} source generates a color bars pattern, based on
  3401. the SMPTE Engineering Guideline EG 1-1990.
  3402. The @code{testsrc} source generates a test video pattern, showing a
  3403. color pattern, a scrolling gradient and a timestamp. This is mainly
  3404. intended for testing purposes.
  3405. These sources accept an optional sequence of @var{key}=@var{value} pairs,
  3406. separated by ":". The description of the accepted options follows.
  3407. @table @option
  3408. @item color, c
  3409. Specify the color of the source, only used in the @code{color}
  3410. source. It can be the name of a color (case insensitive match) or a
  3411. 0xRRGGBB[AA] sequence, possibly followed by an alpha specifier. The
  3412. default value is "black".
  3413. @item size, s
  3414. Specify the size of the sourced video, it may be a string of the form
  3415. @var{width}x@var{height}, or the name of a size abbreviation. The
  3416. default value is "320x240".
  3417. @item rate, r
  3418. Specify the frame rate of the sourced video, as the number of frames
  3419. generated per second. It has to be a string in the format
  3420. @var{frame_rate_num}/@var{frame_rate_den}, an integer number, a float
  3421. number or a valid video frame rate abbreviation. The default value is
  3422. "25".
  3423. @item sar
  3424. Set the sample aspect ratio of the sourced video.
  3425. @item duration, d
  3426. Set the video duration of the sourced video. The accepted syntax is:
  3427. @example
  3428. [-]HH[:MM[:SS[.m...]]]
  3429. [-]S+[.m...]
  3430. @end example
  3431. See also the function @code{av_parse_time()}.
  3432. If not specified, or the expressed duration is negative, the video is
  3433. supposed to be generated forever.
  3434. @item decimals, n
  3435. Set the number of decimals to show in the timestamp, only used in the
  3436. @code{testsrc} source.
  3437. The displayed timestamp value will correspond to the original
  3438. timestamp value multiplied by the power of 10 of the specified
  3439. value. Default value is 0.
  3440. @end table
  3441. For example the following:
  3442. @example
  3443. testsrc=duration=5.3:size=qcif:rate=10
  3444. @end example
  3445. will generate a video with a duration of 5.3 seconds, with size
  3446. 176x144 and a frame rate of 10 frames per second.
  3447. The following graph description will generate a red source
  3448. with an opacity of 0.2, with size "qcif" and a frame rate of 10
  3449. frames per second.
  3450. @example
  3451. color=c=red@@0.2:s=qcif:r=10
  3452. @end example
  3453. If the input content is to be ignored, @code{nullsrc} can be used. The
  3454. following command generates noise in the luminance plane by employing
  3455. the @code{geq} filter:
  3456. @example
  3457. nullsrc=s=256x256, geq=random(1)*255:128:128
  3458. @end example
  3459. @c man end VIDEO SOURCES
  3460. @chapter Video Sinks
  3461. @c man begin VIDEO SINKS
  3462. Below is a description of the currently available video sinks.
  3463. @section buffersink
  3464. Buffer video frames, and make them available to the end of the filter
  3465. graph.
  3466. This sink is mainly intended for a programmatic use, in particular
  3467. through the interface defined in @file{libavfilter/buffersink.h}.
  3468. It does not require a string parameter in input, but you need to
  3469. specify a pointer to a list of supported pixel formats terminated by
  3470. -1 in the opaque parameter provided to @code{avfilter_init_filter}
  3471. when initializing this sink.
  3472. @section nullsink
  3473. Null video sink, do absolutely nothing with the input video. It is
  3474. mainly useful as a template and to be employed in analysis / debugging
  3475. tools.
  3476. @c man end VIDEO SINKS
  3477. @chapter Multimedia Filters
  3478. @c man begin MULTIMEDIA FILTERS
  3479. Below is a description of the currently available multimedia filters.
  3480. @section asendcmd, sendcmd
  3481. Send commands to filters in the filtergraph.
  3482. These filters read commands to be sent to other filters in the
  3483. filtergraph.
  3484. @code{asendcmd} must be inserted between two audio filters,
  3485. @code{sendcmd} must be inserted between two video filters, but apart
  3486. from that they act the same way.
  3487. The specification of commands can be provided in the filter arguments
  3488. with the @var{commands} option, or in a file specified by the
  3489. @var{filename} option.
  3490. These filters accept the following options:
  3491. @table @option
  3492. @item commands, c
  3493. Set the commands to be read and sent to the other filters.
  3494. @item filename, f
  3495. Set the filename of the commands to be read and sent to the other
  3496. filters.
  3497. @end table
  3498. @subsection Commands syntax
  3499. A commands description consists of a sequence of interval
  3500. specifications, comprising a list of commands to be executed when a
  3501. particular event related to that interval occurs. The occurring event
  3502. is typically the current frame time entering or leaving a given time
  3503. interval.
  3504. An interval is specified by the following syntax:
  3505. @example
  3506. @var{START}[-@var{END}] @var{COMMANDS};
  3507. @end example
  3508. The time interval is specified by the @var{START} and @var{END} times.
  3509. @var{END} is optional and defaults to the maximum time.
  3510. The current frame time is considered within the specified interval if
  3511. it is included in the interval [@var{START}, @var{END}), that is when
  3512. the time is greater or equal to @var{START} and is lesser than
  3513. @var{END}.
  3514. @var{COMMANDS} consists of a sequence of one or more command
  3515. specifications, separated by ",", relating to that interval. The
  3516. syntax of a command specification is given by:
  3517. @example
  3518. [@var{FLAGS}] @var{TARGET} @var{COMMAND} @var{ARG}
  3519. @end example
  3520. @var{FLAGS} is optional and specifies the type of events relating to
  3521. the time interval which enable sending the specified command, and must
  3522. be a non-null sequence of identifier flags separated by "+" or "|" and
  3523. enclosed between "[" and "]".
  3524. The following flags are recognized:
  3525. @table @option
  3526. @item enter
  3527. The command is sent when the current frame timestamp enters the
  3528. specified interval. In other words, the command is sent when the
  3529. previous frame timestamp was not in the given interval, and the
  3530. current is.
  3531. @item leave
  3532. The command is sent when the current frame timestamp leaves the
  3533. specified interval. In other words, the command is sent when the
  3534. previous frame timestamp was in the given interval, and the
  3535. current is not.
  3536. @end table
  3537. If @var{FLAGS} is not specified, a default value of @code{[enter]} is
  3538. assumed.
  3539. @var{TARGET} specifies the target of the command, usually the name of
  3540. the filter class or a specific filter instance name.
  3541. @var{COMMAND} specifies the name of the command for the target filter.
  3542. @var{ARG} is optional and specifies the optional list of argument for
  3543. the given @var{COMMAND}.
  3544. Between one interval specification and another, whitespaces, or
  3545. sequences of characters starting with @code{#} until the end of line,
  3546. are ignored and can be used to annotate comments.
  3547. A simplified BNF description of the commands specification syntax
  3548. follows:
  3549. @example
  3550. @var{COMMAND_FLAG} ::= "enter" | "leave"
  3551. @var{COMMAND_FLAGS} ::= @var{COMMAND_FLAG} [(+|"|")@var{COMMAND_FLAG}]
  3552. @var{COMMAND} ::= ["[" @var{COMMAND_FLAGS} "]"] @var{TARGET} @var{COMMAND} [@var{ARG}]
  3553. @var{COMMANDS} ::= @var{COMMAND} [,@var{COMMANDS}]
  3554. @var{INTERVAL} ::= @var{START}[-@var{END}] @var{COMMANDS}
  3555. @var{INTERVALS} ::= @var{INTERVAL}[;@var{INTERVALS}]
  3556. @end example
  3557. @subsection Examples
  3558. @itemize
  3559. @item
  3560. Specify audio tempo change at second 4:
  3561. @example
  3562. asendcmd=c='4.0 atempo tempo 1.5',atempo
  3563. @end example
  3564. @item
  3565. Specify a list of drawtext and hue commands in a file.
  3566. @example
  3567. # show text in the interval 5-10
  3568. 5.0-10.0 [enter] drawtext reinit 'fontfile=FreeSerif.ttf:text=hello world',
  3569. [leave] drawtext reinit 'fontfile=FreeSerif.ttf:text=';
  3570. # desaturate the image in the interval 15-20
  3571. 15.0-20.0 [enter] hue reinit s=0,
  3572. [enter] drawtext reinit 'fontfile=FreeSerif.ttf:text=nocolor',
  3573. [leave] hue reinit s=1,
  3574. [leave] drawtext reinit 'fontfile=FreeSerif.ttf:text=color';
  3575. # apply an exponential saturation fade-out effect, starting from time 25
  3576. 25 [enter] hue s=exp(t-25)
  3577. @end example
  3578. A filtergraph allowing to read and process the above command list
  3579. stored in a file @file{test.cmd}, can be specified with:
  3580. @example
  3581. sendcmd=f=test.cmd,drawtext=fontfile=FreeSerif.ttf:text='',hue
  3582. @end example
  3583. @end itemize
  3584. @section asetpts, setpts
  3585. Change the PTS (presentation timestamp) of the input frames.
  3586. @code{asetpts} works on audio frames, @code{setpts} on video frames.
  3587. Accept in input an expression evaluated through the eval API, which
  3588. can contain the following constants:
  3589. @table @option
  3590. @item FRAME_RATE
  3591. frame rate, only defined for constant frame-rate video
  3592. @item PTS
  3593. the presentation timestamp in input
  3594. @item N
  3595. the count of the input frame, starting from 0.
  3596. @item NB_CONSUMED_SAMPLES
  3597. the number of consumed samples, not including the current frame (only
  3598. audio)
  3599. @item NB_SAMPLES
  3600. the number of samples in the current frame (only audio)
  3601. @item SAMPLE_RATE
  3602. audio sample rate
  3603. @item STARTPTS
  3604. the PTS of the first frame
  3605. @item STARTT
  3606. the time in seconds of the first frame
  3607. @item INTERLACED
  3608. tell if the current frame is interlaced
  3609. @item T
  3610. the time in seconds of the current frame
  3611. @item TB
  3612. the time base
  3613. @item POS
  3614. original position in the file of the frame, or undefined if undefined
  3615. for the current frame
  3616. @item PREV_INPTS
  3617. previous input PTS
  3618. @item PREV_INT
  3619. previous input time in seconds
  3620. @item PREV_OUTPTS
  3621. previous output PTS
  3622. @item PREV_OUTT
  3623. previous output time in seconds
  3624. @end table
  3625. @subsection Examples
  3626. @itemize
  3627. @item
  3628. Start counting PTS from zero
  3629. @example
  3630. setpts=PTS-STARTPTS
  3631. @end example
  3632. @item
  3633. Apply fast motion effect:
  3634. @example
  3635. setpts=0.5*PTS
  3636. @end example
  3637. @item
  3638. Apply slow motion effect:
  3639. @example
  3640. setpts=2.0*PTS
  3641. @end example
  3642. @item
  3643. Set fixed rate of 25 frames per second:
  3644. @example
  3645. setpts=N/(25*TB)
  3646. @end example
  3647. @item
  3648. Set fixed rate 25 fps with some jitter:
  3649. @example
  3650. setpts='1/(25*TB) * (N + 0.05 * sin(N*2*PI/25))'
  3651. @end example
  3652. @item
  3653. Apply an offset of 10 seconds to the input PTS:
  3654. @example
  3655. setpts=PTS+10/TB
  3656. @end example
  3657. @end itemize
  3658. @section ebur128
  3659. EBU R128 scanner filter. This filter takes an audio stream as input and outputs
  3660. it unchanged. By default, it logs a message at a frequency of 10Hz with the
  3661. Momentary loudness (identified by @code{M}), Short-term loudness (@code{S}),
  3662. Integrated loudness (@code{I}) and Loudness Range (@code{LRA}).
  3663. The filter also has a video output (see the @var{video} option) with a real
  3664. time graph to observe the loudness evolution. The graphic contains the logged
  3665. message mentioned above, so it is not printed anymore when this option is set,
  3666. unless the verbose logging is set. The main graphing area contains the
  3667. short-term loudness (3 seconds of analysis), and the gauge on the right is for
  3668. the momentary loudness (400 milliseconds).
  3669. More information about the Loudness Recommendation EBU R128 on
  3670. @url{http://tech.ebu.ch/loudness}.
  3671. The filter accepts the following named parameters:
  3672. @table @option
  3673. @item video
  3674. Activate the video output. The audio stream is passed unchanged whether this
  3675. option is set or no. The video stream will be the first output stream if
  3676. activated. Default is @code{0}.
  3677. @item size
  3678. Set the video size. This option is for video only. Default and minimum
  3679. resolution is @code{640x480}.
  3680. @item meter
  3681. Set the EBU scale meter. Default is @code{9}. Common values are @code{9} and
  3682. @code{18}, respectively for EBU scale meter +9 and EBU scale meter +18. Any
  3683. other integer value between this range is allowed.
  3684. @end table
  3685. Example of real-time graph using @command{ffplay}, with a EBU scale meter +18:
  3686. @example
  3687. ffplay -f lavfi -i "amovie=input.mp3,ebur128=video=1:meter=18 [out0][out1]"
  3688. @end example
  3689. Run an analysis with @command{ffmpeg}:
  3690. @example
  3691. ffmpeg -nostats -i input.mp3 -filter_complex ebur128 -f null -
  3692. @end example
  3693. @section settb, asettb
  3694. Set the timebase to use for the output frames timestamps.
  3695. It is mainly useful for testing timebase configuration.
  3696. It accepts in input an arithmetic expression representing a rational.
  3697. The expression can contain the constants "AVTB" (the
  3698. default timebase), "intb" (the input timebase) and "sr" (the sample rate,
  3699. audio only).
  3700. The default value for the input is "intb".
  3701. @subsection Examples
  3702. @itemize
  3703. @item
  3704. Set the timebase to 1/25:
  3705. @example
  3706. settb=1/25
  3707. @end example
  3708. @item
  3709. Set the timebase to 1/10:
  3710. @example
  3711. settb=0.1
  3712. @end example
  3713. @item
  3714. Set the timebase to 1001/1000:
  3715. @example
  3716. settb=1+0.001
  3717. @end example
  3718. @item
  3719. Set the timebase to 2*intb:
  3720. @example
  3721. settb=2*intb
  3722. @end example
  3723. @item
  3724. Set the default timebase value:
  3725. @example
  3726. settb=AVTB
  3727. @end example
  3728. @end itemize
  3729. @section concat
  3730. Concatenate audio and video streams, joining them together one after the
  3731. other.
  3732. The filter works on segments of synchronized video and audio streams. All
  3733. segments must have the same number of streams of each type, and that will
  3734. also be the number of streams at output.
  3735. The filter accepts the following named parameters:
  3736. @table @option
  3737. @item n
  3738. Set the number of segments. Default is 2.
  3739. @item v
  3740. Set the number of output video streams, that is also the number of video
  3741. streams in each segment. Default is 1.
  3742. @item a
  3743. Set the number of output audio streams, that is also the number of video
  3744. streams in each segment. Default is 0.
  3745. @item unsafe
  3746. Activate unsafe mode: do not fail if segments have a different format.
  3747. @end table
  3748. The filter has @var{v}+@var{a} outputs: first @var{v} video outputs, then
  3749. @var{a} audio outputs.
  3750. There are @var{n}×(@var{v}+@var{a}) inputs: first the inputs for the first
  3751. segment, in the same order as the outputs, then the inputs for the second
  3752. segment, etc.
  3753. Related streams do not always have exactly the same duration, for various
  3754. reasons including codec frame size or sloppy authoring. For that reason,
  3755. related synchronized streams (e.g. a video and its audio track) should be
  3756. concatenated at once. The concat filter will use the duration of the longest
  3757. stream in each segment (except the last one), and if necessary pad shorter
  3758. audio streams with silence.
  3759. For this filter to work correctly, all segments must start at timestamp 0.
  3760. All corresponding streams must have the same parameters in all segments; the
  3761. filtering system will automatically select a common pixel format for video
  3762. streams, and a common sample format, sample rate and channel layout for
  3763. audio streams, but other settings, such as resolution, must be converted
  3764. explicitly by the user.
  3765. Different frame rates are acceptable but will result in variable frame rate
  3766. at output; be sure to configure the output file to handle it.
  3767. Examples:
  3768. @itemize
  3769. @item
  3770. Concatenate an opening, an episode and an ending, all in bilingual version
  3771. (video in stream 0, audio in streams 1 and 2):
  3772. @example
  3773. ffmpeg -i opening.mkv -i episode.mkv -i ending.mkv -filter_complex \
  3774. '[0:0] [0:1] [0:2] [1:0] [1:1] [1:2] [2:0] [2:1] [2:2]
  3775. concat=n=3:v=1:a=2 [v] [a1] [a2]' \
  3776. -map '[v]' -map '[a1]' -map '[a2]' output.mkv
  3777. @end example
  3778. @item
  3779. Concatenate two parts, handling audio and video separately, using the
  3780. (a)movie sources, and adjusting the resolution:
  3781. @example
  3782. movie=part1.mp4, scale=512:288 [v1] ; amovie=part1.mp4 [a1] ;
  3783. movie=part2.mp4, scale=512:288 [v2] ; amovie=part2.mp4 [a2] ;
  3784. [v1] [v2] concat [outv] ; [a1] [a2] concat=v=0:a=1 [outa]
  3785. @end example
  3786. Note that a desync will happen at the stitch if the audio and video streams
  3787. do not have exactly the same duration in the first file.
  3788. @end itemize
  3789. @section showspectrum
  3790. Convert input audio to a video output, representing the audio frequency
  3791. spectrum.
  3792. The filter accepts the following named parameters:
  3793. @table @option
  3794. @item size, s
  3795. Specify the video size for the output. Default value is @code{640x480}.
  3796. @item slide
  3797. Specify if the spectrum should slide along the window. Default value is
  3798. @code{0}.
  3799. @end table
  3800. The usage is very similar to the showwaves filter; see the examples in that
  3801. section.
  3802. @section showwaves
  3803. Convert input audio to a video output, representing the samples waves.
  3804. The filter accepts the following named parameters:
  3805. @table @option
  3806. @item n
  3807. Set the number of samples which are printed on the same column. A
  3808. larger value will decrease the frame rate. Must be a positive
  3809. integer. This option can be set only if the value for @var{rate}
  3810. is not explicitly specified.
  3811. @item rate, r
  3812. Set the (approximate) output frame rate. This is done by setting the
  3813. option @var{n}. Default value is "25".
  3814. @item size, s
  3815. Specify the video size for the output. Default value is "600x240".
  3816. @end table
  3817. Some examples follow.
  3818. @itemize
  3819. @item
  3820. Output the input file audio and the corresponding video representation
  3821. at the same time:
  3822. @example
  3823. amovie=a.mp3,asplit[out0],showwaves[out1]
  3824. @end example
  3825. @item
  3826. Create a synthetic signal and show it with showwaves, forcing a
  3827. framerate of 30 frames per second:
  3828. @example
  3829. aevalsrc=sin(1*2*PI*t)*sin(880*2*PI*t):cos(2*PI*200*t),asplit[out0],showwaves=r=30[out1]
  3830. @end example
  3831. @end itemize
  3832. @c man end MULTIMEDIA FILTERS
  3833. @chapter Multimedia Sources
  3834. @c man begin MULTIMEDIA SOURCES
  3835. Below is a description of the currently available multimedia sources.
  3836. @section amovie
  3837. This is the same as @ref{src_movie} source, except it selects an audio
  3838. stream by default.
  3839. @anchor{src_movie}
  3840. @section movie
  3841. Read audio and/or video stream(s) from a movie container.
  3842. It accepts the syntax: @var{movie_name}[:@var{options}] where
  3843. @var{movie_name} is the name of the resource to read (not necessarily
  3844. a file but also a device or a stream accessed through some protocol),
  3845. and @var{options} is an optional sequence of @var{key}=@var{value}
  3846. pairs, separated by ":".
  3847. The description of the accepted options follows.
  3848. @table @option
  3849. @item format_name, f
  3850. Specifies the format assumed for the movie to read, and can be either
  3851. the name of a container or an input device. If not specified the
  3852. format is guessed from @var{movie_name} or by probing.
  3853. @item seek_point, sp
  3854. Specifies the seek point in seconds, the frames will be output
  3855. starting from this seek point, the parameter is evaluated with
  3856. @code{av_strtod} so the numerical value may be suffixed by an IS
  3857. postfix. Default value is "0".
  3858. @item streams, s
  3859. Specifies the streams to read. Several streams can be specified, separated
  3860. by "+". The source will then have as many outputs, in the same order. The
  3861. syntax is explained in the @ref{Stream specifiers} chapter. Two special
  3862. names, "dv" and "da" specify respectively the default (best suited) video
  3863. and audio stream. Default is "dv", or "da" if the filter is called as
  3864. "amovie".
  3865. @item stream_index, si
  3866. Specifies the index of the video stream to read. If the value is -1,
  3867. the best suited video stream will be automatically selected. Default
  3868. value is "-1". Deprecated. If the filter is called "amovie", it will select
  3869. audio instead of video.
  3870. @item loop
  3871. Specifies how many times to read the stream in sequence.
  3872. If the value is less than 1, the stream will be read again and again.
  3873. Default value is "1".
  3874. Note that when the movie is looped the source timestamps are not
  3875. changed, so it will generate non monotonically increasing timestamps.
  3876. @end table
  3877. This filter allows to overlay a second video on top of main input of
  3878. a filtergraph as shown in this graph:
  3879. @example
  3880. input -----------> deltapts0 --> overlay --> output
  3881. ^
  3882. |
  3883. movie --> scale--> deltapts1 -------+
  3884. @end example
  3885. Some examples follow.
  3886. @itemize
  3887. @item
  3888. Skip 3.2 seconds from the start of the avi file in.avi, and overlay it
  3889. on top of the input labelled as "in":
  3890. @example
  3891. movie=in.avi:seek_point=3.2, scale=180:-1, setpts=PTS-STARTPTS [movie];
  3892. [in] setpts=PTS-STARTPTS, [movie] overlay=16:16 [out]
  3893. @end example
  3894. @item
  3895. Read from a video4linux2 device, and overlay it on top of the input
  3896. labelled as "in":
  3897. @example
  3898. movie=/dev/video0:f=video4linux2, scale=180:-1, setpts=PTS-STARTPTS [movie];
  3899. [in] setpts=PTS-STARTPTS, [movie] overlay=16:16 [out]
  3900. @end example
  3901. @item
  3902. Read the first video stream and the audio stream with id 0x81 from
  3903. dvd.vob; the video is connected to the pad named "video" and the audio is
  3904. connected to the pad named "audio":
  3905. @example
  3906. movie=dvd.vob:s=v:0+#0x81 [video] [audio]
  3907. @end example
  3908. @end itemize
  3909. @c man end MULTIMEDIA SOURCES