media

sjs/media

mirror of https://github.com/samsonjs/media.git synced 2026-04-18 13:25:47 +00:00

Author	SHA1	Message	Date
ibaker	d356d88c4f	Improve test dump output for role and selection flags PiperOrigin-RevId: 589878576	2023-12-11 11:05:40 -08:00
tofunmi	cd346ca14d	Transformer: Add support for transmuxing audio in trim optimization PiperOrigin-RevId: 588711597	2023-12-07 02:18:21 -08:00
tofunmi	2d77e4d22c	Implement trim optimization in Transformer PiperOrigin-RevId: 584622392	2023-11-22 07:28:34 -08:00
andrewlewis	7b9aa87344	Allow allocating more buffers when transmuxing When transmuxing, the `EncodedSampleExporter` maintains a queue of input buffers that get filled with encoded data by the asset loader. The number of buffers was limited to avoid using more and more memory if producer (asset loader) gets far ahead of the consumer (exporter). Previously this limit was fixed at 10 buffers, but increasing the number of buffers can make some transmux operations much faster. Allow allocating between a min and max number of buffers, and also set a target allocation size beyond which new buffers can't be allocated. This allows audio formats which require many small buffers to be processed more quickly, while preventing allocating too much memory for hypothetical very high bitrate formats. 'Remove video' edits on local videos in particular get much faster, because audio buffers are very short and there are lots of them. With a sample 10 minute video, a 'remove video' edit took 2 seconds (36 seconds before this change). With a sample 1 minute removing video took 0.25 seconds after this change (2.5 seconds before). The speed improvement is smaller for other types of edits that retain the video track. Transmuxing a 10 minute video retaining the video track took 26 seconds (40 seconds before). PiperOrigin-RevId: 583390284	2023-11-17 08:11:25 -08:00
samrobinson	d5fbf0007b	Migrate to Util.durationUsToSampleCount in transformer audio. PiperOrigin-RevId: 582700443	2023-11-15 09:15:10 -08:00
microkatz	716240776e	Updated golden file tests and readded constructor	2023-10-17 11:42:33 +00:00
microkatz	e28288f9c5	Updated golden file tests and removed deprecated ColorInfo constructor usage	2023-10-17 11:42:33 +00:00
michaelkatz	c12fb67468	Update dumpfile tests to only print ColorInfo fields that are set PiperOrigin-RevId: 568789489	2023-09-27 02:34:29 -07:00
samrobinson	866d62dd34	Improve CompositionExportTest assertions by using dump files. PiperOrigin-RevId: 563708666	2023-09-08 04:09:21 -07:00
samrobinson	763dddfbd4	Dump with C.TrackType as key, rather than muxer track index. Modifying dumping to not required "released" to be called. Track index is an arbitrary value based on the order of addTrack calls. Samples are dumped by track (rather than as soon as they are written), so it's preferable to use a value that provides more context. By using the track type as a key, dump files will be more deterministic and will have more similarities when branched. PiperOrigin-RevId: 563700982	2023-09-08 03:25:13 -07:00
samrobinson	cff2816da4	Add silence generation parameterized test case. PiperOrigin-RevId: 563098931	2023-09-06 07:34:32 -07:00
samrobinson	92814b84a8	Add alternate MP4 asset with PCM audio. Audio is Mono 44.1kHz. Created using: `ffmpeg -i <input> -c:v copy -c:a pcm_s16be <output>` PiperOrigin-RevId: 563079553	2023-09-06 05:58:27 -07:00
samrobinson	ef45c0fe5d	Add parameterized dump tests for single item exports. PiperOrigin-RevId: 563075771	2023-09-06 05:35:51 -07:00
samrobinson	ff39726368	Remove redundant "aac" tag for dump file. PiperOrigin-RevId: 559147235	2023-08-24 09:17:32 +01:00
samrobinson	ae7667783c	Split dump file directories based on input file name. Remove old unused dump files. PiperOrigin-RevId: 558820926	2023-08-22 15:39:28 +01:00
samrobinson	2db6f0aee7	Ensure audio components check incoming data is valid. Default PCM encoding is only set for decoders outputting raw. Tests migrated to abide by tighter restrictions. PiperOrigin-RevId: 558129452	2023-08-18 15:33:44 +01:00
Googler	4fe55b8b63	Return the correct output buffer from audio processing pipeline PiperOrigin-RevId: 554851456	2023-08-10 12:07:15 +00:00
samrobinson	54093a152e	Integrate AudioMixer for audio export. Adds support for compositions with multiple audio sequences. PiperOrigin-RevId: 550880626	2023-08-01 14:03:22 +01:00
samrobinson	357c458028	Ensure EOS is queued after processing generated silence with effects. When generating silence for AudioProcessingPipeline, audio never queued EOS downstream. Linked to this, when silence followed an item with audio, the silence was added to SilentAudioGenerator before the mediaItem reconfiguration occurred. If the silence had effects, the APP would be flushed after silence queued EOS, resetting APP.isEnded back to false, so AudioGraph never ended. Regression tests reproduce failure without fix, but pass with it. PiperOrigin-RevId: 550853714	2023-08-01 14:02:09 +01:00
samrobinson	847cc9b841	Use asset with encoded video & raw audio for Robolectric test. Test requires file to have video track for forcing silence. PiperOrigin-RevId: 547839076	2023-07-14 10:19:29 +01:00
samrobinson	f60f79bb10	Handle media item (Effects/Format) changes in AudioSamplePipeline. On a MediaItem change, the input Format (and Effects to apply) may be different. Therefore the AudioProcessingPipeline must be reconfigured to determine what processing is active, and what the AudioFormat of the data output is. In the event that it is different, additional AudioProcessor instances must be used to ensure the encoder will still be able to accept the audio buffers. PiperOrigin-RevId: 544338451	2023-06-29 23:14:10 +00:00
samrobinson	0d67733d28	Update media in silence concatentation test to match silent format. Goal of tests (SequenceExportTest) that use this media is for the silence and the media to match exactly with audio format, however `sample_with_increasing_timestamps.mp4` had a different sample rate. testvid_1022ms.mp4: channel count = 2, sample rate = 44100. PiperOrigin-RevId: 543458948	2023-06-29 22:52:32 +00:00
samrobinson	b46b6a8278	Use stereo audio in silence -> audio SequenceExportTest. With the upcoming "handle format changes" CL, stereo -> mono audio would add an AudioProcessor. Robolectric decodes output encoded data, which crashes some AudioProcessors because the number of frames may not be an integer. PiperOrigin-RevId: 542568875	2023-06-23 16:40:57 +00:00
sheenachhabra	911a6430f3	Remove UdtaInfo class The class seems unnecessary and the code can be simplified by removing this. PiperOrigin-RevId: 541675378	2023-06-20 14:00:50 +01:00
samrobinson	6d648f8bdb	Add dump tests for concatenating 2 audio items. Audio only tests are now using RAW audio where possible, which is passed through the Robolectric decoders/encoders, and can be handled by the AudioProcessor instances accurately. PiperOrigin-RevId: 541648853	2023-06-20 13:57:33 +01:00
sheenachhabra	c0e8513b7a	Make dump files deterministic Issue: When running the Transformer related test cases, the tests are flaky because the order in which audio and video samples are interleaved seems to differ in few instances. Root cause: When running a transformation the sample producer (Asset loader) and sample consumer (Sample pipeline) both runs on different thread and theoretically there is no reason for behaviour to be deterministic because the number of samples produced/written depends on how fast individual thread works. So it is indeed surprising that test somehow worked deterministically in majority of instances (may be something to do with Robolectric environment). Solution: Since we don't expect the order of sample interleaving to be deterministic, make the dumping logic deterministic where all the video samples will be collected and then dumped together (similarly for audio). This would mean we won't be able to see the interleaving so for that we need to add separate test case verifying the interleaving logic only. Pending: Test case for interleaving. PiperOrigin-RevId: 540930871	2023-06-19 16:28:07 +01:00
samrobinson	1236d37acb	Improve SequenceExportTest test and dump file naming. No-op change to highlight when video and/or audio are transmuxed and reorder the methods. PiperOrigin-RevId: 540567375	2023-06-19 16:14:38 +01:00
sheenachhabra	d0eda433ea	Replace CreationTime class with Mp4TimestampData class PiperOrigin-RevId: 540257484	2023-06-14 20:42:36 +01:00
sheenachhabra	53c174f047	Add support for passing custom metadata via transformer Changes included: 1. Enable MP4 extractor to read all types of metadata. 2. Allow passing String and Float metadata via Transformer. Reference to QuickTime spec: https://developer.apple.com/library/archive/documentation/QuickTime/QTFF/Metadata/Metadata.html#//apple_ref/doc/uid/TP40000939-CH1-SW21 PiperOrigin-RevId: 538783982	2023-06-09 13:51:15 +00:00
sheenachhabra	7e14811e25	Add support for passing creation time via InAppMuxer PiperOrigin-RevId: 538175466	2023-06-06 18:12:51 +00:00
kimvde	f4d1a6c453	Transmux video if rotation is only effect applied PiperOrigin-RevId: 535554628	2023-05-26 15:14:52 +00:00
sheenachhabra	a944ffecb9	Add support for adding capture FPS via transformer PiperOrigin-RevId: 534814892	2023-05-24 16:11:27 +01:00
sheenachhabra	7c477589e5	Add support for updating Metadata entries via InAppMuxer Mp4Muxer already supports writing Mp4LocationData so added that as supported Metadata entry. Support for more Metadata entries will be added in upcoming CLs. PiperOrigin-RevId: 534473866	2023-05-24 16:02:48 +01:00
sheenachhabra	b11dd106ae	Replace MediaFormat with Format class in muxer module PiperOrigin-RevId: 526655859	2023-04-26 15:40:40 +01:00
andrewlewis	d43fe3470f	Fix audio encode timestamp off by one Simplify the audio encoder input timestamp calculation. The new calculation avoids drifting by tracking the total number of bytes encoded rather than tracking the timestamp and remainder separately, and also makes the timestamps match the decoder output buffer timestamps. Also switch one of the export tests that was passing through AMR samples over to using WAVE audio. The problem with using AMR is that the compressed samples are not necessarily an integer number of audio frames and the shadow decoder would pass them from input to output, so the audio encoder was receiving non-integer numbers of audio frames. Tested by logging the timestamps at the decoder output and encoder input with forcing transcoding audio, and verifying that after this change the audio timestamps are no longer off by one. PiperOrigin-RevId: 523409869	2023-04-12 16:50:25 +01:00
kimvde	0bfe43866a	Add test for clipped media items concatenation This was broken and has been fixed in <unknown commit>. PiperOrigin-RevId: 521380415	2023-04-05 15:40:09 +01:00
sheenachhabra	69cece1d82	Make getMp4LocationData method inline PiperOrigin-RevId: 518827223	2023-03-30 17:04:46 +00:00
sheenachhabra	669008f437	Use FakeExtractorOutput in Mp4MuxerMetadataTest The FakeExtractorOutput dumps data in more readable format so using that where ever possible. The MdtaMetadataEntry which contains key and value dumped only "key". Added fix to dump "value" as well. PiperOrigin-RevId: 517968996	2023-03-21 14:19:12 +00:00
sheenachhabra	7b8c562d7b	Remove setLocation() method from Muxer interface PiperOrigin-RevId: 516314175	2023-03-14 07:56:52 +00:00
sheenachhabra	2f01f9c53b	Add support for reading location data in MP4 extractor The geodata is stored in the "udta" box as per MediaMuxer reference https://cs.android.com/android/platform/superproject/+/master:frameworks/av/media/libstagefright/MPEG4Writer.cpp;drc=master;l=5588 PiperOrigin-RevId: 515095127	2023-03-14 07:46:28 +00:00
samrobinson	3264cb8271	Pass Metadata to Muxer when adding a track. PiperOrigin-RevId: 514575400	2023-03-07 11:56:53 +00:00
kimvde	0f8b67b875	Make onOutputFormat nullable - This is to make sure we know about all the tracks before initializing the SamplePipelines. This allows to set the muxer and the fallback listener track count before the SamplePipelines are built. - As a result, the test files had to be updated because the order in which the tracks are written has changed. - The ImageAssetLoader also had to be updated to call onOutputFormat repeatedly until it returns a non-null SampleConsumer. - Also fix the trackCount sent to the muxer and fallback listener. The correct track count can be computed now that we know about all the tracks before building the SamplePipelines. PiperOrigin-RevId: 514426123	2023-03-07 11:52:52 +00:00
kimvde	5c54a7dffb	Add unit tests for audio track dis(appearing) during export PiperOrigin-RevId: 511764841	2023-02-27 18:28:14 +00:00
kimvde	16db2bd0a1	Add an API entry point to pass a Composition PiperOrigin-RevId: 508031337	2023-02-08 14:11:13 +00:00
samrobinson	dfa98ae791	Generate silent audio if no audio track is available. To always generate silent audio, `removeAudio(true)` can be used in conjunction. PiperOrigin-RevId: 502814315	2023-01-18 12:03:59 +00:00
kimvde	63b8cae263	Add a queue at the start of the buffer SamplePipelines This improves performance and makes the code more intuitive. PiperOrigin-RevId: 501220234	2023-01-17 01:48:51 +00:00
kimvde	98bc817fe7	Fix sample interleaving Whether to write a sample or not was based on the timestamp of the previous sample, rather than the current sample. PiperOrigin-RevId: 500195279	2023-01-10 18:35:11 +00:00
andrewlewis	beee4732fb	Generate complete silent audio frames `SilentAudioGenerator` could output a fractional audio frame, and this could cause downstream components to throw because of trying to read a complete audio frame but only seeing a partial one. Calculate the output buffer size based on the frame size (which is a no-op for stereo 16-bit audio) and calculate a total number of frames to output then multiple by the frame size. PiperOrigin-RevId: 494992941	2022-12-15 15:58:50 +00:00
kimvde	85c48c481e	Make sure that the sample pipeline data is processed regularly. This is necessary to move video decoding to the AssetLoader. Otherwise, if the decoder max pending frame count is reached, the AssetLoader will stop queuing frames to the pipeline, and process data will not be called anymore. PiperOrigin-RevId: 492392621	2022-12-12 11:01:36 +00:00
samrobinson	e5727a2cc7	Add an E2E test for changing sample rate with AudioProcessor. PiperOrigin-RevId: 492160193	2022-12-12 10:54:06 +00:00

1 2

66 commits