1 /* 2 * This file is part of gtkD. 3 * 4 * gtkD is free software; you can redistribute it and/or modify 5 * it under the terms of the GNU Lesser General Public License 6 * as published by the Free Software Foundation; either version 3 7 * of the License, or (at your option) any later version, with 8 * some exceptions, please read the COPYING file. 9 * 10 * gtkD is distributed in the hope that it will be useful, 11 * but WITHOUT ANY WARRANTY; without even the implied warranty of 12 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the 13 * GNU Lesser General Public License for more details. 14 * 15 * You should have received a copy of the GNU Lesser General Public License 16 * along with gtkD; if not, write to the Free Software 17 * Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110, USA 18 */ 19 20 // generated automatically - do not change 21 // find conversion definition on APILookup.txt 22 // implement new conversion functionalities on the wrap.utils pakage 23 24 25 module gst.base.BaseParse; 26 27 private import gst.base.BaseParseFrame; 28 private import gst.base.c.functions; 29 public import gst.base.c.types; 30 private import gstreamer.Element; 31 private import gstreamer.TagList; 32 33 34 /** 35 * This base class is for parser elements that process data and splits it 36 * into separate audio/video/whatever frames. 37 * 38 * It provides for: 39 * 40 * * provides one sink pad and one source pad 41 * * handles state changes 42 * * can operate in pull mode or push mode 43 * * handles seeking in both modes 44 * * handles events (SEGMENT/EOS/FLUSH) 45 * * handles queries (POSITION/DURATION/SEEKING/FORMAT/CONVERT) 46 * * handles flushing 47 * 48 * The purpose of this base class is to provide the basic functionality of 49 * a parser and share a lot of rather complex code. 50 * 51 * # Description of the parsing mechanism: 52 * 53 * ## Set-up phase 54 * 55 * * #GstBaseParse calls #GstBaseParseClass.start() to inform subclass 56 * that data processing is about to start now. 57 * 58 * * #GstBaseParse class calls #GstBaseParseClass.set_sink_caps() to 59 * inform the subclass about incoming sinkpad caps. Subclass could 60 * already set the srcpad caps accordingly, but this might be delayed 61 * until calling gst_base_parse_finish_frame() with a non-queued frame. 62 * 63 * * At least at this point subclass needs to tell the #GstBaseParse class 64 * how big data chunks it wants to receive (minimum frame size ). It can 65 * do this with gst_base_parse_set_min_frame_size(). 66 * 67 * * #GstBaseParse class sets up appropriate data passing mode (pull/push) 68 * and starts to process the data. 69 * 70 * ## Parsing phase 71 * 72 * * #GstBaseParse gathers at least min_frame_size bytes of data either 73 * by pulling it from upstream or collecting buffers in an internal 74 * #GstAdapter. 75 * 76 * * A buffer of (at least) min_frame_size bytes is passed to subclass 77 * with #GstBaseParseClass.handle_frame(). Subclass checks the contents 78 * and can optionally return #GST_FLOW_OK along with an amount of data 79 * to be skipped to find a valid frame (which will result in a 80 * subsequent DISCONT). If, otherwise, the buffer does not hold a 81 * complete frame, #GstBaseParseClass.handle_frame() can merely return 82 * and will be called again when additional data is available. In push 83 * mode this amounts to an additional input buffer (thus minimal 84 * additional latency), in pull mode this amounts to some arbitrary 85 * reasonable buffer size increase. 86 * 87 * Of course, gst_base_parse_set_min_frame_size() could also be used if 88 * a very specific known amount of additional data is required. If, 89 * however, the buffer holds a complete valid frame, it can pass the 90 * size of this frame to gst_base_parse_finish_frame(). 91 * 92 * If acting as a converter, it can also merely indicate consumed input 93 * data while simultaneously providing custom output data. Note that 94 * baseclass performs some processing (such as tracking overall consumed 95 * data rate versus duration) for each finished frame, but other state 96 * is only updated upon each call to #GstBaseParseClass.handle_frame() 97 * (such as tracking upstream input timestamp). 98 * 99 * Subclass is also responsible for setting the buffer metadata 100 * (e.g. buffer timestamp and duration, or keyframe if applicable). 101 * (although the latter can also be done by #GstBaseParse if it is 102 * appropriately configured, see below). Frame is provided with 103 * timestamp derived from upstream (as much as generally possible), 104 * duration obtained from configuration (see below), and offset 105 * if meaningful (in pull mode). 106 * 107 * Note that #GstBaseParseClass.handle_frame() might receive any small 108 * amount of input data when leftover data is being drained (e.g. at 109 * EOS). 110 * 111 * * As part of finish frame processing, just prior to actually pushing 112 * the buffer in question, it is passed to 113 * #GstBaseParseClass.pre_push_frame() which gives subclass yet one last 114 * chance to examine buffer metadata, or to send some custom (tag) 115 * events, or to perform custom (segment) filtering. 116 * 117 * * During the parsing process #GstBaseParseClass will handle both srcpad 118 * and sinkpad events. They will be passed to subclass if 119 * #GstBaseParseClass.event() or #GstBaseParseClass.src_event() 120 * implementations have been provided. 121 * 122 * ## Shutdown phase 123 * 124 * * #GstBaseParse class calls #GstBaseParseClass.stop() to inform the 125 * subclass that data parsing will be stopped. 126 * 127 * Subclass is responsible for providing pad template caps for source and 128 * sink pads. The pads need to be named "sink" and "src". It also needs to 129 * set the fixed caps on srcpad, when the format is ensured (e.g. when 130 * base class calls subclass' #GstBaseParseClass.set_sink_caps() function). 131 * 132 * This base class uses %GST_FORMAT_DEFAULT as a meaning of frames. So, 133 * subclass conversion routine needs to know that conversion from 134 * %GST_FORMAT_TIME to %GST_FORMAT_DEFAULT must return the 135 * frame number that can be found from the given byte position. 136 * 137 * #GstBaseParse uses subclasses conversion methods also for seeking (or 138 * otherwise uses its own default one, see also below). 139 * 140 * Subclass @start and @stop functions will be called to inform the beginning 141 * and end of data processing. 142 * 143 * Things that subclass need to take care of: 144 * 145 * * Provide pad templates 146 * * Fixate the source pad caps when appropriate 147 * * Inform base class how big data chunks should be retrieved. This is 148 * done with gst_base_parse_set_min_frame_size() function. 149 * * Examine data chunks passed to subclass with 150 * #GstBaseParseClass.handle_frame() and pass proper frame(s) to 151 * gst_base_parse_finish_frame(), and setting src pad caps and timestamps 152 * on frame. 153 * * Provide conversion functions 154 * * Update the duration information with gst_base_parse_set_duration() 155 * * Optionally passthrough using gst_base_parse_set_passthrough() 156 * * Configure various baseparse parameters using 157 * gst_base_parse_set_average_bitrate(), gst_base_parse_set_syncable() 158 * and gst_base_parse_set_frame_rate(). 159 * 160 * * In particular, if subclass is unable to determine a duration, but 161 * parsing (or specs) yields a frames per seconds rate, then this can be 162 * provided to #GstBaseParse to enable it to cater for buffer time 163 * metadata (which will be taken from upstream as much as 164 * possible). Internally keeping track of frame durations and respective 165 * sizes that have been pushed provides #GstBaseParse with an estimated 166 * bitrate. A default #GstBaseParseClass.convert() (used if not 167 * overridden) will then use these rates to perform obvious conversions. 168 * These rates are also used to update (estimated) duration at regular 169 * frame intervals. 170 */ 171 public class BaseParse : Element 172 { 173 /** the main Gtk struct */ 174 protected GstBaseParse* gstBaseParse; 175 176 /** Get the main Gtk struct */ 177 public GstBaseParse* getBaseParseStruct(bool transferOwnership = false) 178 { 179 if (transferOwnership) 180 ownedRef = false; 181 return gstBaseParse; 182 } 183 184 /** the main Gtk struct as a void* */ 185 protected override void* getStruct() 186 { 187 return cast(void*)gstBaseParse; 188 } 189 190 /** 191 * Sets our main struct and passes it to the parent class. 192 */ 193 public this (GstBaseParse* gstBaseParse, bool ownedRef = false) 194 { 195 this.gstBaseParse = gstBaseParse; 196 super(cast(GstElement*)gstBaseParse, ownedRef); 197 } 198 199 200 /** */ 201 public static GType getType() 202 { 203 return gst_base_parse_get_type(); 204 } 205 206 /** 207 * Adds an entry to the index associating @offset to @ts. It is recommended 208 * to only add keyframe entries. @force allows to bypass checks, such as 209 * whether the stream is (upstream) seekable, another entry is already "close" 210 * to the new entry, etc. 211 * 212 * Params: 213 * offset = offset of entry 214 * ts = timestamp associated with offset 215 * key = whether entry refers to keyframe 216 * force = add entry disregarding sanity checks 217 * 218 * Returns: #gboolean indicating whether entry was added 219 */ 220 public bool addIndexEntry(ulong offset, GstClockTime ts, bool key, bool force) 221 { 222 return gst_base_parse_add_index_entry(gstBaseParse, offset, ts, key, force) != 0; 223 } 224 225 /** 226 * Default implementation of #GstBaseParseClass.convert(). 227 * 228 * Params: 229 * srcFormat = #GstFormat describing the source format. 230 * srcValue = Source value to be converted. 231 * destFormat = #GstFormat defining the converted format. 232 * destValue = Pointer where the conversion result will be put. 233 * 234 * Returns: %TRUE if conversion was successful. 235 */ 236 public bool convertDefault(GstFormat srcFormat, long srcValue, GstFormat destFormat, out long destValue) 237 { 238 return gst_base_parse_convert_default(gstBaseParse, srcFormat, srcValue, destFormat, &destValue) != 0; 239 } 240 241 /** 242 * Drains the adapter until it is empty. It decreases the min_frame_size to 243 * match the current adapter size and calls chain method until the adapter 244 * is emptied or chain returns with error. 245 * 246 * Since: 1.12 247 */ 248 public void drain() 249 { 250 gst_base_parse_drain(gstBaseParse); 251 } 252 253 /** 254 * Collects parsed data and pushes this downstream. 255 * Source pad caps must be set when this is called. 256 * 257 * If @frame's out_buffer is set, that will be used as subsequent frame data. 258 * Otherwise, @size samples will be taken from the input and used for output, 259 * and the output's metadata (timestamps etc) will be taken as (optionally) 260 * set by the subclass on @frame's (input) buffer (which is otherwise 261 * ignored for any but the above purpose/information). 262 * 263 * Note that the latter buffer is invalidated by this call, whereas the 264 * caller retains ownership of @frame. 265 * 266 * Params: 267 * frame = a #GstBaseParseFrame 268 * size = consumed input data represented by frame 269 * 270 * Returns: a #GstFlowReturn that should be escalated to caller (of caller) 271 */ 272 public GstFlowReturn finishFrame(BaseParseFrame frame, int size) 273 { 274 return gst_base_parse_finish_frame(gstBaseParse, (frame is null) ? null : frame.getBaseParseFrameStruct(), size); 275 } 276 277 /** 278 * Sets the parser subclass's tags and how they should be merged with any 279 * upstream stream tags. This will override any tags previously-set 280 * with gst_base_parse_merge_tags(). 281 * 282 * Note that this is provided for convenience, and the subclass is 283 * not required to use this and can still do tag handling on its own. 284 * 285 * Params: 286 * tags = a #GstTagList to merge, or NULL to unset 287 * previously-set tags 288 * mode = the #GstTagMergeMode to use, usually #GST_TAG_MERGE_REPLACE 289 * 290 * Since: 1.6 291 */ 292 public void mergeTags(TagList tags, GstTagMergeMode mode) 293 { 294 gst_base_parse_merge_tags(gstBaseParse, (tags is null) ? null : tags.getTagListStruct(), mode); 295 } 296 297 /** 298 * Pushes the frame's buffer downstream, sends any pending events and 299 * does some timestamp and segment handling. Takes ownership of 300 * frame's buffer, though caller retains ownership of @frame. 301 * 302 * This must be called with sinkpad STREAM_LOCK held. 303 * 304 * Params: 305 * frame = a #GstBaseParseFrame 306 * 307 * Returns: #GstFlowReturn 308 */ 309 public GstFlowReturn pushFrame(BaseParseFrame frame) 310 { 311 return gst_base_parse_push_frame(gstBaseParse, (frame is null) ? null : frame.getBaseParseFrameStruct()); 312 } 313 314 /** 315 * Optionally sets the average bitrate detected in media (if non-zero), 316 * e.g. based on metadata, as it will be posted to the application. 317 * 318 * By default, announced average bitrate is estimated. The average bitrate 319 * is used to estimate the total duration of the stream and to estimate 320 * a seek position, if there's no index and the format is syncable 321 * (see gst_base_parse_set_syncable()). 322 * 323 * Params: 324 * bitrate = average bitrate in bits/second 325 */ 326 public void setAverageBitrate(uint bitrate) 327 { 328 gst_base_parse_set_average_bitrate(gstBaseParse, bitrate); 329 } 330 331 /** 332 * Sets the duration of the currently playing media. Subclass can use this 333 * when it is able to determine duration and/or notices a change in the media 334 * duration. Alternatively, if @interval is non-zero (default), then stream 335 * duration is determined based on estimated bitrate, and updated every @interval 336 * frames. 337 * 338 * Params: 339 * fmt = #GstFormat. 340 * duration = duration value. 341 * interval = how often to update the duration estimate based on bitrate, or 0. 342 */ 343 public void setDuration(GstFormat fmt, long duration, int interval) 344 { 345 gst_base_parse_set_duration(gstBaseParse, fmt, duration, interval); 346 } 347 348 /** 349 * If frames per second is configured, parser can take care of buffer duration 350 * and timestamping. When performing segment clipping, or seeking to a specific 351 * location, a corresponding decoder might need an initial @lead_in and a 352 * following @lead_out number of frames to ensure the desired segment is 353 * entirely filled upon decoding. 354 * 355 * Params: 356 * fpsNum = frames per second (numerator). 357 * fpsDen = frames per second (denominator). 358 * leadIn = frames needed before a segment for subsequent decode 359 * leadOut = frames needed after a segment 360 */ 361 public void setFrameRate(uint fpsNum, uint fpsDen, uint leadIn, uint leadOut) 362 { 363 gst_base_parse_set_frame_rate(gstBaseParse, fpsNum, fpsDen, leadIn, leadOut); 364 } 365 366 /** 367 * Set if frames carry timing information which the subclass can (generally) 368 * parse and provide. In particular, intrinsic (rather than estimated) time 369 * can be obtained following a seek. 370 * 371 * Params: 372 * hasTiming = whether frames carry timing information 373 */ 374 public void setHasTimingInfo(bool hasTiming) 375 { 376 gst_base_parse_set_has_timing_info(gstBaseParse, hasTiming); 377 } 378 379 /** 380 * By default, the base class might try to infer PTS from DTS and vice 381 * versa. While this is generally correct for audio data, it may not 382 * be otherwise. Sub-classes implementing such formats should disable 383 * timestamp inferring. 384 * 385 * Params: 386 * inferTs = %TRUE if parser should infer DTS/PTS from each other 387 */ 388 public void setInferTs(bool inferTs) 389 { 390 gst_base_parse_set_infer_ts(gstBaseParse, inferTs); 391 } 392 393 /** 394 * Sets the minimum and maximum (which may likely be equal) latency introduced 395 * by the parsing process. If there is such a latency, which depends on the 396 * particular parsing of the format, it typically corresponds to 1 frame duration. 397 * 398 * Params: 399 * minLatency = minimum parse latency 400 * maxLatency = maximum parse latency 401 */ 402 public void setLatency(GstClockTime minLatency, GstClockTime maxLatency) 403 { 404 gst_base_parse_set_latency(gstBaseParse, minLatency, maxLatency); 405 } 406 407 /** 408 * Subclass can use this function to tell the base class that it needs to 409 * be given buffers of at least @min_size bytes. 410 * 411 * Params: 412 * minSize = Minimum size in bytes of the data that this base class should 413 * give to subclass. 414 */ 415 public void setMinFrameSize(uint minSize) 416 { 417 gst_base_parse_set_min_frame_size(gstBaseParse, minSize); 418 } 419 420 /** 421 * Set if the nature of the format or configuration does not allow (much) 422 * parsing, and the parser should operate in passthrough mode (which only 423 * applies when operating in push mode). That is, incoming buffers are 424 * pushed through unmodified, i.e. no #GstBaseParseClass.handle_frame() 425 * will be invoked, but #GstBaseParseClass.pre_push_frame() will still be 426 * invoked, so subclass can perform as much or as little is appropriate for 427 * passthrough semantics in #GstBaseParseClass.pre_push_frame(). 428 * 429 * Params: 430 * passthrough = %TRUE if parser should run in passthrough mode 431 */ 432 public void setPassthrough(bool passthrough) 433 { 434 gst_base_parse_set_passthrough(gstBaseParse, passthrough); 435 } 436 437 /** 438 * By default, the base class will guess PTS timestamps using a simple 439 * interpolation (previous timestamp + duration), which is incorrect for 440 * data streams with reordering, where PTS can go backward. Sub-classes 441 * implementing such formats should disable PTS interpolation. 442 * 443 * Params: 444 * ptsInterpolate = %TRUE if parser should interpolate PTS timestamps 445 */ 446 public void setPtsInterpolation(bool ptsInterpolate) 447 { 448 gst_base_parse_set_pts_interpolation(gstBaseParse, ptsInterpolate); 449 } 450 451 /** 452 * Set if frame starts can be identified. This is set by default and 453 * determines whether seeking based on bitrate averages 454 * is possible for a format/stream. 455 * 456 * Params: 457 * syncable = set if frame starts can be identified 458 */ 459 public void setSyncable(bool syncable) 460 { 461 gst_base_parse_set_syncable(gstBaseParse, syncable); 462 } 463 464 /** 465 * This function should only be called from a @handle_frame implementation. 466 * 467 * #GstBaseParse creates initial timestamps for frames by using the last 468 * timestamp seen in the stream before the frame starts. In certain 469 * cases, the correct timestamps will occur in the stream after the 470 * start of the frame, but before the start of the actual picture data. 471 * This function can be used to set the timestamps based on the offset 472 * into the frame data that the picture starts. 473 * 474 * Params: 475 * offset = offset into current buffer 476 * 477 * Since: 1.2 478 */ 479 public void setTsAtOffset(size_t offset) 480 { 481 gst_base_parse_set_ts_at_offset(gstBaseParse, offset); 482 } 483 }