Smart profiles for capturing and publishing audio and video streams

Smart profiles for capturing and publishing audio and video streams
US007409145B2
(12)
(54)
United States Patent
(10) Patent N0.:
Antoun et al.
(45) Date of Patent:
SMART PROFILES FOR CAPTURING AND
PUBLISHING AUDIO AND VIDEO STREAMS
2002/0051493 A1 *
2005/0165779 A1 *
5/2002 Shin et a1. ............ .. 375/240.l1
7/2005 Kaiser et al. ................. .. 707/6
Olivier Colle, Redmond, WA (Us);
Ripley, “DVIiA Digital Multimedia Technology,” Communica
Gareth Alan Howell’ Kirkland’ WA
tions ofthe ACM, Jul. 1989, vol. 32, No. 7.
(Us)
Mukhopadhyay et al., “Passive Capture and Structuring of Lectures,”
Department of Computer Science, 1999, New York, U.S.A.
_
.
.
Myers et al., “A Multi-VieW Intelligent Editor for Digital Video
(73) Asslgnee' hglscrosoft corporatlon’ Redmond’ WA
(
_
Libraries,” Human Computer Interaction Institute, 2001, Pennsylva
)
nia, U.S.A.
_
( * ) Not1ce:
Aug. 5, 2008
OTHER PUBLICATIONS
(75) Inventors: Tony M. Antoun, Renton, WA (US);
-
US 7,409,145 B2
_
_
_
Finkel stein et al., “Multiresolution Video,” Department of Computer
Subject' to any d1scla1mer,~ the term of this
Science and Engineering, 1996, Washington, USA‘
patent is extended or adjusted under 35
_
U.S.C. 154(b) by 1430 days.
_
* clted by examlner
_
Primary ExamineriRobert Chevalier
(21) Appl' NO" 10/335’631
(74) Attorney, Agent, or FirmiSenniger Powers LLP
(22) F1led:
(57)
Jan. 2, 2003
(65)
Prior Publication Data
US 2004/0131340 A1
ABSTRACT
A method and system for capturing video and/or audio
Jul, 8, 2004
streams and for publishing consolidated video and/ or consoli
dated audio streams. A capture pro?le specifying parameters
51
Int. CI.
H04N 5/00
(2006.01)
for caPturing the video stream and/ or audio stream is selected
from a list of prede?ned pro?les as a function of data con
H04N 7/00
(2006.01)
tained in the video and/or audio stream. A publish pro?le
(52)
U.S. Cl. ..................................... .. 386/125; 386/ 105
specifying parameters for publishing the video stream and/or
(58)
Field of Classi?cation Search ............... .. 386/125,
audio Stream is Selected from a list Ofprede?ned Pro?les as 21
386/124’ 46’ 117, 107, 455 104, 105
See application ?le for Complete Search history
function of data contained in the consolidated video and/or
consolidated audio stream. Moreover, the publish pro?le is
_
References Clted
(56)
selected to provide the best quality video and/ or audio during
playback, or to provide the best ?r for a maximum ?le Size
designated by the user.
US. PATENT DOCUMENTS
4,974,178 A *
11/1990
79 Claims, 12 Drawing Sheets
IZeki etal. ................ .. 715/202
ANALYZE FRAMES ON TIMELINE TO
DETERMINE FORMAT (8.9.. NTSC. PAL)
MAX FILE
SIZE DESIRED
NO
705
EXECUTE BQPR
706
~ ANALVZE FRAMES ON TIMELINE TO DETERMINE MAX
BIT RATE (TBR) AND MAX RESOLUTION (TR)
f 707
FROM A PROFILE DATABASE IDENTIFY A FIRST LIST OF PROFILES
SPECIFYING A FIRST RESOLUTION R1 IDENTIFY A SECOND LIST OF PROFILES
SPECIFYING A SECOND RESOLUTION R2 SUCH THAT R1 > TR > R2
712
7101
FSELECT SECOND
YES
Z
LIST OF PROFILES
0
SELECT FIRST LIST
OF PROFILES
FROM SELECTED LIST OF PROFILES IDENTIFY A FIRST PROFILE
SPECIFYING A FIRST BIT RATE BR1 IDENTIFY A SECOND PROFILE
SPECIFYING A SECOND BIT RATE BR2 SUCH THAT BR1 > TBR > 5R2
A
713
SELECT SECOND
PROFILE
i‘ YES
)
SELECT FIRST
PROFILE
716/
71B
ADJUST an RATE m PROFILE TO MATCH ‘rER
SUGGEST PUBLISHING CONSOLIDATED VIDEO STREAM
USING SELECTED PROFILE WITH ADJUSTED BIT RATE.
I
I
US. Patent
Aug. 5, 2008
Sheet 1 0f 12
“2
US 7,409,145 B2
COMPUTER
\
COMPUTER READABLE MEDIUM
124x
126
CONSOLIDATED vIDEO
FILES
120
\_§
F|LE#1
CONSOLIDATED AUDIO
FILES
.. @
F|LE#1
...
+
4
FILE#P
H191
MULTl-MEDIA
EDITING APPLICATION
“NvIDEO FILES I
E
FILE #1
...
/
I
r118
AUDIO FILES
FILE #N
FILE #1
II
.
.
FILE #M
w
II
108
106
CAPTURE TOOL
—
“\
I
I
114
102
104
k
_/
VIDEO/AUDIO
SOURCE
\
110
K128
124\
DESTINATION CRM
[126
CONSOLIDATED vIDEO
CONSOLIDATED AUDIO
FILES
FILES
FILE#1
. . .
F|LE#O
FILE #1
. . .
S
FILE#P
US. Patent
Aug. 5, 2008
Sheet 2 0f 12
US 7,409,145 B2
E3$2E85N.:;
i
A
E
v
i
0
m.5525<95E0556E502E
EmNE2%2m28H26.»
5 01
US. Patent
Aug. 5, 2008
Sheet 3 0f 12
US 7,409,145 B2
FIG. 18
Save Movie Wizard
Movie Setting
Select the setting you want to use to save your movie. The setting
you select determines the quality and file size of your saved movie.
0 Best quality for playback on my computer (recommended)
Q Best fit to file size:
75 3
O Qther settings:
Video for
Show fewer choices... Video for
Video for
Video for
Pocket
Pocket
Pocket
Pocket
MB Y
PC
PC
PC
PC
(218 Kbps)
Y
(218 Kbps)
(143 Kbps)
(Full screen 218 Kbps)
High quality video (large)
High quality video (small)
Video for local playback (2.1 Mbps)
Video for LAN (1.0 Mbps)
Video for LAN (768 Kbps)
Video for broadband (512 Kbps)
Video for broadband (340 Kbps)
Video for broadband (150 Kbps)
Video for ISDN (48 Kbps)
Setting details i
Video for dial-up access (38 Kbps)
File type: Windows DV-AVI (NTSC)
Bit rate: 218 Kbps High quality video (NTSC)
DiSP'aY 92?: 208 X Video for local playback (2.1 Mbps NTSC)
Aspect ratlo: 43
Frames per second
Video for local playback (1.5 Mbps NTSC)
_
Test quallty of 320x240 vs 640x480
< Back
?ext >
Cancel
US. Patent
Aug. 5, 2008
Sheet 4 0f 12
US 7,409,145 B2
204
FIG. 2
,0,
200
\
MULTI-MEDIA EDITING
APPLICATION
PROFILE DATABASE
——-'
CAPTURING ROUTINE
PM
BEST QUALITY
PUBLISHING ROUTINE
BEST FIT
PUBLISHING ROUTINE
US. Patent
Aug. 5, 2008
Sheet 5 0f 12
US 7,409,145 B2
FIG. 3
302
VIDEO PROFILE DATABASE
RESOLUTION FIELD
-
TOTAL BIT RATE FIELD
-—
FRAME RATE FIELD
—~
AUDIO PROFILE FIELD
~—
_______________________________________________________ -
L
_
_
_ _
_ _
_
.
.
_
_
_
.
_
_
_ _ _
_
_ _
_
_
_
_ _ _
_
_
_
_
.
_
. .
_
.
_ -
_
_
_ _
_
_
_ _
_
_
_
_ _
_
_ __
(.0 O CO
um
FIG. 3A
320
\
AUDIO PROFILE DATABASE
BIT RATE FIELD
—-—
SAMPLE RATE FIELD
.J
OUTPUT FILED
——
_______________________________________________________ __
\ _
. _ . .
. . . . _ _ _
_ . . . . . . .
. - -
-
.
. _ .
_
_ _ _
_ _ _ _ _
_ _
_ _ . _ _ _
.
. . . _ . . _ _
__
3
0
N
US. Patent
Aug. 5, 2008
Sheet 6 0f 12
US 7,409,145 B2
402
\
VIDEO CAPTURING ROUTINE
vIDEO FORMAT INSTRUCTIONS
404
-/
406
vIDEO PROPERTY INsTRUCTIONs 4/
408
vIDEO PROFILE INsTRUCTIONs
-/
4 0
vIDEO CAPTURING INSTRUCTIONS -j
420
\
AUDIO CAPTURING ROUTINE
AUDIO PROPERTY INSTRUCTIONS
422
#
424
AUDIO PROFILE INsTRUCTIONs
H
426
AUDIO CAPTURING INsTRUCTIONs J/
US. Patent
Aug. 5, 2008
Sheet 7 0f 12
US 7,409,145 B2
FIG. 4B
Options
General Advanced
*Default durations
A default duration will be assigned to each picture or
transistion when it is added to the storyboard or timeline.
Eicture duration:
Iransition duration:
2
2
5
5
seconds
seconds
* Video properties
The video properties specify how video is captured and
movies are saved.
Video format: @?TSC @PAt
Aspect ratio: @>4:§ @163
Learn more about video settings.
*E-mail
Maximum file size for sending a movie as an attachment in
an e-mail message.
we MB
Restore All Defaults
OK
Cancel
US. Patent
502
Aug. 5, 2008
Sheet 8 0f 12
US 7,409,145 B2
VIDEO BEST QUALITY (VBQ)
PUBLISHING ROUTINE
vBO FORMAT INSTRUCTIONS
504
#436
VBQ PROPERTY INSTRUCTIONS J/
508
VBQ PROFILE IDENTIFICATION
J/
INSTRUCTIONS
VBQ PROFILE SELECTION
510
__,/
INSTRUCTIONS
vBO PUBLICATION INSTRUCTIONS
520
\
512
q__--—/
AUDIO BEST QUALITY (ABQ)
PUBLISHING ROUTINE
ABO PROPERTY INSTRUCTIONS
522
J
5 4
ABC PROFILE IDENTIFICATION
INSTRUCTIONS
ABO PROFILE SELECTION
INSTRUCTIONS
ABQ PUBLICATION INSTRUCTIONS
j
‘’
526
___,/
528
._--—-—/
US. Patent
602
\
Aug. 5,2008
Sheet 9 0f 12
US 7,409,145 B2
VIDEO BEST FIT (VBF) PUBLISHING
ROUTINE
VBF FORMAT INSTRUCTIONS
VBF PROPERTY INSTRUCTIONS
604
—~/
J’
£6
608
VIDEO CALCULATING INSTRUCTIONS J/
VBF PROFILE IDENTIFICATION
J
INSTRUCTIONS
J0
612
VBF PROFILE SELECTION
"2/
INSTRUCTIONS
614
VBF PUBLICATION INSTRUCTIONS
620
\
M/
AUDIO BEST FIT (ABF)PUBLISH|NG
ROUTINE
622
AUDIO CALCULATING INSTRUCTIONS .1
624
ABF PROFILE IDENTIFICATION
d/
INSTRUCTIONS
626
ABF PROFILE SELECTION
H/
INSTRUCTIONS
628
ABF PUBLICATION INSTRUCTIONS
--/
US. Patent
Aug. 5, 2008
Sheet 10 0f 12
US 7,409,145 B2
F ANALYZE FRAMES ON TIMELINE TO
702
I DETERMINE FORMATLag, N‘I'g), LAL)
/
// \(
\
FIG
-
7
/
MAX FILE
SEE
\ vYES
\\IZE DESIRED/
'L‘WFIG. i
\
NO
705
/
EXECUTE BQPR II/
706
ANALYZE FRAMES ON TIMELINE TO DETERMWE MAXT //
BIT RATE (TBR) AND MAX RESOLUTION (TR)
/ 707
FROM A PROFILE DATABASE IDENTIFY A FIRST LIST OF PROFILES
SPECIFYING A FIRST RESOLUTION R1 IDENTIFY A SECOND LIST OF PROFILES
SPECIFYING A SECOND RESOLUTION R2 SUCH THAT R1 > TR > R2
71 *
IL /708
O i\\# "A
I SELECT SECOND
YES
LIST OF PROFILES
T FT’
V
/712
///
TV
///
/
TR < OR =
NO
—\Fg+.1*(R1-R9///
4*
SELECT FIRST LIST
OF PROFILES
\V//
FROIvI SELECTED LIST OF PROFILES IDENTIFY A FTRS? PROFILE
SPECIFYING A FIRST BIT RATE BR1 IDENTIFY A SECOND PROFILE
SPECIFYING A SECOND BIT RATE BR2 SUCH THAT BR1 > TBR > 5K2
SELECT SECOND <—YES
PROFILE
716/
I
/
//TBR+<19(';;1_\
I NO
BR2
.
"\\ 8R2) ///“
/
SELECFEEST
PROFILE
/
\
‘
E
s
V
/
71 J
ADJUST BIT RATE IN PROFILE TO MATCH TER F?
SUGGEST PUBLISHING CONSOLIDATED vIDEO STREAM
USING SELECTED PROFILE WITH ADJUSTED BIT RATE. ,
722
US. Patent
Aug. 5, 2008
Sheet 11 0f 12
US 7,409,145 B2
SEE
YES
F'G' 8
EXECUTE BFPR
ANALYZE FRAMES ON TIMELINE TO _/80
4
DETERMINE MAX RESOLUTION (TR)
I
806
CALCULATE TOTAL TARGET BIT RATE (TTBR) J
l
[T 807
FROM A PROFILE DATABASE IDENTIFY A FIRST LIST OF
PROFILES SPECIFYING A FIRST BIT RATE BR1 IDENTIFY A
SECOND LIST OF PROFILES SPECIFYING A SECOND BIT RATE
BR2 SUCH THAT BR1 > TTBR > BR2
810
K 812
SELECT
SECOND LIST +YES
OF PROFILES
Tr
< OR =
BR2 +B$,(BR1_BR2)
SELECT FIRST
NO—>
'
LIST OF
PROFILES
808
FROM SELECTED LIST OF PROFILES IDENTIFY A FIRST PROFILE
SPECIFYING A FIRST RESOLUTION R1 IDENTIFY A SECOND PROFILE
SPECIFYING A SECOND RESOLUTION R2 SUCH THAT R1 > TR > R2
K4:13
SELECT
SECOND <—YES
PROFILE
SELECT
NO—> FIRST
PROFILE
816)
(820
818
—>I ADJUST BIT RATE IN PROFILE TO MATCH TTBR
‘
EXECUTE
SUGGEST PUBLISHING CONSOLIDATED VIDEO
STREAM usme SELECTED PROFILE WITH TBR
826
BQPR
(SEE FIG. 7)
\826
SUGGEST PUBLISHING CONSOLIDATED VIDEO
STREAM usme SELECTED PROFILE WITH TTBR
828
US 7,409,145 B2
1
2
SMART PROFILES FOR CAPTURING AND
PUBLISHING AUDIO AND VIDEO STREAMS
SUMMARY OF THE INVENTION
The invention provides an MEA for capturing and publish
ing video and/or audio streams. In particular, the invention
automatically suggesting parameters settings to use for cap
turing and/or publishing video or audio streams.
TECHNICAL FIELD
The present invention relates to the ?eld of capturing and
publishing audio and video streams. Speci?cally, the present
invention pertains to the automatic selection of appropriate
audio and video pro?les for use during capture and publica
In accordance With one aspect of the invention, a computer
readable medium includes executable instructions for captur
ing a video stream. The executable instructions include for
tion of audio and video streams.
mat instructions for determining a format of the video stream.
The executable instructions also include property instructions
for determining a property of the of the video stream. The
executable instructions also include selecting instructions for
BACKGROUND OF THE INVENTION
An increasing number of people oWn and use camcorders
to make videos that capture their experiences and document
selecting a pro?le that corresponds to the determined format
and the determined property of the video stream. The execut
their lives. Multimedia editing applications (MEA) alloW
users to bring neW life to such videos via their personal
computer by bringing professional video and audio editing to
personal computers. However, MEAs are not alWays user
friendly and users Who are less experienced With such appli
cations may be overWhelmed.
The ?rst step in editing video and audio material With an
20
MEA is importing a video and/or an audio stream from a
source to a computer storing the MEA. This process of
importing video and/ or audio streams to the computer is also
knoWn as capturing. The next step is converting the video and
audio data contained in the respective stream into the appro
priate format (i.e., digital) for storage on the computer as
individual ?les or clips. Traditional MEAs include a library or
a bin in Which the user can assemble and organiZe clips stored
on disk, vieW the name and a thumbnail of each clip, and
25
30
previeW their contents. Each clip can then be edited, and
dragged to a timeline or storyboard to assemble a production.
The centerpiece of any video editing tool is the storyboard
or the timeline in Which the user arranges the video or audio 35
production. This alloWs the user to organiZe the video and
audio clips, insert transitions betWeen them, and apply special
effects, and add titles and background audio. The consoli
dated video or audio production can then be saved or “pub
lished” to the hard drive of the computer or to an external
45
50
a video property of the consolidated video stream.
In accordance With another aspect of the invention, a com
puter readable medium includes a capture routine for execut
Traditional MEAs have emphasiZed ease of use by con
lution, frame rate, and bit rate. These parameters can deter
mine the amount of disk space used, image quality and play
ing capturing instructions for capturing an audio stream
according to a capture pro?le. The capture pro?le de?nes an
audio property of the audio stream. The computer readable
back rate.
In fact, in order to use the MEA effectively, MEAs often
require the user to de?ne pro?les specifying parameters for
capturing video and audio material, and for publishing the
consolidated video and audio material. HoWever, de?ning a
pro?le for capturing and for publishing can be a daunting task
55
stream.
60
In accordance With yet another aspect of the invention, a
computer readable medium includes executable instructions
for publishing a consolidated video stream from a timeline
For these reasons, there is a need for an MEA that alloWs
the user to easily capture and publish their video and audio
material at the appropriate quality and/ or siZe for What they
are trying to achieve. Moreover, the capturing and publishing
process should be accomplished Without requiring the user to
have knoWledge of bit rates, resolution, frame rates, compres
sion formats, etc.
also includes a publishing routine for publishing a consoli
dated audio stream according to a publish pro?le. The publish
pro?le de?nes an audio property of the consolidated audio
for novice users, especially if the user does not knoW or
understand the meaning of resolution, frame rate, bit rate, etc.
In accordance With another aspect of the invention, a com
puter readable medium includes a capture routine for execut
ing capturing instructions for capturing a video stream
according to a capture pro?le selected from a list of pro?les.
The capture pro?le de?nes a video format and a video prop
er‘ty of the video stream. The computer readable medium also
includes a publishing routine for publishing a consolidated
video stream according to a publish pro?le selected from the
list of pro?les. The publish pro?le de?nes a video format and
tor or speakers, respectively, linked With the computer.
of design choices are made prior to capturing video and audio
material and prior to publishing the consolidated video and
audio material (i.e., production). These choices include reso
includes determining a format of the video stream. The
method also includes determining a property of the video
stream. The method also includes selecting one of the pro?les
from the list of pro?les that corresponds to the determined
format and the determined property of the video stream. The
method further includes capturing the video stream according
to the selected pro?le.
In accordance With another aspect of the invention, a
method is provided for publishing a video stream according to
one of a list of pro?les, each pro?le specifying a plurality of
parameters for publishing a stream of video. The method
includes determining a format of the video stream. The
method also includes determining a property of the video
stream. The method also includes selecting one of the pro?les
of list of pro?les that correspond to the determined format and
the determined property of the video stream. The method
further includes publishing the video stream according to the
selected pro?le.
40
storage medium such as a DVD or CD. In addition, the con
solidated video or audio production can be played on a moni
straining the user interface, simplifying the layout process,
restricting the number of parallel tracks, and providing a
step-by-step process for adding effects. HoWever, a number
able instructions fur‘ther include capturing instructions for
capturing the video stream according to the selected pro?le.
In accordance With another aspect of the invention, a
method is provided for capturing a video stream according to
one of a list of pro?les, each pro?le specifying a plurality of
parameters for capturing a stream of video. The method
including a plurality of individual video streams. Format
instructions determine a format of the consolidated video
stream. Property instructions determine a property of the
65
consolidated video stream. Identifying instructions identify a
list of pro?les that correspond to the determined format and
the determined property of the consolidated video stream.
US 7,409,145 B2
3
4
Selecting instructions select one of the identi?ed list of pro
?les that correspond to the determined format and the deter
mined property of the consolidated video stream. Publishing
In accordance With another aspect of the invention, a com
puter-readable medium stores a data structure. A ?rst data
?eld includes data representing a prede?ned list of pro?les,
and each pro?le speci?es parameters for capturing a data
instructions publish the consolidated video stream according
to the selected pro?le.
stream. A second data ?eld contains data representative of the
data stream. A third functioning ?eld retrieves one of the
prede?ned list of pro?les as a function the data representative
of the data stream. Parameters speci?ed in the retrieved pro
?le are used for capturing the data stream.
In accordance With another aspect of the invention, a com
puter-readable medium stores a data structure. A ?rst data
In accordance With another aspect of the invention, a com
puter readable medium includes executable instructions for
publishing a consolidated video stream from a timeline
including a plurality of individual video streams that have a
total length of time. Format instructions determine a format of
the consolidated video stream. Property instructions deter
?eld includes data representing a prede?ned list of pro?les,
and each pro?le speci?es parameters for publishing the con
mine a property of the consolidated video stream. Determin
ing instructions determine a target ?le siZe of the consolidated
video stream to be published. Calculating instructions calcu
solidated data stream. A second data ?eld contains data rep
resentative of the consolidated data stream. A third function
ing ?eld retrieves one of the prede?ned list of pro?les as a
late a total target bit rate based on the determined target ?le
siZe and the total length of time of the consolidated video
stream. Identifying instructions identify a list of pro?les that
correspond to the calculated target bit rate. Selecting instruc
tions select one of the pro?les of the identi?ed list of pro?les
that correspond to the calculated target bit rate. Publishing
instructions publish the consolidated video stream according
to the selected pro?le.
In accordance With another aspect of the invention, a com
puter readable medium includes executable instructions for
capturing an audio stream. The computer readable medium
also includes a list of pro?les, each specifying a plurality of
parameters for capturing a stream of audio. Property instruc
tions determine a property of the audio stream. Selecting
instructions selecting one of the pro?les from the list of pro
?les that corresponds to the determined property of the audio
stream. Capturing instructions capture the audio stream
20
BRIEF DESCRIPTION OF THE DRAWINGS
25
FIG. 1 is an exemplary block diagram illustrating a com
puter comprising a multimedia editing application for captur
ing a data stream and for publishing a consolidated data
stream.
30
35
a video stream and/ or publishing a consolidated video stream.
40
audio. Property instructions determine a property of the con
45
consolidated audio stream. Publishing instructions publish
the consolidated audio stream according to the selected pro
?le.
In accordance With another aspect of the invention, a com
puter readable medium includes executable instructions for
50
publishing a consolidated audio stream from a timeline that
includes a plurality of individual audio streams having a total
length of time. The computer readable medium also includes
a list of pro?les, each specifying a plurality of parameters for
publishing a stream of audio. Determining instructions deter
55
provide the best quality video during playback.
FIG. 6 is an exemplary block diagram illustrating the con
tents of a publishing routine comprising computer executable
instructions for publishing a consolidated video stream to ?t
a designated ?le siZe.
tions identify pro?les from the list of pro?les that correspond
the consolidated audio stream according to the selected pro
?le.
FIG. 4B is a screen shot illustrating a Tools/Options menu
of the MEA for setting defaults.
FIG. 5 is an exemplary block diagram illustrating the con
tents of a publishing routine comprising computer executable
instructions for publishing the consolidated video stream to
to provide the best quality audio during playback.
60
to the calculated target bit rate. Selecting instructions select
one of the pro?les from the list of pro?les that correspond to
the calculated target bit rate. Publishing instructions publish
FIG. 4 is an exemplary block diagram illustrating the con
tents of the video capture routine comprising computer
executable instructions for capturing video streams.
FIG. 4A is an exemplary block diagram illustrating the
contents of the audio capture routine comprising computer
executable instructions for capturing audio streams.
FIG. 5A is an exemplary block diagram illustrating the
contents of a publishing routine comprising computer execut
able instructions for publishing the consolidated audio stream
mine a target ?le siZe of the consolidated audio stream to be
published. Calculating instructions calculate a target bit rate
based on the determined target ?le siZe and the total length of
time of the consolidated audio stream. Identifying instruc
FIG. 3A is an exemplary block diagram illustrating the
contents of a pro?le database storing parameter data for cap
turing an audio stream and/or publishing a consolidated audio
stream.
solidated audio stream. Identifying instructions identify pro
?les from the list of pro?les that correspond to the determined
bit rate of the consolidated audio stream. Selecting instruc
tions select one of the pro?les from the identi?ed list of
pro?les that correspond to the determined bit rate of the
the invention.
FIG. 3 is an exemplary block diagram illustrating the con
tents of a pro?le database storing parameter data for capturing
for publishing a consolidated audio stream from a timeline
including a plurality of individual audio streams. The com
puter readable medium includes a list of pro?les, each speci
fying a plurality of parameters for publishing a stream of
FIG. 1A is a screen shot illustrating a timeline.
FIG. 1B is a screen shot illustrating a user interface alloW
ing the selection of a publication target.
FIG. 2 is an exemplary block diagram illustrating compo
nents of a multimedia editing application for implementing
according to the selected pro?le.
In accordance With yet another aspect of the invention, a
computer readable medium includes executable instructions
function the data representative of the consolidated data
stream. Parameters speci?ed in the retrieved pro?le are used
for publishing the consolidated data stream.
Alternatively, the invention may comprise various other
methods and apparatuses. Other features Will be in part appar
ent and in part pointed out hereinafter.
65
FIG. 6A is an exemplary block diagram illustrating the
contents of a publishing routine comprising computer execut
able instructions for publishing a consolidated audio stream
to ?t a designated ?le siZe.
US 7,409,145 B2
5
6
FIG. 7 is an exemplary ?oW chart illustrating operation of
the best-quality video publication routine as described in
In one embodiment, the capture tool 114 is a capture card
that is connected to the computer 112 via an ISA slot or PCI
slot. The capture card includes an input port that can be linked
reference to FIG. 5.
With an output port of the V/A source 110 via a cable such as
FIG. 8 is an exemplary ?oW chart illustrating operation of
S-video cable, or coaxial cable. Thereafter, video data or
the best-?t video publication routine as described in reference
audio data can be transferred from V/A source 110 to the
to FIG. 6.
computer 112. For example, the video stream 102 can be
captured from a VCR by connecting a cable to the video
output port on the back of the VCR and connecting the other
end of the cable to the video-in of the video capture card. In an
alternative embodiment, the capture tool 114 is a direct con
nection such as an Institute of Electrical and Electronic Engi
neers (IEEE) 1394 interface. Typically, the IEEE 1394 inter
FIG. 9 is a block diagram illustrating one example of a
suitable computing system environment in Which the inven
tion may be implemented.
Corresponding reference characters indicate correspond
ing parts throughout the draWings.
face connection is used for capturing digital video. For
DETAILED DESCRIPTION OF THE INVENTION
example, the IEEE 1394 interface can be connected to an
IEEE1394 connection port on a digital camcorder and con
nected to an IEEE1394 connection port on the computer 112
Referring ?rst to FIG. 1, an exemplary block diagram
illustrates a computer comprising an MEA 101 for capturing
to facilitate transfer the digital signal generated by digital
a video stream 102 or audio stream 104 and for publishing a
camcorder to the computer 112 for storage.
consolidated video stream 106 or audio stream 108 to be 20
Video capture and storage often involves large amounts of
captured from a video/ audio source (V/A source) 110 linked
to the computer 112.
The video stream 102 includes video data representative of
a visual image or a series of visual images. The audio stream
104 includes audio data representative of one or more sounds,
Which may or may not correspond to the visual image or
series of visual images. The V/A source 110 can be any device
used to record a visual image or a series of visual images
and/or sound data. The V/A source 110 may generate the
video stream 102 or audio stream 104 as an analog signal or
25
RAM of the computer 112.As a result, the available RAM can
be used up in a feW seconds. HoWever, because of the repeti
30
RAM. Consequently, the actual data stored in the system
analog camcorder that generates the video stream 102 and
audio stream 104 as analog signals. Alternatively, the V/A
source 110 can be a Web camera or DV camcorder that gen
RAM can be much less than it Would be otherWise. After the
video or audio ?les are compressed they can be decompressed
35
is often referred to as a CODEC. One of the draWbacks in
The video format data indicates a frame rate, a resolution, an
property data and/or the audio property data. As described
above, video and audio streams 102, 104 can be generated in
analog format or digital format. In the case of an analog
format, the capture tool 114 digitiZes the video stream or
for playback. The method of compression and decompression
reducing the siZe of the video or audio ?le is that there is some
loss in video or audio quality. CODECs often require users to
aspect ratio, and a frame display mode of the individual video
images or “frames” included in the video stream 102.
A capture tool 114 is linked to the computer 112 and the
V/A source 110 for capturing the video format data, the video
tive nature of the data contained in the video ?le 116 or audio
?le 118, compression schemes can be used to eliminate
redundant information before it is channeled into the system
a digital signal. For example, the V/A source 110 can be an
erates the video stream 102 as a digital signal. Whether analog
or digital, the video stream 102 includes video format data.
data. For instance, to display a single 320x200 frame (64000
pixels) on a computer monitor may require approximately
128 KB of RAM per frame (assuming each pixel requires
approximately 2 bytes; 16 bit display). In order to capture ten
(10) complete frames per second, more than 1.28 MB per
second (i.e., bit rate) Will have to be channeled into system
40
de?ne parameters that Will be used during the compression
and decompression process. The parameters typically include
bit rate, frame rate, and resolution. The selection of these
parameters can signi?cantly in?uence the video or audio
quality. CODEC functions can be implemented in hardWare
as a digital signal processor, or in softWare as a driver.
45
A MEA 101 such as [email protected] Movie Maker offered by
audio stream before transferring the format data to the com
Microsoft, Inc. alloWs a user to create a consolidated video
stream 106 and/or consolidated audio stream 108. The con
puter 112 for storage. In other Words, data contained in the
analog video stream 102 and/or analog audio stream 104 is
designed sequence of video frames or images that combine to
converted into binary format (i.e., digital format) for storage
solidated video stream 106 is representative of a user
50
associated With the computer 112. For example, the CRM 120
may be the random access memory (RAM) of the computer
112 or the hard drive of the computer 112. In contrast, a digital
stream is already in digital format and the capture tool 114
transfers the digital data directly to the CRM 120 of the
video frames, that combine to produce sounds via speakers. In
55
one embodiment, the MEA 101 provides a User-Interface 122
for stepping the user through the process of arranging video
?les, or video clips, into a desired sequence along a timeline.
(See FIG. 1A). The MEA 101 creates a consolidated video
computer for storage as a video ?le 116 or audio ?le 118.
After the video ?le stored is stored on the hard drive in digital
form (eg as a .avi, .Wmv) it also contains video property data.
form a motion video image on a television screen or computer
system monitor. The consolidated audio stream 108 is repre
sentative of a user designed sequence of audio ?les, Which
may or may not correspond to the user designed sequence of
as a video ?le 116 or audio ?le 118, respectively. In this case,
the video ?le 116 or audio ?le 118 is stored on a CRM 120
stream 106 corresponding to the video clips along the time
60
line, and/or creates a consolidated audio stream 108 corre
The video property data indicates a total bit rate and a reso
sponding to the audio clips along the time line. The MEA 101
lution of the individual video frames included in the video
stream 102. After the audio ?le stored is stored on the hard
of the invention further alloWs the user to indicate a publica
drive in digital form (eg as .Wma) it also contains audio
property data. The audio property data indicates a bit rate, a
65
tion target for publishing the consolidated video stream 106
and/or consolidated audio stream 108. In this case, publica
tions refers to saving or transferring the consolidated video
sample frequency, and an output mode of the audio signal
stream 106 and/ or consolidated audio stream 108 to a particu
included in the audio stream 104.
lar publication target as a consolidated video ?le 124 and/or
US 7,409,145 B2
7
8
consolidated audio ?le 126, respectively. For example, con
stream according to parameters speci?ed in one of the list of
pro?les and a maximum ?le siZe de?ned by the user.
Referring next to FIG. 3, an exemplary block diagram
illustrates the contents of a video pro?le database 302 storing
solidated video ?les or consolidated audio ?les can be stored
on the CRM 120 of the computer 112, or on a destination
CRM 128 such as a DVD data disk or hard drive of a remote
computer. As another example, the consolidated video stream
data relating to parameters settings for use during the capture
106 can be published to the destination CRM 128 via e-mail.
of a video stream and/ or for use during the publication of the
By indicating a publication target (e. g., e-mail, CD), the user
consolidated video stream. Each pro?le in the video pro?le
also determines Whether video or audio pro?les are selected
to achieve the best quality or selected to achieve a best ?t to
database 302 has a resolution ?eld 304, a total bit rate ?eld
siZe. (See FIG. 1B).
appendix, Table 1).
306, a frame rate ?eld 308, and an audio pro?le ?eld 310. (See
The resolution ?eld 304 contains resolution data. The reso
Although the invention is described beloW in terms of
lution data includes capture and storage information regard
consolidated video and consolidated audio streams as de?ned
ing image quality. More speci?cally, resolution data includes
along a timeline, it is contemplated that the invention can be
used for publishing consolidated streams With an MEA 100
that utiliZes a storyboard or any other method for constructing
information regarding the amount of information contained
in each frame (i.e., image) of the video stream. Resolution is
expressed as the number of horizontal pixels times the num
consolidated streams.
Existing MEAs require the user to select a video pro?le
When capturing the video stream 102 or publishing the con
solidated video stream, and select an audio pro?le When
capturing the audio stream 104 or publishing the consolidated
audio stream. As used herein, the pro?le refers to a set of
parameters that correspond to a desired format and/or prop
erty of the particular stream 102, 104 to be captured, or to a
desired format and/or property of the particular consolidated
stream 106, 108 to be published. Although format data and
ber of vertical pixels (e.g., 640x480 or 720x480). A higher
resolution yields a higher quality image.
The total bit rate ?eld 306 contains total bit rate data. Bit
20
rate indicates the speed at Which digital information (binary
data) is transferred to the RAM of the computer, and is gen
erally measured in multiple bits per second (e.g., kilobits or
thousands of bits per second, Kbps). The total bit rate data
includes information regarding the speed at Which digital
25
video and audio information corresponding to a particular
video stream is transferred to RAM.
The frame rate ?eld 308 contains frame rate data. The
property data can often be detected from the V/A source 110
during the capture process, there are instances Where the
format and/or property data cannot be detected. Moreover,
there are instances Where the user may prefer not to use the 30
detected format and property due to quality issues or limited
storage space. Thus, pro?le selection alloWs the user more
control over the ?nal output ?le. Unfortunately, selecting a
pro?le can be a daunting task for users, especially if the user
is unfamiliar With resolution, frame rate, bit rate, etc. The
MEA 101 according to the invention alloWs the user to control
the capture of video and audio streams 102, 104 by automati
consolidated video and audio streams 106, 108 by automati
sponding audio pro?le to use With each video pro?le.
In one embodiment, the video pro?le database 302
includes other ?elds 312 containing data such as buffer Win
35
40
seconds, the buffer is siZed to hold 3 seconds of 32 Kbps
content, or 12,000 bytes (32000 bits per second><3 seconds/ 8
bits per byte). Smoothness data includes information regard
ing the smoothness of the motion video during playback.
Decoding complexity data includes information regarding
error control techniques used to control errors that occur
45
during the transmission and storage of digital data. Such
?elds are less important for implementing the invention, but
can be used for the publication of the consolidated video.
204, a best-quality publishing routine 206, and a best-?t pub
lishing routine 208.
The pro?le database 202 stores a list of pro?les. Each
doW data, smoothness data, or decoding complexity data.
Buffer WindoW data includes information regarding hoW
much content can be buffered. For example, if you have a
stream With a bit rate of 32 Kbps and a buffer WindoW of 3
cally suggesting parameter settings for publication of the
consolidated video and audio stream 106,108.
Referring noW to FIG. 2, an exemplary block diagram
illustrates basic components of an MEA 200 for implement
ing the invention. In this example, the basic components of an
MEA 200 include a pro?le database 202, a capturing routine
The audio pro?le ?eld 310 contains audio pro?le data. The
audio pro?le data includes information regarding a corre
cally suggesting parameters settings for capture of the video
or audio stream 102, 104. Moreover, the MEA 101 according
to the invention alloWs the user control the publication of
frame rate data includes information regarding the number of
frames to display per second in a motion video.
Referring next to FIG. 3A, an exemplary block diagram
illustrates the contents of an audio pro?le database 320 stor
50
ing data relating to parameters settings for use during the
pro?le speci?es parameters for capturing video or audio
capture of an audio stream or during the publication of the
streams or for publishing consolidated video or consolidated
consolidated audio stream. Each pro?le in the audio pro?le
audio streams. In this case, each parameter speci?es a desired
database 320 has a bit rate ?eld 322, an audio sample rate ?eld
format value (e.g., frame rate, resolution) and/or a desired
property value (e.g., bit rate) to use for capturing video or
324, and an output ?eld 326. (See appendix, Table 2).
55
consolidated audio streams. The capturing routine 204
includes a set of computer executable instruction for captur
ing the video stream or audio stream according to parameters
speci?ed in one of the list of pro?les. The best-quality pub
computer.
60
lishing routine 206 includes a set of computer executable
instruction for publishing the consolidated video stream and/
or consolidated audio stream according to parameters speci
?ed in one of the list of pro?les that Will produce the best
quality video during playback. The best-?t publishing routine
208 includes a set of computer executable instructions for
publishing a consolidated video and/or consolidated audio
The bit rate ?eld 322 contains audio bit rate data. The audio
bit data includes information regarding the speed at Which
digital audio information is transferred to the RAM of the
audio streams, or to use for publishing consolidated video or
A sample rate ?eld 324 contains audio sample rate data.
The audio sample rate is the frequency at Which samples are
taken from an analog audio stream during the conversion of
the analog signal to a digital format. The higher the sampling
rate the more the digitiZed audio stream matches the original
65
sound during playback. Thus, the audio sample rate data
includes information regarding the frequency With Which
samples are taken from an analog audio stream. For example,
common sampling rates like 44.1 KHZ indicates that the
US 7,409,145 B2
10
video includes retrieving the region setting (i.e., country)
analog audio signal is sampled 44100 times per second to
convert the analog sound (source) into a digital stream.
The output ?eld contains 326 output data. The output data
includes information regarding Whether the audio stream is
from an operating system such as WindoWs XP®. The video
further include retrieving from a table (not shoWn) that lists
TV standards by country, the TV standard that corresponds to
the country retrieved from the region setting of the operating
system. In another embodiment, during the capture of the
stereo or mono.
In one embodiment, the audio pro?le database 320
includes other ?elds 328 such as a buffer WindoW ?eld, Which
video stream the format of the video stream is determined on
a frame-by-frame basis. That is, no conversion betWeen for
mats takes place even if formats of the frames included in the
is less important, but can be used for the publication of the
consolidated audio stream.
Referring next to FIG. 4, an exemplary block diagram
video stream are mixed. For example, during capture of a
video stream including 16:9 and 4:3 frames, frames that have
illustrates the contents of a video capture routine 402 com
prising computer executable instructions for capturing a
an aspect ratio of 16:9 Will be determined to have an aspect
ratio of 16:9 and frames that have an aspect ratio of 4:3 Will be
determined to have an aspect ration of 4:3.
In another embodiment, the user designates a default set
video stream.
Video format instructions 404 include determining the for
mat of the video stream being captured. In this embodiment,
determining the format includes determining the aspect ratio,
the frame rate, the resolution, and/ or the frame display mode
of the video stream. The frame rate corresponds to the frame
per second (fps) rate at Which the individual series of video
frames Will be displayed during playback on a vieWing device
20
(not shoWn) such as a television or computer monitor. Gen
erally, the geographic region in Which the video stream is
targeted for playback determines the frame rate of the video
stream. For example, a video stream targeted for playback in
Europe Will generally have a frame rate of approximately 25
fps, Which corresponds to an Phase Alternating Line (PAL)
format. In contrast, a video stream targeted for playback in
North America Will generally have a frame rate of approxi
mately 30 fps, Which corresponds to an National Television
Systems Committee (NTSC) format. PAL and NTSC are
25
case, determining one or more properties includes determin
ing the total bit rate of the video stream. In one embodiment,
30
commonly referred to as TV standards.
The aspect ratio indicates the Width to height proportion of
rectangular. As described above, the resolution indicates a
40
Whether the video frame is interlaced or progressive scan. For
45
“interlaced video.” HoWever, digital video displays use pro
50
55
stream as received from the video source. For example, it is
720x576 and 25 fps may be selected. In another embodiment,
the pro?le selection is dependent on the determined video
source. For example, if the video source is a Web camera a
pro?le specifying the determined resolution and the deter
mined frame may be selected.
60
format instructions 404 designate the format. For example, if
the TV standard (i.e., PAL or NTSC) of the video stream
cannot be determined, the video format instructions 404
assign a default TV standard. In one such embodiment, the
as possible (i.e., not throWing aWay resolution). In one
embodiment, the pro?le selection is dependent on the deter
mined format. For example, if the format of a digital video
stream is determined to be NTSC, a pro?le specifying a
resolution of 720x480 and a frame rate of 30 fps may be
selected. Alternatively, if the format of a digital video stream
is determined to be PAL, a pro?le specifying a resolution of
include automatically detecting the format of the captured
possible to detect from a digital camcorder driver in most
cases Whether the camcorder is NTSC or PAL (i.e., the frame
rate). Because there may be cases Where the camcorder
returns both NTSC and PAL or the value returned is incorrect,
the video format instructions 404 may include assigning a
default format for the video stream. That is, in the event the
format of the video stream cannot be determined the video
3. The purpose of using pro?les is to capture the video stream
at a resolution and frame rate that is not higher than the video
source, While at the same time capturing as much information
of these ?elds are displayed every second. This is called
video stream by analyZing encoded data included the video
determining one or more properties includes determining the
bit rate of the audio stream. In one embodiment, properties of
the video stream being captured are determined from encoded
data included in the video stream. For example, the resolution
and bit rate can be determined from resolution data encoded
into the video stream by the video source (e.g., digital camera
Video pro?le instructions 408 include selecting one of a list
of video pro?les such as described above in reference to FIG.
even lines (the even ?eld). The “odd ?eld” and the “even
gressive scan to update the screen. With this method, the
screen is not separated into ?elds. Rather, each line is dis
played in sequence, from top to bottom. The entire frame is
displayed 30 times every second. This is also called “non
interlaced video.”
In another embodiment, the video format instructions 404
properties of the audio stream being captured. In this case,
driver) or the capture tool 114 (i.e., capture card).
example, When a television set displays its analog video sig
nal, it displays the odd lines (the odd ?eld) ?rst and then the
?eld” together form a frame. For NTSC, approximately sixty
the bit rate can be determined from data encoded into the
instructions 406 further include determining one or more
35
frames displayed on Widescreen television sets appear more
quality of the image and is typically expressed horizontal
pixels times vertical pixels. The frame display mode indicates
properties of the video stream being captured are determined
from encoded data included in the video stream. For example,
video stream by the video source (e.g., digital camera driver)
or the capture tool 114 (i.e., capture card). Video property
a video frame. For example, traditional television sets have a
4:3 aspect ratio and video frames displayed on traditional
television appear almost square in appearance. In contrast,
Widescreen television sets have a 16: 9 aspect ratio, and video
ting using a Tools/Options menu of the MEA. Thereafter, the
video format instructions 404 designate the format of the
captured video stream to correspond to the default setting
designated in the Tools/Options menu. For example, the
MEA may designate 4:3 as the default for the aspect ratio and
designate interlaced as the default for the frame display mode
Within Tools/Options menu. (See FIG. 4B).
Video property instructions 406 include determining one
or more properties of the video stream being captured. In this
65
Video capturing instructions 410 include capturing the
video stream according to the selected pro?le. In this case,
capturing refers to the process of storing video data and audio
data on the computer in accordance With parameters speci?ed
in the selected video pro?le. In one embodiment, video and
sound data contained in the video stream are synchroniZed
together, and then stored on the hard drive in a standard ?le
format such as WindoWs MediaTM Video (.WMV).
US 7,409,145 B2
11
12
Referring noW to FIG. 4A, an exemplary block diagram
illustrates the contents of audio capture routine 420 compris
ing computer executable instructions for capturing an audio
cannot be determined, the video best-quality property
instructions 506 may designate the resolution as 320x240.
In another embodiment, the user designates a default set
stream.
ting using a Tools/Options menu of the multimedia editing
Audio property instructions 422 include determining one
or more properties of the particular audio stream being cap
application. Thereafter, the video best-quality format instruc
tured. In this case, determining one or more properties
includes determining the bit rate of the audio stream. In one
embodiment, properties of the audio stream being captured
are determined from encoded data included in the audio
stream. For example, the bit rate and output mode (i.e., stereo
or mono) can be determined from resolution data or other
metadata encoded into the audio stream by the audio source.
Audio pro?le instructions 424 include selecting one of a
list of pro?les such as described above in reference to FIG.
3A. In one embodiment, the audio pro?le selection is depen
dent on the audio source. For example, if the audio stream is
generated by a digital camcorder an audio pro?le specifying
the sample rate used by the digital camcorder is selected. In
another embodiment, the audio pro?le instructions 424
include designating a audio pro?le. For example, if the audio
20
source is a microphone an audio pro?le specifying a sample
rate of 44.1 khZ and a stereo output mode may be selected.
Audio capturing instructions 426 include capturing the
audio stream according to the selected pro?le. In this case,
audio capturing refers to the process of storing audio data on
25
the computer in accordance With parameters speci?ed in the
selected audio pro?le. In one embodiment, sound data con
tained in the audio stream is stored on the hard drive in a
standard ?le format such WindoWs MediaTM Audio (WMA).
tions 504 designate the format of the consolidated video
stream to correspond to the default setting as designated in the
Tools/Options menu. In one such embodiment, the aspect
ratio (e.g., 4:3 or 16:9) and the frame display mode (i.e.,
interlaced or progressive) is determined solely from Tools/
Options settings as designated by the user or as designated as
a default Within the MEA. For example, the MEA may des
ignate 4:3 as the default for the aspect ratio and designate
interlaced as the default for the frame display mode Within
Tools/Options menu. As another example, if the frame dis
play mode (i.e., TV standard) is determined to be in an NTSC
format the resolution is designated as 720x480. Alternatively,
if the frame display mode is determined to be in a PAL format
the resolution is designated as 720x576.
Video best-quality property instructions 506 include deter
mining one or more properties of the consolidated video
stream to be published. In this instance, determining one or
more properties includes determining the bit rate of the con
solidated video stream. In one embodiment, the bit rate of the
consolidated video stream is determined from a maximum bit
rate of one or more video clips that de?ne the consolidated
video stream, respectively. In other Words, each video clip
30
along the timeline is examined to determine its resolution and
bit rate. Thereafter, the maximum bit rate can be determined
After the video stream or audio stream has been captured,
the MEA can be used to publish a consolidated video stream
by comparing bit rate of each frame in the clip to determine
or consolidated audio stream.
maximum and minimum values. In an alternative embodi
Referring noW to FIG. 5, an exemplary block diagram
illustrates the contents of a video best quality publishing
routine 502 comprising computer executable instructions for
publishing a consolidated video stream that provides the best
ment, the video best-quality property instructions 506 include
assigning a default property for the consolidated video
35
stream. That is, in the event the property of the video stream
cannot be determined from examining the timeline, the video
best-quality property instructions 506 designate the property.
quality video during playback.
Video best-quality format instructions 504 include deter
mining the format of the consolidated video stream to be
published. In this case, determining the format includes deter
mining the aspect ratio, the frame rate, resolution and/or the
For example, if the bit rate of the video content cannot be
40
determined, the video best-quality property instructions 506
may designate the bit rate as 304 Kbps.
Video best-quality identifying instructions 508 include
the same as the format of the one or more captured video 45
identifying a set of video pro?les that corresponds to the
determined format and the determined property of the con
solidated video stream. In this case, the set of video pro?les is
frames from Which the video stream is de?ned. If the video
frames that de?ne the consolidated video have different for
identi?ed from a list of video pro?les such as stored in the
pro?le database described above in reference to FIG. 3. As
mats, video best-quality format instructions 504 may include
described above, the pro?le database includes resolution
frame display mode of the consolidated video stream. In one
embodiment, the format of the consolidated video stream is
assigning a default format to consolidated video such as
described above in reference to FIG. 4. In one embodiment,
the resolution of the consolidated video stream is determined
from a maximum resolution of one or more video clips that
data. The set of video pro?les may include tWo or more of the
50
list of video pro?les each specifying the timeline resolution.
mined by comparing the resolution of each frame in the clip to
Alternatively, the set of pro?les may include a ?rst list of
pro?les and a second list of pro?les. The ?rst list of pro?les
includes the list of pro?les specifying the same ?rst resolu
tion, Wherein the speci?ed ?rst resolution is greater than or
equal to timeline resolution. The second list of pro?les
includes the list of pro?les each specifying the same second
determine maximum and minimum values. One method of
resolution, Wherein the speci?ed second resolution less than
comparing resolutions involves calculating the number of
pixels in each frame. This achieved by multiplying the Width
60
or equal to the timeline resolution. In other Words, the time
line resolution falls betWeen the ?rst resolution speci?ed in
the ?rst list of pro?les and the second resolution speci?ed in
65
instructions 508 further include identifying a ?rst pro?le and
a second pro?le from the identi?ed set of video pro?les. The
?rst pro?le speci?es a bit rate less than the determined bit rate
of the consolidated video stream. The second pro?le speci?es
de?ne the consolidated video stream. In other Words, each
video clip along the timeline is examined to determine its
resolution. Thereafter, the maximum resolution can be deter
by the height. For example, a frame With a 420x420 resolu
55
second list of pro?les. The video best-quality identifying
tion has a total of 176,400 pixels, and a frame With a 480><3 60
resolution has a total of 172,360 pixels. Therefore, the reso
lution of the 420x420 frame is greater than the 480x360
frame. In the event the format of the video stream cannot be
determined from examining the timeline, the video best-qual
ity format instructions 504 can designate the format. For
example, if the resolution and bit rate of the video content
a bit rate greater than the determined resolution of the con
solidated video stream. (See appendix, Example 1).
US 7,409,145 B2
14
13
selection instructions. (See FIG. 1B). By indicating a publi
cation target (e.g., e-mail, CD), the user determines Whether
Video best-quality selection instructions 510 include
selecting one of the lists of pro?les as a function of the
difference betWeen the identi?ed ?rst list of pro?les and the
identi?ed second list of pro?les. In one embodiment, the
video best-quality selection instructions 510 include select
ing one of the identi?ed lists of pro?les as function of the
difference betWeen the ?rst resolution speci?ed in the ?rst list
of pro?les and the second resolution speci?ed in the second
video pro?les are selected to achieve the best-quality or
selected to achieve a best ?t to siZe.
Referring noW to FIG. 5A, an exemplary block diagram
illustrates the contents of an audio best-quality publishing
routine 520 comprising computer executable instructions for
publishing a consolidated audio stream that provides the best
list of pro?les. For example, video best-quality selection
quality audio during playback.
instructions 510 may include the following instructions:
If TR<:LR+0.l*(HR—LR), select list of pro?les With the
Audio best-quality property instructions 522 include deter
mining one or more properties of the consolidated audio
stream to be published. In this instance, determining one or
more properties includes determining the bit rate of the con
solidated audio stream. In one embodiment, the bit rate of the
consolidated audio stream is determined from a maximum
loWer resolution,
If TR>LR+O.I*(HR—LR), select list of pro?les With the
higher resolution,
If TR is loWer than the loWest resolution in the pro?le
database, select video pro?le With the loWest resolution
in the database, or
resolution and a maximum bit rate of one or more audio clips
If TR is higher than the highest resolution in the database,
select video pro?le With the highest resolution in the
pro?le database;
20
mined by comparing the bit rate of each clip to determine
Where TR is the maximum resolution determined from the
timeline, LR is the second resolution speci?ed by the second
maximum and minimum values. In an alternative embodi
ment the audio best-quality property instructions 522 include
assigning a default property for the consolidated audio
stream. That is, in the event the property of the audio stream
cannot be determined the audio best-quality property instruc
tions 522 designate the property.
list of pro?les, and HR is the ?rst resolution speci?ed by the
?rst list of pro?les. Notably, all other resolutions in the pro?le
database are either less than or equal to LR or greater than or
equal to HR. The video best-quality selection instructions 510
further include selecting one of the video pro?les as function
of the difference betWeen the identi?ed ?rst pro?le and the
identi?ed second pro?le. In another embodiment, the video
best-quality selection instructions 510 include selecting one
Audio best-quality identifying instructions 524 include
30
of the video pro?les as function of the difference betWeen the
bit rate speci?ed in the ?rst pro?le and the bit rate speci?ed in
the second pro?le. For example, video best-quality selection
instructions 510 may include the folloWing instructions:
IfTBR<:LBR+0.l *(HBR— BR), select video pro?le With the
35
loWer bit rate,
If TBR is loWer than the loWest bit rate in the list of pro?les,
(See appendix, Example 2).
select video pro?le With the loWest bit rate in the list of
pro?les, or
Audio best-quality selection instructions 526 include
selecting one of the audio pro?les as function of the differ
ence betWeen the identi?ed ?rst pro?le and the identi?ed
45
Where TBR is the maximum bit rate determined from the
timeline, LBR is the bite rate speci?ed by the second pro?le,
and HBR is the bite rate speci?ed by the ?rst pro?le. Notably,
all other bit rates in the pro?le database are either less than or
loWer bit rate,
If TBR>LBR+0T *(HBR— BR), select audio pro?le With the
higher bit rate,
55
Video best-quality publishing instructions 512 include
publishing the consolidated video stream according to the
selected pro?le. In this embodiment, video best-quality pub
lish instructions 512 includes local publishing and remote
60
and/or copying the consolidated video stream to the hard
drive of the computer that stores the MEA used to create the
consolidated video stream. In this case, remote publishing
indicates a publication target prior to the execution of the
If TBR is loWer than the loWest bit rate in the pro?le data
base, select audio pro?le With loWest bit rate in the
pro?le database, or
If TBR is higher than the highest bit rate in the pro?le
database, select audio pro?le With highest bit rate in the
pro?le database;
Where TER is the maximum bit rate determined from the
includes the process of saving and/or copying the consoli
dated video stream to an external CRM Where storage space
(i.e., ?le siZe) is not an issue. In one embodiment, the user
526 may include the folloWing instructions:
IfTBR<:LBR+0.l *(HBR— BR), select audio pro?le With the
the set of pro?les includes at least tWo of the list of video
pro?les each specifying the same resolution as the deter
mined resolution for the consolidated video stream, the ?rst
pro?le specifying a bit rate closest to the determined bit rate
publishing. Local publishing includes the process of saving
second pro?le. In one embodiment, audio best-quality select
ing instructions 526 include selecting one of the audio pro
?les as function of the difference betWeen the bit rate speci
?ed in the ?rst pro?le and the bit rate speci?ed in the second
pro?le. For example, audio best-quality selection instructions
equal to LBR or greater than or equal to HBR. In the case Where 50
is selected. (See appendix, Example 1)
identifying a set of audio pro?les that correspond to the deter
mined property of the consolidated audio stream. In this case,
the set of audio pro?les is identi?ed from a list of audio
pro?les such as stored in the audio pro?le database described
above in reference to FIG. 3A. As described above, the audio
pro?le database includes bit rate data. In one embodiment, the
set of pro?les includes a ?rst pro?le and a second pro?le from
the identi?ed set of audio pro?les. The ?rst pro?le speci?es a
bit rate less than the determined bit rate of the consolidated
audio stream. The second pro?le speci?es a bit rate greater
than the determined bit rate of the consolidated audio stream.
If TBR>LBR+O.I*(HBR—LBR), select video pro?le With the
higher bit rate,
If TBR is higher than the highest bit rate in the list of
pro?les, select video pro?le With the highest bit rate in
the list of pro?les;
that de?ne the consolidated audio stream. In other Words,
each audio clip along the timeline is examined to determine
its bit rate. Thereafter, the maximum bit rate can be deter
65
timeline, LBR is the bite rate speci?ed by the second pro?le,
and HER is the bite rate speci?ed by the ?rst pro?le. (See
appendix, Example 2). Notably, all other bit rates in the audio
pro?le database are either less than or equal to LBR or greater
than or equal to HBR.
Was this manual useful for you? yes no
Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Download PDF

advertisement