| Authors: | Editor: | |
|
Art Vandenberg Egon Verharen Samir Chatterjee (SIP Scenario) |
Steve Olshansky Comments/Discussion to
|
Several use scenarios are being developed to investigate the middleware requirements for videoconferencing.
The purpose is to:The scenarios will drive the mapping of realistic requirements against infrastructural elements and will therefore determine the architecture. The architecture model and the scenarios and their requirements will iterate off each other for a while. Seeing the model, people understand more about the specifics of the scenarios, and vice versa. Once the requirements are clear, the final mapping of them onto components will need video people working with IT architects.
It is hard to keep the scenarios realistic. It is tempting to add a lot of "second generation" desires. The goal is not to put everything in, but to avoid locking things out in subsequent development. If you notice we crossed the line here, please take up the discussion on the VidMid-VC mailing list. (If you are not already subscribed to this list, send a request to listmgr@internet2.edu)
It is also difficult, if not impossible, to generate all potential scenarios or, having generated them, to prioritize them in such a way that everyone would agree on which scenarios should be targeted for development first. With this in mind, the scenarios listed below represent only those generated to date with those that are under the most active development as indicated. In some cases, only "placeholder titles" are included. These indicated scenarios that we hope to elaborate on in the future. The work of Vidmid-VC in enabling the scenarios overall is anticipated to be ongoing, with refinements and reprioritization where and when necessary.
The scenarios below are intended to provoke discussion of what videoconferencing middleware should and shouldn't do. It is not necessarily the case that these represent a complete list of scenarios or that VidMid-VC should support all of those listed. These scenarios should encompass most of the sorts of things that we expect VidMid-VC to play a part in. Scenarios under active development within each category are designated as such.
Input for the scenario development process is:The format of the attached scenarios is:
Note: we are only concerned with callserver assisted call setup. Point-to-point videoconferencing where the IP address of the other person's device is used, although interesting for firewall scenarios, is not being addressed here.
The general scenario for scheduled multipoint conferencing
is:
Person A, or an administrator operating in behalf of person A, schedules a multipoint conference at a known MCU. Specification would include the call details of the participants, the meeting time to accommodate participants' existing calendar commitments or time zone preferences, the meeting mode, and security (public, private). Set up might include coordination or reservation of bandwidth, in case of limiting factors imposed by the video conferencing or other simultaneous activities.
Advanced: These settings (list of participants, their configurations, their time zone preferences, etc.) can be saved for reuse. It may be that the configuration aspects of individual end points are maintained in a manner that permits them to be mixed and matched dynamically for other multipoint (or endpoint-to-endpoint) sessions.
There is no end-user scenario worked out for the case where "interrealm" is used for "inter-protocol" videoconferencing.
All point-to-point and multipoint scenarios can be interrealm in the meaning of inter-administrative domain. Where needed this will be described explicitly.
Description
A person uses their videoconferencing device to contact another person who has a similar videoconferencing capability. They participate in a real-time video and audio discussion.
Example
Joan wants to contact Leo and discuss the issues related to a project they are working on together. Joan wants to have a visual component to the meeting, rather than using email or phone call, since they need to resolve some details that a more personal interaction is desirable. Both Joan and Leo have had videoconferencing devices set up in their offices for communications such as this.
Joan is able to lookup Leo's contact information in an authoritative directory, clicks on Leo's entry and a call connection is automatically initiated by the Joan's videoconferencing device. Leo's videoconferencing device alerts him to the connection request and identifies that it is Joan calling. As Leo is between meetings, he accepts the call and Joan and Leo meet with full audio and video capability.
Generic format
Two individuals with videoconferencing capability are able to make a connection. Connection information is looked up automatically by the videoconferencing device's gatekeeper that is closely integrated with LDAP directory services. The LDAP directory represents the people objects and resource objects such as individual classrooms or conference rooms that are videoconference enabled. Account objects represent individual accounts through which individuals access services. Endpoint objects represent the details of the videoconferencing equipment. When a connection request is made, the caller's identification and authentication is checked and then an authorization check ensures the caller has access to the accounts and other resources. Similar identification, authentication and authorization may be conducted for the call destination.
Needed components
Assumptions
The assumption is that both users have videoconferencing devices at an office or desktop level and it is realistic for them to "just call" one another in an ad hoc manner, like using the phone. Clearly, if the call were scheduled, rather than ad hoc, similar actions take place once a prior agreement determined the time of call.
There is an assumption that there identification, authentication and authorization are used to ensure appropriate use of resources - this is not a "personal" direct connection, but a connection mediated by gatekeeper functions. It is also assumed that a concept of caller id may be used, that is, the call is not just accepted as in "Hello, who's this?"
There is an assumption that the individuals are "where they are supposed to be" - perhaps a mechanism for call forwarding or redirection would be reasonable. This then becomes a scenario addressing mobility with attendant issues of managing mobility.
Uniqueness
This is a basic scenario that sets the minimum standard, if you will, for video middleware: identification, authentication, and authorization integrated with gatekeeper functions. The scenario is based on endpoint enabled videoconferencing capability and doesn't necessarily imply any larger infrastructure beyond directories, gatekeepers, and end users. There is an implication of "billing and accounting" issues associated with endpoint and account objects.
Description
Omar, Mary Lynn, Bill and Jill are from different universities, conference co-chairs for an upcoming seminar. Omar calls Mary Lynn and Bill via a videoconferencing device to discuss the plans. Mid-way in the call they connect Jill into the call for additional input.
Example
Omar, Mary Lynn, Bill and Jill are from four different universities in the region. They are conference co-chairs for an upcoming seminar. They are nearing the final stages or preparation. Omar needs to get feedback on some publicity plans including the layout model of the opening ceremony that has been brought to his office by his assistant. Omar calls Mary Lynn and Bill via a videoconferencing device and they discuss the plans for 30 minutes until they realize that they should get input from Jill. They connect Jill into the call and the four of them work through the details of the conference opening ceremony, including adjusting the layout model presented from Omar's office.
Generic format
Individuals use videoconferencing to work together. A multipoint connection is made via an MCU to enable simultaneous video and audio connections. Additional persons can be added to the call during the call, or individuals can drop out of the call. Connection information is looked up automatically by the videoconferencing device's gatekeeper that is closely integrated with LDAP directory services. The LDAP directory represents the people objects and resource objects such as individual classrooms or conference rooms that are videoconference enabled. Account objects represent individual accounts through which individuals access services. Endpoint objects represent the details of the videoconferencing equipment, including the MCU device that is being used for the call. The caller's identification and authentication is checked and authorization check ensures the caller has access to the MCU as well as other resources. Similar identification, authentication and authorization may be conducted for others who join the call.
Assumptions
Assumption is that this is an ad hoc situation: the availability of the MCU and other devices is a given. If this were to be a scheduled call, then after the arrangements were made to time, the scenario would proceed similarly.
It is assumed that persons are "where they are supposed to be" - perhaps a mechanism for call forwarding or redirection would be reasonable. This then becomes a scenario including mobility of users with attendant issues of managing mobility.
Not addressed is the mode of the conference, whether chair controlled, continuous presence, and voice switched.
Uniqueness
This scenario extends videoconferencing middleware
issues by the inclusion of Multipoint Control Units (MCUs) objects. Management
of MCU resources becomes important, as far as: availability on ad hoc basis,
discovery of the MCU resource, authorization to use.
Description
Omar, Mary Lynn, Bill and Jill are from different universities, conference co-chairs for an upcoming seminar. Omar's assistant schedules a videoconference call with Mary Lynn, Bill and Jill to discuss the final plans. The call arrangements are sent in advance to participants and the call is automatically started when the time arrives.
Example
Omar, Mary Lynn, Bill and Jill are from four different universities and are conference co-chairs for an upcoming seminar. They need to review the final plans and agree to schedule a videoconference call two weeks prior to the event. Omar's office assistant uses a web interface to schedule the call, resolving time zone differences and schedule conflicts of the participants. A final time slot is selected and the MCU and other resources are reserved. A reminder notice is sent to the participants and when the call time arrives, the session set up is automatically begun 15 minutes prior to the start time.
Generic format
Individuals use videoconferencing to work together. The schedule a multipoint connection is made via an MCU to enable simultaneous video and audio connections. An administrative function permits the scheduling of the call.
The connection set up may require coordination of time zones, participants' schedules, as well as availability of MCU and other devices. Set up also may involve setting format of the conference, whether chair controlled, continuous presence, voice switched.
Connection information is looked up automatically by the videoconferencing device's gatekeeper that is closely integrated with LDAP directory services. The LDAP directory represents the people objects and resource objects such as individual classrooms or conference rooms that are videoconference enabled. Account objects represent individual accounts through which individuals access services. Endpoint objects represent the details of the videoconferencing equipment, including the MCU device that is being used for the call. The caller's identification and authentication is checked and authorization check ensures the caller has access to the MCU as well as other resources. Similar identification, authentication and authorization may be conducted for others who join the call.
Needed components
Assumptions
Uniqueness
Description
A live videoconference session is streamed live to enable viewing by those without higher end videoconferencing capability and also as a future saved archive of the event.
Example
An Internet2 Member Meeting is planned. Live videoconference capability will be available for the event, permitting remote users with access to appropriate MCU and resources to participate in two-way mode with the events live. Additionally, the conference is simultaneously streamed to extend the conference (one way visual only) to a wider audience. This stream will be retained as well as an archive of the event that may be accessed "on-demand."
Generic format
A live videoconference is also captured for streaming. The streaming may be in parallel with live event or for later
"on-demand" use.
People and organizations often perceive streaming video and video-on-demand (VOD) as being not only alternative modes of participation in live videoconferencing events but also as separate communications modes of their own.
In some ways streaming video communication can be viewed as a superset of real-time videoconferencing with
In this view, an event worth supporting with videoconferencing access is also routinely streamed live and captured with the highest capture quality possible for later encoding in a variety of rates for viewing on-demand. The live streams make it possible for people anywhere, who will not have real-time H.323 equipped systems available to them, to participate actively with an audio and video feed of the event and with a telephone or text based back-channel for their interaction. The VOD files made available immediately after the event make it possible for remote participants to refresh aspects of the event or to review material for more intense study.
Needed components
Assumptions
Uniqueness
This raises the bar on "live" videoconferencing requirements. Issues
are now included relating to simultaneous capture, additional equipment needs,
and managing possible bandwidth contention. Managing videoconferencing assets
resulting from live events becomes important. This scenario can be viewed as
an overlap into the area of VidMid, Video-on-Demand working group.
Description
A security camera monitors the high security area of the university's operations
center that also manages the enterprise certificate authority and PK infrastructure.
Security cameras provide live video to monitoring stations and a streaming capability
ensures that any suspicious events can be reviewed immediately. A two-way videoconferencing
capability provides additional authentication and authorization security.
Example
A university's operations center that also manages the enterprise certificate
authority and PK infrastructure requires 24x7 monitoring of all access to the
operations room. Several security cameras provide live video to several monitoring
stations. A streaming capability ensures that any suspicious events can be reviewed
immediately, without interrupting the live video. Individual cameras may be
controlled by authorized operators. Additionally, the university's CIO has the
capability of back stream videoconferencing so that two-way real-time visual
authentication can occur between CIO and the monitoring station or operations
personnel.
Within the live-access situation there are several components. Initially a user (security personnel) needs to be authenticated and authorized to access a (set of) camera(s) and any associated alert systems. Once alerted, or on-demand/browsing, the user then needs to access a particular camera (implying a choice of video source), jump between cameras (tracking situation) and also drive the cameras that support pan-tilt-zoom (PTZ) operations. Certain actions require that visually verified authorization be obtained. A videoconferencing capability exists between the CIO's office and the personnel in operations and/or the monitoring stations.
Generic format (optional)
Needed components
Assumptions
Important to have:
Uniqueness
See Needed Components. This scenario adds highest levels of security
and encryption, need for full interactivity, and fast response requirements.
This scenario is complex but serves to illustrate a desirable end goal. This
scenario can be viewed as an overlap into the area of the VidMid Video-on-Demand
working group.

Figure 1: SIP scenario
Description
Bob invites Alice for a videoconferencing session to finish a paper they are
jointly co-authoring (see Fig. 1). Bob and Alice start discussing the last meetings
action items. After 30 minutes, Alice remembers that she has to pick up her daughter
from school, so she invites her own cell-phone to this conference, and talks
to Bob over the cell-phone for the next 20 minutes. After that, she enters her
home, and transfers the session to her home PC. They further discuss the rest
of the paper.
Example
Bob@cgu.edu invites Alice@unc.edu to a videoconferencing session to work jointly
on a paper. Alice has several ideas to discuss with Bob, however, she knows
that she will have to pick her daughter up after about 30 minutes into the call.
She simply informs Bob that she is switching to her cell-phone (audio only)
and they continue to discuss while Alice is driving to the elementary school.
After sometime, Alice enters her home, informs Bob that she is putting him on
hold, redirects the call to her home PC (that has DSL access) and terminates
the cell-phone conversation. They continue to use video/audio for the rest of
the discussions.
Generic format (optional)
Two individuals Bob and Alice use a SIP User Agent (either a soft-videophone
or a hard videophone) to establish a video-conferencing session. Both are registered
to their local domain SIP registrar that maintains bindings of their email type
SIP URI and their present IP addresses. Bob can input Alice's SIP URI from three
places: i) type the address, ii), lookup from the User Agent personal address
book or iii) lookup an campus-wide directory server (using LDAP). This campus-wide
directory server contains names and their SIP URI bindings for people who can
be called using voice or video-over IP. The LDAP directory may also contain
domain names and IP addresses of various infrastructure servers like the local
SIP proxy server. When Alice@unc.edu accepts the call, SDP negotiations agree
upon a video codec to be used for the call and both parties connect using media/RTP/UDP/IP.
While the call is in progress, Alice initiates another call from her soft-videophone
to bring in her own cell-phone into the call. The session only requests an audio
feed. An MCU mixes all three users (Bob, Alice, Alice's cell-phone) appropriately
for a brief time until Alice terminates her desktop session. Now Alice on her
cell-phone and Bob on his desktop switch back to a point-to-point voice only
session. Obviously, the IP packets are routed through gateways provided by a
third party (Level3 for example). When Alice reaches her home, she initiates
another video call to connect to Bob and they continue the session. The cell-phone
end-point is terminated.
Needed components
Assumptions
This is an ad hoc situation where one user moves across end-user device and
media type as call progresses. SIP and its registration service handles most
of the end-user mobility part. Alice's end-point should have a rich user interface
via which she can put one call on hold, make another call and transfer or redirect
as she pleases.
Uniqueness
This scenario presents a sophisticated mobility environment which is quite typical
in our daily lives. It also brings out some of the power of SIP signaling. Managing
mobility, managing media switching, phase-in and phase-out of MCU management,
whether to use MCU in this case or maintain simultaneous streams, discovery
of gateways, MCUs and proxies all become important.
Extended Scenario
Integrating the above scenario with a presence server. Presence server (similar
to instant messaging) lets end-user know the status of a person logged on to
the net. The person is there willing to accept calls or simply busy now and
will reject any calls. The presence information can be displayed right on the
soft-videophone user interface or can be looked up from a presence-enabled directory
service. This gets into several exciting areas of integrating directory servers
with presence servers.