Subject: Re: [GLIF controlplane] RE: Network Control Architecture
From: Gigi Karmous-Edwards <gigi@xxxxxxxx>
Date: Sun, 06 May 2007 08:17:44 -0400
Hi Jerry and All,
Ok Jerry, I stuck with you through your insightful email (I started it a
couple of weeks ago and just finished it this morning :-) ). If I can
summarize your assertions: when an inter-domain lightpath is requested,
the resource broker (RB), which is a servant of a user rather than of a
domain, talks only to the first domain's NRM (network resource manager);
that NRM then talks to the second NRM, and so on until the destination.
This requires each domain to have established some sort of agreement with
all adjacent domains. In your second scenario, it seems the user requests
a source RM that is not in the RB's "domain", so the RB has to forward
the request to the right RM, and then the above process repeats.
I think what you describe is the ultimate goal of the community. However,
due to the complexities of the current infrastructures that need to
interoperate (NRENs, research testbeds, global government networks, etc.),
it seems we first need to take small "baby steps". Existing infrastructures
include a variety of technologies and different management (TL1, SNMP, CLI,
etc.) and control-plane (very few deployments of GMPLS) tools for
configuration and fault management, and current procedures for information
exchange between network domains range from protocols to phone calls and
emails. These complexities, along with other policy-related challenges,
force us to break the problem up into smaller functional blocks. I think
the framework presented gives us a path forward, based on "baby steps", to
finally reach the scenario you describe.
I see the problem as having three key challenges:
1) Information dissemination (where is what resource? what are its
characteristics? what are its policies for use?)
2) The capability to request reservations on resources globally once they
are discovered (standard interfaces to query resource managers, with no
restrictions on how each resource manager accommodates each request, and
reuse of existing implementations)
3) Scalability (division of labor among functional components and
responsibilities per domain)
The assumption in the framework sent out has been that an RB takes
requests from a particular domain's users/applications but behaves as a
servant of the domain, not of a single user. In this case there will be
several RBs worldwide, but not one per user; rather, one or two per
domain. It is assumed that knowledge of the different resources worldwide
will be published per domain in a very distributed fashion (each RB will
publish the resources and their characteristics, hopefully using the
schema from the OGF Network Markup Language working group). A query from
one RB to the "distributed GLIF resources" will use a type of crawl
mechanism to match the requested resources against the "published"
resource information that each domain RB publishes on behalf of its RMs.
The assumption is that the information published by the RBs is not static
and will be updated by each RB when necessary.
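To make the publish-and-crawl idea a little more concrete, here is a
minimal sketch in Python of how a querying RB might walk the per-domain
published resource records and match them against a request. All class
and field names are placeholders, since neither the NML schema nor the
query format is fixed yet:

    # Illustrative only: record and request fields are placeholders, not a schema.
    from dataclasses import dataclass

    @dataclass
    class ResourceRecord:
        domain: str            # domain that published this record
        resource_id: str       # e.g. a lightpath endpoint or switch port
        capacity_gbps: float
        policy: str            # free-form policy tag for now

    @dataclass
    class ResourceRequest:
        min_capacity_gbps: float
        allowed_policies: set

    def crawl_and_match(request, published_by_domain):
        """Walk every domain's published records (the 'crawl') and collect matches."""
        matches = []
        for domain, records in published_by_domain.items():
            for rec in records:
                if (rec.capacity_gbps >= request.min_capacity_gbps
                        and rec.policy in request.allowed_policies):
                    matches.append(rec)
        return matches

The point of the sketch is only that the records are published per domain
and refreshed by each RB, so the querying side never needs a single,
centrally maintained global database.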
This email is already getting too long, so I suggest that we have a
conference call and use a web-based slide-sharing application to go
through some scenarios. Any interest?
To summarize, the strategy in your email will be the goal of the
community, but it will take a while to get there. In the meantime, I
think we as a community can start to develop standard interfaces for the
various RMs, such as the Generic Network Interface (GNI); this will help
us toward interoperability in today's environment.
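Since the GNI itself is still to be defined, the following is only a
strawman sketch of the kind of standard RM-facing interface I have in
mind: a small, fixed set of operations that every RM exposes, with no
restrictions on how the RM satisfies them internally. The operation names
and signatures are placeholders, not a proposal:

    # Strawman only: names and signatures are placeholders for a future GNI.
    from abc import ABC, abstractmethod

    class GenericNetworkInterface(ABC):
        """Minimal RM-facing interface; each domain implements it over its own
        management tools (TL1, SNMP, CLI, GMPLS, ...)."""

        @abstractmethod
        def query(self, src, dst, capacity_gbps, start, end):
            """Return candidate offers without committing anything."""

        @abstractmethod
        def hold(self, offer_id, hold_seconds):
            """Tentatively reserve an offer; return a ticket."""

        @abstractmethod
        def confirm(self, ticket):
            """Commit a previously held reservation."""

        @abstractmethod
        def release(self, ticket):
            """Explicitly release a held or confirmed reservation."""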
Please let me know whether we should have a GLIF control plane conference
call in the next few weeks.
Kind regards,
Gigi
--------------------------------------------
Gigi Karmous-Edwards
Principal Scientist
Advanced Technology Group
http://www.mcnc.org
MCNC
RTP, NC, USA
+1 919-248-4121
gigi@xxxxxxxx
--------------------------------------------
Jerry Sobieski wrote:
Good comments from both Steve and Bert... let me chime in (this is a bit
long, but I think it is relevant):
I too think the reservation phase in each domain must be atomic - there
are effective ways to do this. The overall process, though, becomes
two-phase: HOLD a resource for some finite holding time and provide an
ACK to the requester. At some later time the RM will receive a CONFIRM
from the requester, or a RELEASE. If the hold time expires, the resource
is released unilaterally. On a macro basis, the reservation of the entire
end-to-end lightpath must also be kept in the HOLD state while the rest
of the application resources are reserved, as there may be a dependency
between the availability of non-network resources and the reserved
lightpath.
As Steve suggests, this atomic two-phase mechanism is used in many other
similar reservation systems.
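As a sketch of that two-phase behaviour on the RM side - hold with a
finite timer, then confirm or release - here is a small Python
illustration; all class and method names are invented for the example:

    # Illustrative two-phase reservation on the RM side; names are invented.
    import threading, uuid

    class ReservationManager:
        def __init__(self):
            self.held = {}          # ticket -> resource
            self.confirmed = {}
            self.lock = threading.Lock()

        def hold(self, resource, hold_seconds):
            """Phase 1: atomically hold the resource and ACK with a ticket."""
            with self.lock:
                ticket = str(uuid.uuid4())
                self.held[ticket] = resource
                # If no CONFIRM arrives in time, release unilaterally.
                threading.Timer(hold_seconds, self._expire, args=[ticket]).start()
                return ticket       # the ACK carries the ticket

        def confirm(self, ticket):
            """Phase 2: lock in the reservation (fails if the hold expired)."""
            with self.lock:
                self.confirmed[ticket] = self.held.pop(ticket)

        def release(self, ticket):
            with self.lock:
                self.held.pop(ticket, None)

        def _expire(self, ticket):
            with self.lock:
                self.held.pop(ticket, None)   # unilateral release on timeout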
The issue I am concerned about is the roles of the RB and RM. I think the
RBs will be numerous - possibly one for every user. I believe we must
assume that all networks will default to a stringent "self-secure" stance
and will only allow access to their RMs from known and trusted peers. It
doesn't scale for every network to "know" about every other RB in the
world (RBs are agents of the user, not of the network). Therefore, for
scalability and security reasons, these resource reservation requests
must be made between directly peering networks, and each network is
responsible for recursively reserving the resources forward toward the
destination. This is still a two-phase commit as described above, but it
solves two problems: a) it scales much better, as each network only needs
to expect queries from its direct peers (and customers), and b) it allows
each network to negotiate aggregation policies with its peers for
services (enabling economies of scale and global reach). This is not
unlike how we place a phone call to anywhere in the world - we don't go
asking each network if we can use it; we ask our service provider to do
so, they ask theirs, and so on, and so on...
The above scenario assumes the RB poses the service request to the RM
serving the source end of a path. There is a [common?] case where the RB
is not at the endpoint(s) and does not know of any RMs at the endpoints
(or in the middle, for that matter). This brings us to another assumption
I think we must make: an RB only knows its *local* network RM. An
appropriately designed algorithm should/could forward the request to the
source-address RM using the same forwarding process as the reservation
(but crossgrain, toward the source), and then the request can be serviced
forward normally as described above. (This is the "third party"
provisioning scenario.) An alternative model assumes a "minion" agent at
the path endpoints that is owned by the end user and knows of its local
RM; the minion agent acts as a proxy for the RB and makes the reservation
request to the minion's RM. (Got that? :-) I think we *can* assume that
the RB knows of these minions, since they reside at the endpoints (source
or destination) at a well-known port.
It is important to note that this process relies on each network RM (not
the RB) knowing the constrained reachability of all endpoints - not
unlike current inter-domain routing protocols. This allows the RM to
postulate which "nexthop" network will provide the best path and try that
first. If the RM knows more than just reachability - i.e. if it knows
topology - then the RM can select a more specific candidate path and, via
authorized recursive queries, can reserve the resources. Only the RM
responsible for a network knows the state and availability details
associated with the internal network resources, and therefore only the
local RM can authoritatively and atomically reserve the resources in that
network.
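By way of analogy only (this is not a proposal for a concrete format),
the constrained reachability at each RM could look like an inter-domain
routing table mapping endpoint prefixes to ordered candidate next-hop
networks:

    # Hypothetical per-RM reachability table: endpoint prefix -> candidate peers.
    import fnmatch

    REACHABILITY = {
        "domainA.*": ["peer-net-1", "peer-net-2"],   # try peer-net-1 first
        "domainB.*": ["peer-net-2"],
    }

    def candidate_next_hops(dst):
        """Return candidate next-hop networks for a destination, best guess first."""
        for prefix, peers in REACHABILITY.items():
            if fnmatch.fnmatch(dst, prefix):
                return peers
        return []   # destination unreachable from this RM's point of view

An RM that also knows topology could replace the ordered guess with an
explicit candidate path, as noted above.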
The beauty of this process is that, from the RB's perspective, the RB
need only ask one RM for the entire end-to-end network path. The RM will
either return a ticket indicating that a path meeting the requested
service characteristics was successfully reserved, or a NACK indicating
that the resource was not available for some reason. The user must change
the requested service parameters somehow before trying again - e.g.
change the source or destination address, the start time, the capacity,
etc.
As Gigi states, once all application resources are reserved in the HOLD
state, they must all be CONFIRM'ed, which locks in the reservation.
At some delta-t later (which could be 0) there is a separate process that
causes the reconfiguration of the network elements to make the reserved
resources available for actual use (i.e. the provisioning or signaling
process). This process must be correlated with a previous reservation,
and so the provisioning request (separate from the reservation request)
must contain some indicator that is trusted by the network and indicates
which reservation is being placed into service (see Leon's work on AAA).
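Without presuming anything about Leon's AAA work, one generic way to
picture such a trusted indicator is a signed ticket that the network can
verify before placing the reservation into service; the key handling and
field names below are purely illustrative:

    # Generic illustration of a verifiable reservation ticket; not tied to any AAA scheme.
    import hmac, hashlib, json

    SHARED_KEY = b"per-domain secret"   # placeholder; real key management is out of scope

    def issue_ticket(reservation_id, start_time, end_time):
        payload = json.dumps({"rsv": reservation_id, "start": start_time, "end": end_time})
        sig = hmac.new(SHARED_KEY, payload.encode(), hashlib.sha256).hexdigest()
        return {"payload": payload, "sig": sig}

    def verify_ticket(ticket):
        """The network checks the signature before provisioning the reservation."""
        expected = hmac.new(SHARED_KEY, ticket["payload"].encode(), hashlib.sha256).hexdigest()
        return hmac.compare_digest(expected, ticket["sig"])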
Note that none of the above is predicated on any particular routing or
signaling protocol... That being said (:-), DRAGON has implemented much
of this functionality using GMPLS protocols.
- The DRAGON Network Aware Resource Broker (NARB) is analogous to the
network RM: it performs the path computation, recursively reserving the
resources along the way, and returns a path reservation in the form of an
Explicit Route Object (ERO) to the source requester. This loose-hop ERO
specifies a path consisting of ingress and egress points at each network
boundary (see the sketch after this list).
- RSVP then uses this ERO to provision the multi-domain end-to-end path.
- The DRAGON Application Specific Topology (AST) "Master" is an agent
analogous to the RB mentioned above. The AST Master queries all the
various resource managers (compute nodes, storage, instruments, network,
etc.) to reserve groups of dependent resources. There is a significant
protocol exchange defined for ASTs to construct a workable physical
resource grid for the application.
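For readers less familiar with EROs, a loose-hop ERO of the kind NARB
returns can be pictured roughly as the structure below. This is an
illustrative simplification, not DRAGON's or RSVP-TE's actual encoding,
and the hop names are made up:

    # Illustrative simplification of a loose-hop Explicit Route Object (ERO).
    from dataclasses import dataclass
    from typing import List

    @dataclass
    class EroHop:
        address: str   # ingress or egress point at a network boundary
        loose: bool    # True: the transit network picks the detailed path to this hop

    @dataclass
    class ExplicitRoute:
        hops: List[EroHop]

    inter_domain_path = ExplicitRoute(hops=[
        EroHop("netA-egress", loose=True),
        EroHop("netB-ingress", loose=True),
        EroHop("netB-egress", loose=True),
        EroHop("netC-ingress", loose=True),
    ])
    # RSVP expands each loose hop into a concrete intra-domain path at signaling time.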
What DRAGON has not yet implemented: we have implemented scheduling and
policy constraints in the traffic engineering database, but we have not
yet implemented the path computation that uses those constraints (this
will be coming soon). We have atomic reservations, but have not
implemented the two-phase commit, though we have long recognized it as
critical to the book-ahead capability and to a robust, integrated
resource scheduling process.
Thanks for sticking with me on this... :-)
Jerry