Virtual tunnel port support

https://blueprints.launchpad.net/dragonflow/+spec/virtual-tunnel-port-support

The virtual tunnel port leaves the remote_ip field of the tunnel port set to flow, so that OpenFlow rules can designate the remote_ip for each packet sent through the tunnel port. With this, every node only needs to create a few tunnel ports for tunnel connections.

Problem Description

Currently, a Dragonflow node creates a tunnel port for every other Dragonflow node in the OpenStack cloud. This causes several problems in real use cases.

  • Every tunnel port occupies one ofport in the bridge. Since Dragonflow doesn’t have a dedicated bridge for tunnel ports (like br-tun in neutron), the more Dragonflow nodes there are in the cloud, the fewer ofports are available for other uses. Meanwhile, a large number of tunnel ports increases the burden on the OpenVSwitch DB.
  • It is hard to maintain the tunnel ports on a Dragonflow node. When a Dragonflow node changes its IP address, the other Dragonflow nodes in the same cloud need to update their tunnel ports: each of them has to query the tunnel port from the OpenVSwitch DB, delete it, and then create a new one. This is not supported yet, and is being tracked in [1].
[1] https://review.openstack.org/#/c/365077/
  • Dragonflow only supports one type of tunnel underlay network at a time. Multiple underlay tunnel types could be supported, but that would aggravate the problems described above.
  • Since only one type of tunnel underlay network is supported at a time, multiple overlay network types, for example GRE, VXLAN, and Geneve, share the same underlay network. This can cause problems: the protocol overhead of GRE is 22 bytes, while the protocol overhead of Geneve is 30 bytes, so users might see unexpected results when using Geneve as the underlay network and GRE as the overlay network. Besides, Dragonflow only uses the segmentation ID to distinguish packets arriving from different networks on other Dragonflow nodes. The segmentation ID is assigned by neutron, and different overlay tunnel networks might have duplicate segmentation IDs, which causes a match problem. For example, if a GRE neutron network has segmentation ID 100 and a Geneve neutron network also has segmentation ID 100, a Dragonflow node receiving packets from these two networks cannot tell which came from the GRE network and which from the Geneve network.

Proposed Change

Create one virtual tunnel port for each supported tunnel type. Then, no matter how many Dragonflow nodes there are in the OpenStack cloud, each node will only need to create and maintain a handful of tunnel ports. These tunnel ports will be created at the Dragonflow controller’s first startup.

For example, the tunnel port will look like:

$ sudo ovs-vsctl show
Bridge br-int
    Controller "tcp:127.0.0.1:6653"
    fail_mode: secure
    Port gre-tunnel-port
        Interface gre-tunnel-port
            type: gre
            options: {key=flow, local_ip="192.168.31.91", remote_ip=flow}
    Port br-int
        Interface br-int
            type: internal
ovs_version: "2.5.0"
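
Such a virtual tunnel port can be created with a command along these lines (the port name and local IP are illustrative):

$ sudo ovs-vsctl add-port br-int gre-tunnel-port -- \
    set Interface gre-tunnel-port type=gre \
    options:key=flow options:local_ip=192.168.31.91 options:remote_ip=flow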

The supported tunnel types can be configured through the configuration file. If a new tunnel type is added, the corresponding tunnel port will be created when the administrator restarts the Dragonflow controller. If a tunnel type is removed from the supported tunnel types, its tunnel port will be deleted when the Dragonflow controller restarts.
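
For example, assuming the options live in a [df] section of the Dragonflow configuration file (the section name is illustrative; the option names are defined in the Work Items below), the configuration might look like:

[df]
enable_virtual_tunnel_port = true
tunnel_types = gre,vxlan,geneve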

The ofport of each tunnel type will be recorded as a global variable across the lifecycle of the Dragonflow controller. If there are any changes to the tunnel ports, the OpenVSwitch DB monitor will update the recorded ofport of each tunnel type.
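
For reference, the ofport of a virtual tunnel port can be read from the OpenVSwitch DB like this (port name and value illustrative):

$ sudo ovs-vsctl get Interface gre-tunnel-port ofport
2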

The tunnel_type field of the chassis will be changed from a string to a list of supported tunnel types.

The chassis will be added to the local cache, so that a remote port can quickly look up the tunnel types and IP address of its chassis.
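
For illustration, a cached chassis record might look like the following (the field names are assumptions based on the chassis fields described above):

{"id": "node-1", "ip": "192.168.31.92", "tunnel_types": ["gre", "vxlan", "geneve"]}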

A remote port might not be in the current OpenStack cloud, but in another OpenStack cloud that connects to the current one. The old implementation of remote port creates a tunnel port for each remote chassis and maintains the relationship between remote ports and remote chassis in the cache of the Dragonflow controller; if no remote port is left on a remote chassis, the tunnel port for that chassis is deleted. With the virtual tunnel port, there is no need to maintain a tunnel port per remote chassis. When a remote port is added, the IP address and tunnel type of the remote chassis will be recorded in its binding_profile; the tunnel type value will be the type of the network of the remote port. If a network packet needs to go to the remote port, it will go through the virtual tunnel port instead of a chassis-specific tunnel port, and the OpenFlow rules will designate the destination IP address according to the remote port’s binding_profile. When the remote port is deleted, only the related OpenFlow rules need to be deleted.
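
As a sketch, the binding_profile of such a remote port might then carry entries like these (the key names are hypothetical):

binding_profile: {"host_ip": "192.168.31.92", "tunnel_type": "gre"}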

The chassis update event from the northbound DB will be notified to Dragonflow applications, so that the OpenFlow rules can be updated when a chassis changes, for example, its IP address.
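
For example, if a chassis IP address changes from 192.168.31.92 (0xc0a81f5c) to 192.168.31.93 (0xc0a81f5d), each egress flow pointing at that chassis (see the egress flow format below) only needs its tunnel destination load action rewritten:

actions=load:T->NXM_NX_TUN_ID[],load:0xc0a81f5c->NXM_NX_TUN_IPV4_DST[],output:O   (before)
actions=load:T->NXM_NX_TUN_ID[],load:0xc0a81f5d->NXM_NX_TUN_IPV4_DST[],output:O   (after)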

In the egress table, flows will be added based on the type of the network that the port belongs to and the IP address of the remote port’s chassis.

In the ingress classification dispatch table, incoming packets will be matched not only on the tunnel ID but also on the in_port. A different in_port means a different tunnel type, so the network type can be matched together with the network segmentation ID.

Installed flows

The following flow is installed in the ingress classification dispatch table for each tunnel network:

match=tun_id:T,in_port:I actions=load:N->OXM_OF_METADATA[],resubmit(,INGRESS_DESTINATION_PORT_LOOKUP_TABLE)

Where T is the segmentation ID of the network, I is the ofport of the virtual tunnel port of the network’s type, and N is the local network ID of the network in the current Dragonflow controller.
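
For example, for a GRE network with segmentation ID 100, a GRE virtual tunnel port on ofport 2, and local network ID 1 (all values illustrative), the installed flow would be:

match=tun_id:100,in_port:2 actions=load:0x1->OXM_OF_METADATA[],resubmit(,INGRESS_DESTINATION_PORT_LOOKUP_TABLE)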

The following flow is installed in the egress table for each remote port:

match=reg7:R actions=load:T->NXM_NX_TUN_ID[],load:D->NXM_NX_TUN_IPV4_DST[],output:O

Where R is the tunnel key of the logical port in Dragonflow, T is the segmentation ID of the network, D is the IP address of the destination chassis, and O is the ofport of the virtual tunnel port of the network’s type.
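
For example, for a remote port with tunnel key 0x12, on a chassis with IP address 192.168.31.92 (0xc0a81f5c), in a network with segmentation ID 100, reachable through the virtual tunnel port on ofport 2 (all values illustrative), the installed flow would be:

match=reg7:0x12 actions=load:100->NXM_NX_TUN_ID[],load:0xc0a81f5c->NXM_NX_TUN_IPV4_DST[],output:2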

Implementation

Assignee(s)

Primary assignee:
xiaohhui

Work Items

  1. Add two configuration options. One is enable_virtual_tunnel_port; its default value will be false for backward compatibility. The other is tunnel_types, which is a list option. When enable_virtual_tunnel_port is true, a virtual tunnel port will be created for each tunnel type in tunnel_types.
  2. Add chassis to the local cache.
  3. Add flows based on virtual tunnel port, chassis and network type.
  4. Handle the chassis update event.
  5. Remove the enable_virtual_tunnel_port and tunnel_type configuration options, and remove all code for the current implementation of tunnel ports.