(Note this paper uses a lot of color images for explanation and examples. Color printout strongly recommended. home URL: https://www.caida.org/publications/papers/1999/manta/)
MantaRay and Otter are part of the fleet of tools from the Cooperative Association for Internet Data Analysis (CAIDA) for visualizing various sets of Internet data. Both tools are Java-based and allow users to select parts of the data, focus or zoom in on particular areas, color by various parameters, and generally explore multicast topologies.
The tunneled MBONE works by connecting individual multicast-capable subnetworks with these logical tunnels whose endpoints encapsulate multicast packets into normal IP packets and send them across the unicast path between the tunnel endpoints. To non-multicast routers this traffic appears as normal IP traffic, while multicast routers treat it as IP encapsulated in IP, rather than the typical TCP or UDP encapsulated in IP. These multicast routers recognize such packets as special and process them accordingly: stripping the extra IP header to retrieve the multicast group destination, and consulting the multicast routing table to determine where to forward the packet, whether across native multicast links or across a tunnel to another multicast-capable router. Tunnel endpoints are either routers that support DVMRP or Unix workstations or workstation-based routers that have IP multicast support and run the multicast routing daemon mrouted. Currently the widest use of the MBONE is for real-time video and audio streams.
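To make the decapsulation step concrete, here is a minimal Python sketch (our own illustration, not mrouted's actual code) that checks the outer IPv4 protocol field for IP-in-IP (protocol number 4), strips the outer header, and recovers the inner multicast group destination.

```python
import socket

IPPROTO_IPIP = 4  # protocol number used for IP-in-IP encapsulation

def decapsulate_if_tunneled(packet: bytes):
    """Conceptual sketch: if an IPv4 packet carries IP-in-IP, strip the
    outer header and return the inner (multicast) destination address."""
    ihl = (packet[0] & 0x0F) * 4      # outer IPv4 header length in bytes
    protocol = packet[9]              # protocol field of the outer header
    if protocol != IPPROTO_IPIP:
        return None                   # ordinary unicast or other traffic
    inner = packet[ihl:]              # the encapsulated IP datagram
    inner_dst = socket.inet_ntoa(inner[16:20])
    # Multicast group addresses lie in 224.0.0.0/4 (first octet 224-239).
    if 224 <= inner[16] <= 239:
        return inner_dst              # consult the multicast routing table next
    return None
```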
The next section offers a brief introduction to the multicast routing protocols that support the MBONE. We then describe the multicast data sets we analyzed, and aspects of the MBONE for which visualization could provide insight. Finally, we present an overview of Manta and Otter, the tools we used to visualize the multicast data, and provide example snapshots of various components of the multicast infrastructure.
There are two multicast routing protocols in common usage: DVMRP (Distance Vector Multicast Routing Protocol) and PIM (Protocol Independent Multicast). DVMRP originated with Steve Deering [DEER] and its canonical Internet implementation, mrouted, is currently maintained by Bill Fenner of Xerox PARC. PIM was defined by RFC 2117 [PIM] and implemented in Cisco routers by Dino Farinacci. A new multicast paradigm under recent development maps a multicast group to a single fixed source and a group of receivers [EXPR]. This model may be able to accommodate the requirements of many multicast applications, but is not yet implemented or deployed anywhere.
DVMRP infrastructure comprised the vast majority of the MBONE until routers began to support native multicast via PIM, which is what new participants tend to deploy as they add multicast support to their network. In addition, administrators of pieces of the DVMRP infrastructure are independently and gradually transitioning to PIM. We are now even seeing PIM clouds interconnect, similar to unicast exchange points, using the MBGP [MBGP] and MSDP [MSDP] protocols for inter-domain routing support (MBGP to discover topology and MSDP to help construct the inter-domain multicast distribution tree.) One important visualization possibility is to depict the relative prominence of various multicast routing protocols in the current Internet infrastructure (including which versions of each since significant functionality changes are not rare between versions).
The mwatch database was too large to gracefully use for testing while developing these visualization tools, so we built the crawl tool to produce smaller data sets. crawl starts with a set of root nodes (hostnames), a depth level, and then calls mrinfo on those root nodes. It then collects a list of all the hostnames they are connected to and repeats the process until it has reached the specified depth from these root nodes. Basically, crawl is a modified version of mwatch that can limit the depth of the mrinfo recursion to produce smaller, focused data sets.
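A minimal Python sketch of the idea behind crawl, assuming mrinfo is on the path and that neighbor lines contain "->"; the parsing and invocation details are illustrative, not crawl's actual implementation:

```python
import subprocess

def mrinfo_neighbors(host):
    """Run mrinfo against a host and return its neighbor addresses.
    (Hypothetical parsing: assumes lines of the form 'a.b.c.d -> w.x.y.z (name) [...]'.)"""
    out = subprocess.run(["mrinfo", host], capture_output=True, text=True).stdout
    neighbors = []
    for line in out.splitlines():
        if "->" in line:
            neighbors.append(line.split("->", 1)[1].strip().split()[0])
    return neighbors

def crawl(roots, max_depth):
    """Breadth-first traversal of the multicast topology, limited to
    max_depth hops from the given root nodes."""
    frontier, seen, links = list(roots), set(roots), []
    for _ in range(max_depth):
        next_frontier = []
        for node in frontier:
            for nbr in mrinfo_neighbors(node):
                links.append((node, nbr))
                if nbr not in seen:
                    seen.add(nbr)
                    next_frontier.append(nbr)
        frontier = next_frontier
    return seen, links
```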
As of August 1998, the dataset generated by mwatch is about 4.5 MB; the execution time to traverse the entire MBONE topology has increased from about 12 hours when mwatch was first run regularly at the University of Cambridge to over 24 hours. Our utility, crawl, generates a file of approximately 3.5 MB in 13 hours when run across the same scope of topology. The difference appears to derive from mwatch's inclusion of information from previous passes, while crawl retains only information collected in the current pass. The designers of mwatch might have included this memory feature to account for machines that temporarily disappear due to inclement network/host/configuration weather.
A sample of the mrinfo output follows:
    198.17.46.39 (mbone.sdsc.edu) [version 3.255,prune,genid,mtrace]:
      198.17.47.39 -> 0.0.0.0 (local) [1/1/disabled]
      198.17.46.39 -> 198.17.46.56 (pinot.sdsc.edu) [1/1]
      198.17.46.39 -> 198.17.46.43 (cs-f-vbns.sdsc.edu) [1/1]
      198.17.46.39 -> 198.17.46.10 (medusa.sdsc.edu) [1/1]
      198.17.46.39 -> 198.17.46.205 (bigmama.sdsc.edu) [1/1]
      198.17.46.39 -> 138.18.5.224 (nccosc-mbone.dren.net) [1/32/tunnel]
      198.17.46.39 -> 192.215.245.10 (mbone.cerf.net) [5/32/tunnel]
      198.17.46.39 -> 198.202.127.101 (dasher.doct.sdsc.edu) [1/32/tunnel/down/leaf]
      198.17.46.39 -> 198.202.127.22 (itlhp1.doct.sdsc.edu) [1/32/tunnel/down/leaf]
      198.17.46.39 -> 132.249.30.13 (avatar.sdsc.edu) [1/1/tunnel/down/leaf]
      198.17.46.39 -> 132.249.30.6 (mithrandir.sdsc.edu) [1/1/tunnel/down/leaf]
The first line of mrinfo output shows that the host queried is mbone.sdsc.edu and it is running version 3.255 of mrouted with mtrace support. Each additional line represents a connection to another mrouter with characteristics listed in brackets. The word "tunnel" in the attributes means that a multicast path exists between mbone.sdsc.edu and the named host via encapsulated IP. Lines with no "tunnel" attribute are native multicast links, in this case to other mrouters on the local network.
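As an illustration of the line format, a small Python helper (ours, not part of mrinfo) might split a neighbor line into source, destination, metric, threshold, and flags, following the [metric/threshold/flags] convention shown above:

```python
def parse_neighbor_line(line):
    """Parse a line like
       '198.17.46.39 -> 192.215.245.10 (mbone.cerf.net) [5/32/tunnel]'
    into its components.  Illustrative only."""
    left, right = line.split("->", 1)
    src = left.strip()
    rest = right.strip()
    dst = rest.split()[0]
    name = rest[rest.index("(") + 1 : rest.index(")")] if "(" in rest else None
    attrs = rest[rest.index("[") + 1 : rest.index("]")].split("/")
    metric, threshold = int(attrs[0]), int(attrs[1])
    flags = attrs[2:]                 # e.g. ['tunnel', 'down', 'leaf'] or []
    return {"src": src, "dst": dst, "name": name,
            "metric": metric, "threshold": threshold, "flags": flags}
```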
Querying the medusa.sdsc.edu mrouter, mrinfo returns:
    198.17.47.10 (medusa.sdsc.edu) [version 11.2,prune,mtrace]:
      132.249.30.10 -> 0.0.0.0 (local) [1/6/pim/querier/disabled/down/leaf]
      198.17.46.10 -> 198.17.46.43 (cs-f-vbns.sdsc.edu) [1/1/pim]
      198.17.46.10 -> 198.17.46.205 (bigmama.sdsc.edu) [1/1/pim]
      198.17.46.10 -> 198.17.46.56 (pinot.sdsc.edu) [1/1/pim]
      132.249.30.10 -> 132.249.30.6 (mithrandir.sdsc.edu) [1/6/pim]
      132.249.30.10 -> 132.249.30.7 (curunir.sdsc.edu) [1/6/pim]
      132.249.30.10 -> 132.249.30.11 (tigerfish.sdsc.edu) [1/6/pim]

This output shows that medusa is a Cisco router running IOS 11.2 with 6 neighbors. Note that medusa does not show its DVMRP neighbors on its local network; that is, the DVMRP Unix host mbone.sdsc.edu sees and reports its neighbor medusa, but medusa does not mention mbone.sdsc.edu. Cisco routers running PIM know that they have DVMRP neighbors, but do not know who they are (i.e., they do not retain neighbor state).
An mwatch or crawl traversal of the whole MBONE would never find mbone.sdsc.edu from medusa. Such a traversal is potentially incomplete since nodes can be down (skipped) and DVMRP tunnels can be hidden behind Cisco routers, both of which can create MBONE partitioning, black holes, and failed pruning situations. The last of these, failed pruning, can result in considerable extraneous copies of multicast traffic, imposing unnecessary load on the infrastructure.
Another data collection problem we have experienced is that mrinfo cannot access information on nodes behind firewalls that filter IGMP (Internet Group Management Protocol) packets. Fortunately not every site's firewall is so conservative, but it does pose an obstacle to obtaining a comprehensive picture.
Security for some sites also dictates that global routes are not advertised outside of their internal infrastructure. In such cases the mrinfo query may reach its destination mrouter, but may not be able to get a reply back for lack of a return path. Furthermore, misconfigured routers or incompatible software can also provide potentially inaccurate information, e.g., some routers run native multicast software that does not respond to mrinfo queries.
Keyword | Meaning
---|---
DVMRP | mrinfo did not label the link PIM
PIM | mrinfo did label the link PIM
tunnel | mrinfo did label the link as a tunnel
native | mrinfo did not label the link as a tunnel
unidirectional tunnel | mrinfo sees one end of a connection but not the other
/8 | first (left) octet of IP address is the same for both endpoints
/16 | first 2 octets of IP address are the same
/24 | first 3 octets are the same
/0 | no leading octets the same
down | mrinfo reports tunnel is down
up | mrinfo reports tunnel is not down
uni | mrinfo does not report either tunnel or down; we assume native multicast
metric | mrinfo-reported routing cost
threshold | mrinfo-reported topology scoping parameter
degree | number of links connected to this node
multicast version | numeric version of multicast software/OS
Both of our visualization tools, Manta and Otter, label the above semantic attributes of the database as generated by mwatch and crawl. The keywords above appear in the menus and color legends associated with various displays as seen in the next section.
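For example, the /8, /16, /24, and /0 keywords above depend only on how many leading octets the two endpoint addresses share; a minimal sketch of that classification (our own helper, not code from Manta or Otter):

```python
def octet_match(addr_a, addr_b):
    """Return '/24', '/16', '/8', or '/0' depending on how many leading
    octets of the two dotted-quad addresses are identical."""
    a, b = addr_a.split("."), addr_b.split(".")
    shared = 0
    for x, y in zip(a[:3], b[:3]):   # the keywords only distinguish up to /24
        if x != y:
            break
        shared += 1
    return "/{}".format(shared * 8)

# e.g. octet_match("198.17.46.39", "198.17.46.56") -> '/24'
#      octet_match("198.17.46.39", "138.18.5.224") -> '/0'
```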
We have extended Mapnet [MAPN] to visualize the Internet multicast topology. The resulting tool, called MantaRay, displays the geographical placement of MBONE infrastructure, using data from the mwatch utility. A second tool, Otter, displays topological views of the same multicast infrastructure. Otter is derived from Manta and from Plankton [PLANK], the latter tool originally developed to visualize the NLANR Squid cache hierarchy.
In both visualization tools we will describe, MantaRay and Otter, nodes represent multicast-capable hosts and routers. As mentioned earlier, links represent either native multicast peering or a tunnel between two multicast nodes through non-multicast-speaking nodes. The main difference is that MantaRay is limited to geographic depiction while Otter can support topological layout as well.
Color legend: yellow - tunnel, green - native, red - down
We pursued a variety of approaches and heuristics for finding geographical coordinates. Often the closest we could get was the InterNIC database record for the domain name that corresponded to the IP address. These records contain one geographical location for each entire domain (the `headquarters'), which may have no relation to the location of the particular IP address we need to locate on the map. If the IP address does not map to a domain name, one can attempt to map it directly to an autonomous system (AS) via a core BGP routing table and then use the InterNIC-registered headquarters location for that autonomous system. This method involves the same problematic assumption, which is particularly unacceptable for mapping nodes of national backbone service providers. Nonetheless we derived a basic image which we can then refine as we gain more accurate location data. Note: to help with such visualization studies, we encourage service providers to put geographical information into their DNS configuration as outlined in RFC 1876 [DNSL].
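The heuristic chain amounts to a simple fallback lookup. The sketch below uses hypothetical dictionaries (domain_hq, prefix_to_as, as_hq) standing in for the InterNIC records and a core BGP table; it is illustrative only, since the real lookups were made against external databases.

```python
def locate(ip, hostname, domain_hq, prefix_to_as, as_hq):
    """Return an approximate (lat, lon) for an mrouter, or None.
    domain_hq:    registered domain -> headquarters lat/lon (InterNIC-style)
    prefix_to_as: address prefix    -> origin AS (from a core BGP table)
    as_hq:        AS number         -> registered headquarters lat/lon"""
    # 1. Prefer the registered location of the node's domain name.
    if hostname:
        domain = ".".join(hostname.split(".")[-2:])   # e.g. 'sdsc.edu'
        if domain in domain_hq:
            return domain_hq[domain]
    # 2. Otherwise map the address to an origin AS and fall back to that
    #    AS's headquarters (same caveat: may be far from the router itself).
    prefix = ".".join(ip.split(".")[:2])              # crude /16 key for this sketch
    asn = prefix_to_as.get(prefix)
    if asn is not None:
        return as_hq.get(asn)
    return None
```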
Figure 2: The small window displays information for the white link.
Manta identifies nodes and links in the status text window on the lower right of the Java applet window. When the mouse moves over a link or node, this window displays the raw data corresponding to that object in the database, as shown in Figure 2.
Manta can zoom and focus on a particular geographic region. Figure 3 below shows the Washington DC area from Figure 2 at three levels of granularity: full size, zoomed in once (2X), and zoomed again (4X). Unfortunately, zooming in too far magnifies the background world map image so much that it becomes almost useless for geographical identification.
Washington DC is a rich area with respect to network connectivity, and there are many nodes and links in close proximity. Manta adjusts the positions of nodes if their geographical information would place them on top of each other. For multiple links between the same two cities, Manta draws distinct parallel lines, which, although distorting the geographic representation, provide a more compelling sense of the connectivity. The granularity of the geographic information is often at the level of a city rather than exact latitude/longitude, so there are also many placement collisions in larger cities, although the ISPs' points of presence are likely spread across the city.
Figure 3: Zoom and focus on the Washington DC area
Panels: full size (100%), zoomed once (50%), zoomed twice (25%).
Color legend: yellow - tunnel, green - native, red - down
Color legend: yellow - tunnel, green - native, red - down
A link with a node of degree X at one end and of degree Y at the other end is colored with the color corresponding to the larger of X and Y. Similarly for links with differing thresholds and metrics. A threshold value of 0 is used for native PIM links, since PIM does not use the concept of threshold for forwarding.
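In code form, the rule is simply to take the larger of the two endpoint values before choosing a color; a minimal sketch (helper names are ours):

```python
def link_color_value(link, node_degree):
    """A link inherits the larger of its two endpoints' degrees
    (the same rule applies to per-direction thresholds and metrics)."""
    a, b = link                        # link is a pair of node identifiers
    return max(node_degree[a], node_degree[b])
```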
There are two coloring modes: spectrum mode and binary mode. Spectrum mode sorts the values into buckets and assigns each bucket a color. Spectrum displays include a color legend. Figure 5 is an example of spectrum coloring.
Figure 5: Color by metric (spectrum):
In contrast to spectrum mode, which uses an array of colors, binary mode uses only two colors. Binary mode uses the truth value of a boolean statement to determine color: blue for false, yellow for true. The user specifies the desired boolean expression via a pop-up window. Objects with no value for the specified attribute are ignored. For example, we can color all links with metric less than 10 yellow (true) and all others blue (false). In binary mode, Manta can also hide (mask) all links for which the boolean value of the given statement is false.
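A compact sketch of both coloring modes, assuming each object carries a dictionary of attribute values; the bucket and color conventions follow the description above, but the helpers themselves are illustrative, not Manta's code:

```python
def spectrum_color(value, buckets, palette):
    """Spectrum mode: place the value into a bucket and return that
    bucket's color.  buckets are ascending upper bounds, one per color."""
    for bound, color in zip(buckets, palette):
        if value <= bound:
            return color
    return palette[-1]

def binary_color(obj, attr, predicate, mask=False):
    """Binary mode: yellow if the boolean statement is true, blue if false.
    Objects lacking the attribute are ignored; with mask=True, false
    objects are hidden instead of drawn blue."""
    if attr not in obj:
        return None                    # ignored
    if predicate(obj[attr]):
        return "yellow"
    return None if mask else "blue"

# e.g. color links with metric < 10 yellow and everything else blue:
#   binary_color(link_attrs, "metric", lambda m: m < 10)
```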
Figure 6: Threshold greater than 32
Yellow links with threshold > 32
Gray links with threshold <= 32
Figure 7: Only nodes/links with degree > 10
Figures 6 and 7 are examples of binary coloring.
Note that metrics and thresholds are quantities associated with directional links between nodes. Since we display links unidirectionally, we assign a 'true' color to the link if it meets the condition in both directions.
MantaRay's visualizations rely on geographical placement of the nodes, a serious drawback since the available latitude/longitude data is incomplete. In July 1998 we ran a complete mwatch traversal of the multicast backbone, producing a data file of approximately 10,000 nodes and 25,000 links. Only about 40% of those nodes were in our latitude/longitude database.
To overcome this limitation, we pursued methods for visualizing topology that would not rely on difficult-to-obtain geographic information. Using techniques developed for Plankton [PLANK], CAIDA's tool for visualizing the NLANR web caching hierarchy, we extended the software platform to support more general topological visualization tasks. We've called the resulting tool Otter. Much of the data in sample Otter graphs is from the output of crawl.
There are currently two different methods for arranging root nodes: (1) circular, which places root nodes on the circumference of a circle; and (2) semi-geographical, which places roots based on lat/long information. For semi-geographical placement, Otter finds the maximum and minimum latitude and longitude coordinates of the complete set of root nodes in the data file, and uses these values to determine the four corners of a rectangle, which Otter then scales to occupy half the viewing area. The upper left corner represents the highest latitude and longitude values; the lower right corner the lowest values.
Otter then places the root nodes within the rectangle in their corresponding semi-geographical location. Root nodes that do not have associated lat/long data are placed according to the algorithm used for non-root nodes, described in the next paragraph. Semi-geographical placement can cause clusters of nodes in metropolitan areas to overlap. For viewability, Otter separates these clusters so that nodes are at least the length of their diameter from each other.
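A minimal sketch of the semi-geographical scaling step described above; screen-coordinate conventions and margins are assumptions, not Otter's actual Java implementation:

```python
def place_roots_semi_geo(roots, latlon, width, height):
    """Scale root-node lat/lon into a rectangle occupying half the view.
    Highest latitude/longitude maps toward the upper left, lowest toward
    the lower right, as described in the text.  Roots without lat/lon are
    left for the non-root placement pass."""
    located = {r: latlon[r] for r in roots if r in latlon}
    if not located:
        return {}
    lats = [ll[0] for ll in located.values()]
    lons = [ll[1] for ll in located.values()]
    min_lat, max_lat = min(lats), max(lats)
    min_lon, max_lon = min(lons), max(lons)
    # a rectangle covering half the viewing area, centered in the window
    rect_w, rect_h = width / 2.0, height / 2.0
    x0, y0 = width / 4.0, height / 4.0
    positions = {}
    for r, (lat, lon) in located.items():
        fx = (max_lon - lon) / (max_lon - min_lon or 1.0)   # high longitude -> left
        fy = (max_lat - lat) / (max_lat - min_lat or 1.0)   # high latitude  -> top
        positions[r] = (x0 + fx * rect_w, y0 + fy * rect_h)
    return positions
```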
After placing the roots, Otter places non-root nodes in semi-circles around their parent node using a breadth-first scan from the roots and guided by the following heuristics:
Figure 9 below shows examples of both semi-geographical and topological layout for mrinfo data on BBNplanet's multicast infrastructure in July 1998.
Figure 9: Otter placement algorithm (bbnplanet.net nodes as reported by mrinfo, July 1998)
Panels: semi-geographical layout (left); topological layout (right).
In these Otter visualizations of multicast topology, red nodes are "roots" and represent the starting set for the crawl run that produced the data set. We typically select as root nodes known multicast routers of large ISP domains. White nodes are also multicast-capable routers, but not in the root set. Links represent Internet paths over which multicast traffic may flow, either as native multicast (with both endpoints participating) or as encapsulated tunneled traffic. Intermediate routers that do not understand multicast are not shown.
Otter has the same display focus, zoom, and manipulation capabilities illustrated for Manta earlier. One can often improve the layout of the autoplacement algorithms with manual adjustment; Figure 10 shows a particular layout before and after hand tuning. The root nodes are from the domain sdsc.edu, the San Diego Supercomputer Center. crawl generated the data using a depth parameter of three; Otter uses topological placement.
Figure 10: SDSC multicast infrastructure, before and after hand-tuned placement (July 1998)
Panels: before hand tuning (left); after hand tuning (right).
One can also label or color nodes by domain name, as shown in Figures 11a and 11b.
Figure 11a: SDSC labeled with host names (July 1998)
Figure 11b: SDSC colored by second level domain (July 1998)
Figures 12a-d illustrate how Otter uses color to display specific attributes of multicast infrastructure. In each case, the graph displayed is the SDSC multicast network drawn with the circular placement algorithm.
Coloring by tunnel status shows the distribution of DVMRP, PIM, and broken tunnels. DVMRP peers are labeled "tunnel", PIM peers "native", and others either "unidirectional", meaning a DVMRP host hidden by a Cisco router, or "down", meaning that one end of the tunnel is not configured correctly or the mrouter is actually not reachable or not running a multicast routing daemon.
Figure 12a: Color by tunnel status (sdsc.edu, July 1998)
Coloring by multicast version shows which version of mrouted or IOS each node runs. This coloring also provides some indication of the transition to native multicast: values such as 11.2 are Cisco IOS version numbers (which support native multicast via PIM), while 3.8 refers to mrouted and 3.255 to the DVMRP protocol.
Figure 12b: Color by multicast version (sdsc.edu, July 1998)
A tunnel metric represents the routing `cost' to send packets over that link. It is typically not used unless a host has multiple tunnels via providers and wants to weight them differently; most sites just set the value to 1. The metric can be used to provide redundancy via backup tunnels that take over if the primary tunnel with the lower metric is down. The values for metric and threshold are typically negotiated by the system administrators of the tunnel endpoints when they establish the tunnel.
Figure 12c: Color by tunnel metric (sdsc.edu, July 1998)
Coloring by threshold can yield insight into scoping across sites. Packets will not be forwarded over a link with threshold X unless they have a TTL (time to live) value greater than X. Many sites try to use the threshold parameter to limit, or administratively `scope', the propagation of their multicast sessions, e.g., internal video conferencing, so they do not escape to the global Internet.
Figure 12d: Color by tunnel threshold (sdsc.edu, July 1998)
The ability to display such multicast link parameters and network topology can help network engineers find configuration or design problems. By looking at tunnel status, multicast version, and multicast protocol over time, one can also gauge the rate of change toward native multicast in the infrastructure.
Otter has some limitations in visualizing large data sets: Java is still rather slow and suboptimally stable. However, Otter does support interactive manipulation of visual depictions of smaller topologies. One can use crawl to gather a restricted data set and then use Otter to visualize multicast topology, connectivity, and design.
ISPs might also want to use Otter to verify that their multicast configuration as seen by mrinfo is sensible, and does not contain loops or other quirks that might generate duplicate packets. In collecting this data, we noticed that parts of many ISPs' multicast infrastructures do not respond to our queries, and thus appear in Otter as disconnected or partitioned. There are a variety of possible reasons: firewalls blocking our probes, routing policies that prevent replies from returning, or misconfiguration.
We have run our crawl script on several national ISPs' multicast infrastructures, including that of MCI, for a case study we present below. Appendix A contains several other visualizations of various ISPs' multicast infrastructures.
A security-related configuration of MCI's backbone routers allowed mrinfo queries to reach MCI mrouters but did not allow responses from those mrouters to get back to our probe machines, since those routers had no route back to our probe host. After MCI added such a route, crawl was able to find, and Otter thus able to draw, the fully connected MCI multicast infrastructure. Figure 12 shows the different partitions that crawl initially found in MCI's multicast network.
We leveraged an Otter feature intended for a different purpose to determine the actual number of connected components. Specifically, Otter allows the user to specify which nodes to use as `root nodes' by entering a search string; all nodes whose names contain that string are then rendered as root nodes. If no matches occur, Otter defaults to randomly selecting a single node from each connected component of the graph as the set of root nodes. Typing in a nonsense string causes Otter's search to fail, so it uses the default algorithm instead. This mechanism found the disconnected components in the mci.net multicast infrastructure when visual inspection had failed to discern them. In the next version of Otter, identifying and labeling connected components will be a standard feature.
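Otter's fallback behavior amounts to computing connected components; a standard breadth-first sketch of what such a built-in feature might do (not Otter's actual Java code):

```python
from collections import deque

def connected_components(nodes, links):
    """Group nodes into connected components of the (undirected) topology."""
    adj = {n: set() for n in nodes}
    for a, b in links:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    seen, components = set(), []
    for start in adj:
        if start in seen:
            continue
        component, queue = [], deque([start])
        seen.add(start)
        while queue:
            n = queue.popleft()
            component.append(n)
            for neighbor in adj[n]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    queue.append(neighbor)
        components.append(component)
    return components

# Otter's fallback of picking one random root per component corresponds to
# choosing one node from each list returned here.
```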
Figure 13 shows the correct connected MCI multicast network after the routing information had been changed to allow answers to our queries to reach us, notably larger than that seen in Figure 12 before the routing change.
In response to MantaRay's reliance on often-unavailable geographic placement data, we developed Otter to support topological and semi-geographical placement algorithms, allowing the network engineer to view their infrastructure at a more abstract level. Note that this method can render the proximity of nodes in the graph misleading, since it bears no relation to physical distance.
A major limitation of both tools is the mwatch/mrinfo data itself. Firewalls and routing policies that block the mrinfo queries and/or answers leave holes in the data that we can collect. Thus any visualization of the multicast infrastructure based on this data collection method is just a snapshot of the mrouters reachable at a fixed point in time from our query machine.
Also, the two major multicast routing protocols still do not always interoperate well. A PIM-speaking router connected by a tunnel to a DVMRP mrouter directly attached to the same local network will not acknowledge to mrinfo that the connection exists. The tunnel will thus appear in the database as unidirectional, with the DVMRP side announcing it and the PIM side denying it. Unless there is an alternative path to that DVMRP mrouter, it will not appear in the data set collected.
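One way to flag such cases in collected data is to look for links reported by only one endpoint; a minimal sketch (our own helper, not part of mwatch or crawl):

```python
def unidirectional_tunnels(reported_links):
    """Given directed (reporter, neighbor) pairs collected from mrinfo,
    return the pairs reported by only one endpoint, i.e. candidate
    DVMRP/PIM tunnels that the PIM side does not acknowledge."""
    reported = set(reported_links)
    return [(a, b) for (a, b) in reported if (b, a) not in reported]
```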
Despite the limitations, we hope that these tools will allow ISPs and organizations deploying multicast infrastructure to easily visualize its topology and aspects of its configuration. Such tools can facilitate timely detection and correction of misconfiguration or poor design decisions.
One obstacle to continued MBONE visualization research is that the mwatch caretakers have stopped ongoing operational maintenance and execution of the code. We will need to find or provide support for continuous operation of either it or another scalable crawl-like tool. We would also like to add animation capabilities to allow organizations to track deployment of their own multicast infrastructure. Finally, we would be grateful for the opportunity to run crawl from multiple sites at the same time to see how the location of the querying machine affects the data collected.
Application | Applet | Source Code |
---|---|---|
Manta | no longer supported by CAIDA | |
Otter | https://www.caida.org/tools/visualization/otter/mbone/ | https://www.caida.org/tools/visualization/otter/source |
Piete Brooks from the University of Cambridge clarified the history of mwatch for us and gave us access to its database. Thanks Piete.
Nancy Bachman rescued us once again with her helpful comments and edits.
We extend special thanks to Doug Pasko and Jeff Young, then iMCI engineers who helped us solve the disconnected graph problem and added our measurement probes to their routing tables.
[DEER] D. Waitzman, C. Partridge, S.E. Deering, "Distance Vector Multicast Routing Protocol," RFC 1075, http://www.census.gov/rfc/rfc1075.txt.gz, November 1988.
[DNSL] C. Davis, P. Vixie, T. Goodwin, I. Dickinson, "A Means for Expressing Location Information in the DNS," RFC 1876, http://www.kei.com/homepages/ckd/dns-loc/, January 1996.
[DVMRP] T. Pusateri, "Distance Vector Multicast Routing Protocol," draft-ietf-idmr-dvmrp-v3-07.txt, August 1998.
[EXPR] D. Cheriton, "EXPRESS Multicast: An Extended Service Model for Globally Scalable IP Multicast," submitted for publication, September 1998.
[JC] J. Crowcroft, I. Wakeman, M. Handley, S. Clayman, P. White, "Internetworking Multimedia," UCL Press, http://ee.mokwon.ac.kr/~music/tutorials/mmbook/, 1996.
[MANTA] Java application for viewing the multicast network.
[MAPN] K. Claffy, B. Huffaker, "Macroscopic Internet visualization and measurement," CAIDA, https://www.caida.org/tools/visualization/mapnet/summary.html, March 1997.
[MBGP] T. Bates, R. Chandra, D. Katz, Y. Rekhter, "Multiprotocol Extensions for BGP-4," RFC 2283, February 1998.
[MRIN] V. Jacobson, mrinfo, http://www.cl.cam.ac.uk/Seminars/mbone.html.
[MSDP] D. Farinacci, "Multicast Source Discovery Protocol (MSDP)," http://www.ietf.org/internet-drafts/draft-farinacci-msdp-00.txt, June 1998 (work in progress).
[MWAT] mwatch, collection of multicast information, http://www.cl.cam.ac.uk/Seminars/mbone.html.
[OTTER] Otter, Java application for viewing various network data, https://www.caida.org/tools/visualization/otter.
[PIM] D. Estrin, D. Farinacci, A. Helmy, D. Thaler, S. Deering, M. Handley, V. Jacobson, C. Liu, P. Sharma, L. Wei, "Protocol Independent Multicast-Sparse Mode (PIM-SM): Protocol Specification," RFC 2117, June 1997.
[PMUL] T. Munzner, K. Claffy, B. Fenner, E. Hoffman, "Planet Multicast: visualization of the Mbone," NLANR, http://oceana.nlanr.net/PlanetMulticast, October 1996.
[PLANK] B. Huffaker, J. Jung, D. Wessels, K. Claffy, "Visualization of the Growth and Topology of the NLANR Caching Hierarchy," CAIDA, https://www.caida.org/tools/visualization/plankton/Paper/plankton.html, March 1998.
[VBNS] very high performance Backbone Network Service (vBNS), http://www.vbns.net.
We provide several examples of typical Manta and Otter displays that would be easy for the network engineer to customize for their own network. We have also included the domain cisco.com because it has an extensive multicast infrastructure. These graphs illustrate different Otter features and are in no particular order.
Dataset: ans.net, crawl depth 2
Placement: topological with hand tuning
Color: by threshold