diff --git a/_data/concepts.yml b/_data/concepts.yml
index ba80652028..0de6106856 100644
--- a/_data/concepts.yml
+++ b/_data/concepts.yml
@@ -59,7 +59,6 @@ toc:
   - docs/concepts/cluster-administration/logging.md
   - docs/concepts/cluster-administration/resource-usage-monitoring.md
   - docs/concepts/cluster-administration/kubelet-garbage-collection.md
-  - docs/concepts/cluster-administration/multiple-clusters.md
   - docs/concepts/cluster-administration/federation.md
   - docs/concepts/cluster-administration/federation-service-discovery.md
   - docs/concepts/cluster-administration/sysctl-cluster.md
diff --git a/docs/concepts/cluster-administration/federation.md b/docs/concepts/cluster-administration/federation.md
index 7a34b19db4..96dccfde7d 100644
--- a/docs/concepts/cluster-administration/federation.md
+++ b/docs/concepts/cluster-administration/federation.md
@@ -3,6 +3,8 @@ title: Federation
 redirect_from:
 - "/docs/user-guide/federation/"
 - "/docs/user-guide/federation/index.html"
+- "/docs/admin/multi-cluster/"
+- "/docs/admin/multi-cluster.html"
 ---
 
 {% capture overview %}
@@ -117,6 +119,56 @@ delete --cascade=false`
 
 Note: Kubernetes version 1.5 included cascading deletion support for a subset of federation resources.
 
+
+## Scope of a single cluster
+
+On IaaS providers such as Google Compute Engine or Amazon Web Services, a VM exists in a
+[zone](https://cloud.google.com/compute/docs/zones) or [availability
+zone](http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/using-regions-availability-zones.html).
+We suggest that all the VMs in a Kubernetes cluster should be in the same availability zone, because:
+
+  - compared to having a single global Kubernetes cluster, there are fewer single points of failure.
+  - compared to a cluster that spans availability zones, it is easier to reason about the availability properties of a
+    single-zone cluster.
+  - when the Kubernetes developers design the system (e.g. when making assumptions about latency, bandwidth, or
+    correlated failures), they assume that all machines are in a single data center or are otherwise closely connected.
+
+It is okay to have multiple clusters per availability zone, though on balance we think fewer is better.
+Reasons to prefer fewer clusters are:
+
+  - improved bin packing of Pods in some cases, since one cluster with more nodes has less resource fragmentation.
+  - reduced operational overhead (though this advantage diminishes as ops tooling and processes mature).
+  - reduced per-cluster fixed costs, e.g. for apiserver VMs (though these are a small percentage of
+    overall cluster cost for medium to large clusters).
+
+Reasons to have multiple clusters include:
+
+  - strict security policies that require isolating one class of work from another.
+  - test clusters for canarying new Kubernetes releases or other cluster software.
+
+## Selecting the right number of clusters
+
+The number of Kubernetes clusters may be a relatively static choice, revisited only occasionally.
+By contrast, the number of nodes in a cluster and the number of pods in a service may change frequently according to
+load and growth.
+
+To pick the number of clusters, first decide in which regions you need to run in order to have adequate latency to all
+of your end users for services that will run on Kubernetes (if you use a Content Delivery Network, the latency
+requirements for CDN-hosted content need not be considered). Legal issues might influence this as well.
+For example, a company with a global customer base might decide to have clusters in US, EU, AP, and SA regions.
+Call the number of regions you need to be in `R`.
+
+Second, decide how many clusters can be unavailable at the same time while the service as a whole remains available.
+Call the number that can be unavailable `U`. If you are not sure, then 1 is a fine choice.
+
+If it is acceptable for load balancing to direct traffic to any region when a cluster fails, then
+you need at least the larger of `R` and `U + 1` clusters. If it is not (e.g. you want to ensure low latency for all
+users even when a cluster fails), then you need `R * (U + 1)` clusters
+(`U + 1` in each of `R` regions). In any case, try to put each cluster in a different zone.
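+
+As a quick sanity check of this arithmetic, here is a minimal Python sketch (the `min_clusters` helper is
+illustrative only, not part of any Kubernetes tooling):
+
+```python
+def min_clusters(r, u, any_region_ok):
+    """Minimum clusters for r regions, tolerating u simultaneous failures.
+
+    any_region_ok: True if traffic may fail over to any region when a
+    cluster is down; False if each region must keep serving locally.
+    """
+    if any_region_ok:
+        # One shared pool of spares: r for coverage, u + 1 to survive failures.
+        return max(r, u + 1)
+    # Each region needs its own spare capacity: u + 1 clusters per region.
+    return r * (u + 1)
+
+print(min_clusters(4, 1, any_region_ok=True))   # -> 4
+print(min_clusters(4, 1, any_region_ok=False))  # -> 8
+```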
+
+Finally, if any of your clusters would need more than the maximum recommended number of nodes for a Kubernetes
+cluster, you may need even more clusters. Kubernetes v1.3 supports clusters of up to 1000 nodes in size.
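+
+The node cap can push the cluster count above the availability minimum. A second illustrative sketch of that
+adjustment (same caveats as above, and assuming nodes are spread evenly across clusters):
+
+```python
+import math
+
+def clusters_for_capacity(total_nodes, availability_min, max_nodes_per_cluster=1000):
+    """Raise the cluster count if the per-cluster node cap would be exceeded."""
+    by_capacity = math.ceil(total_nodes / max_nodes_per_cluster)
+    return max(availability_min, by_capacity)
+
+# 4500 nodes spread over at least 4 clusters still requires 5 clusters.
+print(clusters_for_capacity(4500, 4))  # -> 5
+```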
 
 {% endcapture %}
 
 {% capture whatsnext %}
diff --git a/docs/concepts/cluster-administration/multiple-clusters.md b/docs/concepts/cluster-administration/multiple-clusters.md
deleted file mode 100644
index 5d2a8dfe55..0000000000
--- a/docs/concepts/cluster-administration/multiple-clusters.md
+++ /dev/null
@@ -1,69 +0,0 @@
----
-assignees:
-- davidopp
-title: Using Multiple Clusters
-redirect_from:
-- "/docs/admin/multi-cluster/"
-- "/docs/admin/multi-cluster.html"
----
-
-You may want to set up multiple Kubernetes clusters, both to
-have clusters in different regions to be nearer to your users, and to tolerate failures and/or invasive maintenance.
-This document describes some of the issues to consider when making a decision about doing so.
-
-If you decide to have multiple clusters, Kubernetes provides a way to [federate them](/docs/admin/federation/).
-
-## Scope of a single cluster
-
-On IaaS providers such as Google Compute Engine or Amazon Web Services, a VM exists in a
-[zone](https://cloud.google.com/compute/docs/zones) or [availability
-zone](http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/using-regions-availability-zones.html).
-We suggest that all the VMs in a Kubernetes cluster should be in the same availability zone, because:
-
-  - compared to having a single global Kubernetes cluster, there are fewer single-points of failure
-  - compared to a cluster that spans availability zones, it is easier to reason about the availability properties of a
-    single-zone cluster.
-  - when the Kubernetes developers are designing the system (e.g. making assumptions about latency, bandwidth, or
-    correlated failures) they are assuming all the machines are in a single data center, or otherwise closely connected.
-
-It is okay to have multiple clusters per availability zone, though on balance we think fewer is better.
-Reasons to prefer fewer clusters are:
-
-  - improved bin packing of Pods in some cases with more nodes in one cluster (less resource fragmentation)
-  - reduced operational overhead (though the advantage is diminished as ops tooling and processes matures)
-  - reduced costs for per-cluster fixed resource costs, e.g. apiserver VMs (but small as a percentage
-    of overall cluster cost for medium to large clusters).
-
-Reasons to have multiple clusters include:
-
-  - strict security policies requiring isolation of one class of work from another (but, see Partitioning Clusters
-    below).
-  - test clusters to canary new Kubernetes releases or other cluster software.
-
-## Selecting the right number of clusters
-
-The selection of the number of Kubernetes clusters may be a relatively static choice, only revisited occasionally.
-By contrast, the number of nodes in a cluster and the number of pods in a service may change frequently according to
-load and growth.
-
-To pick the number of clusters, first, decide which regions you need to be in to have adequate latency to all your end users, for services that will run
-on Kubernetes (if you use a Content Distribution Network, the latency requirements for the CDN-hosted content need not
-be considered). Legal issues might influence this as well. For example, a company with a global customer base might decide to have clusters in US, EU, AP, and SA regions.
-Call the number of regions to be in `R`.
-
-Second, decide how many clusters should be able to be unavailable at the same time, while still being available. Call
-the number that can be unavailable `U`. If you are not sure, then 1 is a fine choice.
-
-If it is allowable for load-balancing to direct traffic to any region in the event of a cluster failure, then
-you need at least the larger of `R` or `U + 1` clusters. If it is not (e.g. you want to ensure low latency for all
-users in the event of a cluster failure), then you need to have `R * (U + 1)` clusters
-(`U + 1` in each of `R` regions). In any case, try to put each cluster in a different zone.
-
-Finally, if any of your clusters would need more than the maximum recommended number of nodes for a Kubernetes cluster, then
-you may need even more clusters. Kubernetes v1.3 supports clusters up to 1000 nodes in size.
-
-## Working with multiple clusters
-
-When you have multiple clusters, you would typically create services with the same config in each cluster and put each of those
-service instances behind a load balancer (AWS Elastic Load Balancer, GCE Forwarding Rule or HTTP Load Balancer) spanning all of them, so that
-failures of a single cluster are not visible to end users.