EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes,...

35
EVPN BUM Flooding Reduction Krzysztof Grzegorz Szarkowicz, PLM [email protected]

Transcript of EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes,...

Page 1: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN BUM Flooding ReductionKrzysztof Grzegorz Szarkowicz, [email protected]

Page 2: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN Introduction• EVPN is getting traction in DC/Cloud deployments, replacing other

(legacy) L2 architectures (i.e. VPLS)• It has many benefits, like for example:

• Unified, standardized control plane (BGP)• Unified, standardized A/A and A/S multi-homing

• Multi-vendor interoperability• Near Hitless Host Mobility• Dramatic reduction of broadcast and multicast traffic

• This session covers the last bullet point in more details

Page 3: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN in DC

DC Interconnect

DC1 DC2

DC3 DC4

EVPN-VXLAN

EVPN-VXLAN

EVPN-VXLAN

EVPN-VXLAN

EVPN-MPLSEV

PN D

omai

n

• Mega DC• Many 100k

hosts• DCs are being

interconnected• It all results in

large broadcast (flooding) domains

Page 4: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Session Agenda

ARP Flooding Reduction

Multicast Flooding Reduction

Efficient Replication of BUM Traffic

Inter-Subnet Multicast

Page 5: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Broadcast Flooding• Large broadcast flooding (e.g. ARP) might negatively impact DC

operation• 600k hosts with 10 min ARP cache timeout à average 1k pps of ARP

Requests • Routers connected to DC might need to process large number of ARPs

• Typically, it happens in “slow path” (software processing)• Can cause heavy load on the router’s CPU• Typically limitation are low thousands per second

• Historically, some attempts have been made to address the problem:• RFC 6820: Address Resolution Problems in Large Data Center Networks

• EVPN brings holistic way to suppress ARP storms

Page 6: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (1)

RR

S-MAC: AS-IP: A

T-MAC: ?T-IP: B

ARP Req

1

AB

Host ’A’ issues ARP Request to resolve IP address ‘B’

Page 7: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (2)

RR

S-MAC: AS-IP: A

T-MAC: ?T-IP: B

ARP Req

EVPN PE router, where ARP Request (with broadcast D-MAC) arrives, floods its via EVPN machinery, eventually arriving to host B S-MAC: A

S-IP: AT-MAC: ?

T-IP: B

ARP Req

S-MAC: AS-IP: A

T-MAC: ?T-IP: B

ARP Req

S-MAC: AS-IP: A

T-MAC: ?T-IP: B

ARP Req

S-MAC: AS-IP: A

T-MAC: ?T-IP: B

ARP Req

S-MAC: AS-IP: A

T-MAC: ?T-IP: B

ARP ReqS-MAC: A

S-IP: AT-MAC: ?

T-IP: B

ARP Req

2A

B

Page 8: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (3)

RR

3

MAC-A:IP-A

In the mean time, ingress EVPN PE intercepts ARP Request, learns MAC-A:IP-A association from it, and updates its EVPN database

AB

Page 9: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (4)

RR4

MAC-A:IP-A

BGP EVPN MAC/IP RouteMAC:A, IP:A

Ingress EVPN informs remaining PEs about learned MAC-A:IP-A via BGP EVPN MAC/IP (Type 2) Route

AB

Page 10: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (5)

RR

MAC-A:IP-A

MAC-A:IP-A MAC-A:IP-A

MAC-A:IP-A

MAC-A:IP-A

MAC-A:IP-AMAC-A:IP-A

MAC-A:IP-A

5 5

5

5

55

5

Remaining EVPN PEs update their EVPN database with MAC-A:IP-A association learned from ingress PE. Eventually, all PEs know about MAC-A:IP-A

AB

Page 11: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (6)

RR

MAC-A:IP-A

MAC-A:IP-A MAC-A:IP-A

MAC-A:IP-A

MAC-A:IP-A

MAC-A:IP-AMAC-A:IP-A

MAC-A:IP-A

S-MAC: BS-IP: B

T-MAC: AT-IP: A

ARP Rep

Host-B answers with ARP Reply

6A

B

Page 12: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (7)

RR

S-MAC: BS-IP: B

T-MAC: AT-IP: A

ARP Rep

MAC-A:IP-A

MAC-A:IP-A MAC-A:IP-A

MAC-A:IP-A

MAC-A:IP-A

MAC-A:IP-AMAC-A:IP-A

MAC-A:IP-A

EVPN PE router, where ARP Reply arrives, has already MAC-A entry in its EVPN database, so ARP Reply is unicasted (not broadcasted) via EVPN machinery, and eventually arrives at Host-A

7

AB

Page 13: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (8)

RR

MAC-A:IP-A

MAC-A:IP-A MAC-A:IP-A

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-A

MAC-A:IP-AMAC-A:IP-A

MAC-A:IP-A

8

In the mean time, EVPN PE intercepts ARP Reply, learns MAC-B:IP-B association from it, and updates its EVPN database

AB

Page 14: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (9)

RR9

MAC-A:IP-A

MAC-A:IP-A MAC-A:IP-A

MAC-A:IP-A

MAC-A:IP-AMAC-A:IP-A

MAC-A:IP-A

BGP EVPN MAC/IP RouteMAC:B, IP:BIngress EVPN

informs remaining PEs about learned MAC-B:IP-B via BGP EVPN MAC/IP (Type 2) Route

MAC-A:IP-AMAC-B:IP-B

AB

Page 15: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (10)

RR

10

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B-IP-B

10 10

10

1010

10

Remaining EVPN PEs update their EVPN database with MAC-B:IP-B association learned from ingress PE. Eventually, all PEs know about MAC-A:IP-A and MAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

AB

Page 16: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (11)

RR

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B:IP-B

MAC-A:IP-AMAC-B-IP-B

Host ’C’ issues ARP Request to resolve IP address ‘B’

MAC-A:IP-AMAC-B:IP-B

S-MAC: CS-IP: C

T-MAC: ?T-IP: B

ARP Req

11

B

C

Page 17: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (12)

RR

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B-IP-BMAC-C:IP-C

EVPN PE already has an entry for MAC-B:IP-B, so it

§ sends ARP Reply to host C

§ Learns MAC-C:IP-C

§ Informs remaining PEs about MAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

S-MAC: BS-IP: B

T-MAC: CT-IP: C

ARP Rep

12

12

12

12

1212

12

12

12

12

BGP EVPN MAC/IP RouteMAC:C, IP:C

B

C

Page 18: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ARP Suppression Operation (13, 14)

RR

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

MAC-A:IP-AMAC-B-IP-BMAC-C:IP-C

When ARP cache on Host-B expires, Host-B issues ARP Request§ suppressed on PE§ PE sends

immediate ARP Reply

§ No update in EVPN BGP machinery required

MAC-A:IP-AMAC-B:IP-BMAC-C:IP-C

S-MAC: BS-IP: B

T-MAC: ?T-IP: C

ARP Req

13

14

S-MAC: CS-IP: C

T-MAC: BT-IP: B

ARP RepB

C

Page 19: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN ND Suppression• ND suppression follows similar concepts to ARP suppression, hence

not discussed explicitly in this session

Page 20: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Session Agenda

ARP Flooding Reduction

Multicast Flooding Reduction

Efficient Replication of BUM Traffic

Inter-Subnet Multicast

Page 21: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Basic EVPN Multicast Distribution (1)Multicast is delivered from ingress PE to allegress PEs participating in given EVPN via ingress replicationEgress PE delivers/blocks MCAST to local receivers based on

§ DF/non-DF state§ Local IGMP

membership state

S-A

S-B

R1-B

R1-AR2-A

R2-B

R3-A

R4-A

DF

DF

DFDF

DF

ESI

ESI

ESIESI

ESI

ESI

ESI

MCAST distribution very inefficient

Page 22: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Basic EVPN Multicast Distribution (2)• Two aspects of inefficient MCAST distribution in basic EVPN

deployments• MCAST distributed to all PEs

• EVPN creates states basic on• Data plane or PE-CE control plane (for traffic received from CE)

» IGMP• PE-PE BGP EVPN control plane (for traffic received via EVPN core)

» BGP EVPN extensions required to accomplish that à SMET (Type 6) Route• Ingress replication

• More efficient replication methods required• P2MP (i.e. PIM, mLDP, RSVP, BIER)• Assisted Replication

Page 23: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Selective Multicast Ethernet Tag (SMET) Route (1)Receives reports the willingness to receive MCAST traffic via standard IGMP (v1/v2/v3) Group Membership (“Join”) messages

S-A

S-B

R1-B

R1-AR2-A

R2-B

R3-A

R4-A

DF

DF

DFDF

DF

ESI

ESI

ESIESI

ESI

ESI

ESI

RR

IGMP (*, A) Join

IGMP (*, A) Join

IGMP (*, B) Join

IGMP (*, A) Join

IGMP (*, A) Join

1

1

1

1

1

Page 24: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Selective Multicast Ethernet Tag (SMET) Route (2)First hop PEs convert IGMP Group Membership messages to BGP EVPN Selective Multicast Ethernet Tag (SMET) messages (Type 6)

§ Only R4-A shown, as an example

§ Based on that information, all involved PEs are aware, where multicast receivers for specific MCAST flows reside

S-A

S-B

R1-B

R1-AR2-A

R2-B

R3-A

R4-A

DF

DF

DFDF

DF

ESI

ESI

ESIESI

ESI

ESI

RR

IGMP (*, A) Join

1

2

BGP EVPN SMET Route(*, A)

Page 25: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Selective Multicast Ethernet Tag (SMET) Route (3)Based on BGP EVPN SMET (Type 6) Route, PEs with attached sources can send MCAST flows to specific PEs only S-A

S-B

R1-B

R1-AR2-A

R2-B

R3-A

R4-A

DF

DF

DFDF

DF

ESI

ESI

ESIESI

ESI

ESI

ESI

Page 26: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

BGP EVPN Join Sync (Type 7) RouteBGP EVPN Leave Sync (Type 8) RouteIn EVPN A/A multi-homing

1) IGMP Join/Leave might arrive to non-DF

2) It is converted to EVPN Join/Leave Sync (Type 7/8) Route

3) SMET (Type 6) Route announced by DF only based on local IGMP Join or EVPN Join

S-A

S-B

R1-B

R1-AR2-A

R2-B

R3-A

R4-A

DF

DF

DFDF

DF

ESI

ESI

ESIESI

ESI

ESI

12

BGP EVPN SMET Route(*, A)

RR

3IGMP (*, A) Join

BGP EVPN Join Sync Route(*, A), ESI=X

Page 27: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Session Agenda

ARP Flooding Reduction

Multicast Flooding Reduction

Efficient Replication of BUM Traffic

Inter-Subnet Multicast

Page 28: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN P2MP Multicast DistributionBUM frames are replicated on transit nodes, according to the P2MP structure

§ Universally deployable in any arbitrary topology

§ Requires consistent P2MP support on all nodes

§ Information about P2MP tunnel distributed via Provider Multicast Service Interface (PMSI) attribute in the Inclusive Multicast Ethernet Tag (Type 3) EVPN Route

S-A

R1-AR2-A

R3-A

R4-A

DF

DF

DFDF

DF

ESI

ESIESI

ESI

ESI

Page 29: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN Assisted Replication• Referred often as

“Optimized Ingress Replication”

• Selected (powerful) nodes are designated to perform replication

• Typically suitable to NVO/DC (Leaf/Spine) designs, with powerful Spines, and low performance Leafs

S-A

R1-AR2-A

R3-A

R4-A

DF

DF

DFDF

DF

ESI

ESIESI

ESI

ESI

AR

Page 30: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Session Agenda

ARP Flooding Reduction

Multicast Flooding Reduction

Efficient Replication of BUM Traffic

Inter-Subnet Multicast

Page 31: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN Legacy Inter-Subnet Multicast

PE1

VRF

VRF

VRF

PE2

PE3

source

receiver1

receiver2

receiver3

receiver4

BD 1, subnet 1

BD 2, subnet 2

IRB

receiver5PIM DR on subnet2

EVPN

Page 32: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN Optimized Inter-Subnet Multicast

PE1

VRF/L3

VRF/L3

VRF/L3

PE2

PE3

source

receiver1

receiver2

Receiver3

receiver4

BD1, subnet 1

BD2, subnet 2

IRB

receiver5

EVPN

Page 33: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

EVPN Optimized Inter-Subnet MulticastSource BD not present on all PEs

PE1

VRF/L3

VRF/L3

VRF/L3

PE2

PE3

source

receiver1

receiver2

Receiver3

BD1, subnet 1

BD2, subnet 2

IRB

receiver5

EVPN

SBD, subnet 0

Page 34: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

Summary – Standardization StatusFeature Specification EVPNRouteTypesInvolved

ARP/ND FloodingReduction(APR/NDSnooping/Proxy) RFC7432, Section10 Type2:MAC/IPAdvertisementRoute

Multicast FloodingReduction(IGMP/MLDSnooping/Proxy) draft-ietf-bess-evpn-igmp-mld-proxy-01

Type6:SelectiveMulticastEthernetTagRouteType7:IGMPJoinSynchRouteType8:IGMP LeaveSynchRoute

P2MPBUMTrees RFC7432, Section16.2à RFC7117 Type3:InclusiveMulticastEthernetTagRoute

Assisted Replication draft-ietf-bess-evpn-optimized-ir-03 Type 3:InclusiveMulticastEthernetTagRouteType 11:LeafAuto-Discovery(AD)route

Optimized Inter-SubnetMulticast draft-ietf-bess-evpn-irb-mcast-00Type 3:InclusiveMulticastEthernetTagRouteType6:SelectiveMulticastEthernetTagRouteType 10:S-PMSIAuto-Discovery(AD)route

Multicast FloodingReduction(PIMSnooping/Proxy) draft-skr-bess-evpn-pim-proxy-01

Type6:SelectiveMulticastEthernetTagRouteType7:IGMP/PIMJoinSynchRouteType<tbd>:MulticastRouterDiscovery(MRD)RouteType <tbd>:PIMRPT-PruneRouteType<tbd>:PIMRPT-PruneJoinSynchRoute

DHCPFloodingReduction(DHCPSnooping/Proxy) draft-surajk-evpn-access-security-00 Type <tbd>:DHCPSnoopAdvertisementRoute

Page 35: EVPN BUM Flooding Reduction - Juniper Networks€¦ · BUM frames are replicated on transit nodes, according to the P2MP structure § Universally deployable in any arbitrary topology

THANK YOU

Krzysztof Grzegorz Szarkowicz, [email protected]