Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

73
Disaster Recovery in distant DC; people, hardware, latency.

description

 

Transcript of Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Page 1: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Disaster Recovery in distant DC;people, hardware, latency.

Page 2: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

• What is really the Allegro.pl?

• Distaster Recovery - true story

• Latency - why 20 ms matters

• Data Centers topology

agenda

Page 3: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Allegro ... QXL ?

Serwisy Aukcyjne Serwisy Ogłoszeniowe Płatności

.pl.cz

.ro

.ua.bg

.hu.sk .lt

.pl

.pl

Sklepy Internetowe

Page 4: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Allegro... beginings

Page 5: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

o Allegro... today

Page 6: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

o Allegro... today

about 1000 servers

Page 7: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

o Allegro... today

about 1000 serversabout 300 IT staff

Page 8: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

o Allegro... today

about 1000 serversabout 300 IT staff

about 40 in infrastructure

Page 9: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

we are growing

Source: http://www.aukcjostat.pl/

Page 10: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

why bother?

Page 11: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Qvo vadis?

Page 12: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Qvo vadis?

Page 13: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Qvo vadis?

Page 14: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Qvo vadis?

Page 15: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Qvo vadis?

Page 16: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Qvo vadis?

Page 17: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

2 hours by plane...

~800 km

Go west...

Page 18: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

2 hours by plane...

~800 km

Go west...

Page 19: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

WWCKD ?

Page 20: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

WWCKD ?

Hope is the mother of lost battles

No guts, no glory.

No pain, no game.

No risk, no fun.

Page 21: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

WWCKD ?

Hope is the mother of lost battles

No guts, no glory.

No pain, no game.

tiaaaa...

No risk, no fun.

Page 22: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Building DC3

Page 23: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Building DC3

Page 24: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

DC net topology

Page 25: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

DC net topology

ISP 1

ISP 2

DC2 , Poznań

Page 26: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

DC net topology

ISP 1

ISP 2

ISP 1

ISP 2

DC2 , Poznań DC3 , Frankfurt

Page 27: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

DC net topology

ISP 1

ISP 2

ISP 1

ISP 2

DC2 , Poznań DC3 , Frankfurt

2G, EoMPLS

Page 28: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

DC net topology

ISP 1

ISP 2

ISP 1

ISP 2

DC2 , Poznań DC3 , Frankfurt

2G, EoMPLS

10G, DWDM

Page 29: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Global Traffic Manager

DC2 , Poznań

DNS req: www.allegro.pl

DC3 , Frankfurt

www.allegro.pl

F5 LTMF5 LTM

F5 LTMF5 LTM

F5 GTM F5 GTM F5 GTM F5 GTM

www.allegro.pl

DNSDNS

$ host -t ns allegro.pl allegro.pl name server gtmdc3.allegro.pl. allegro.pl name server gtmdc2.allegro.pl.

Page 52: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Latency vs tennis

Page 53: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

SYN

SYN/ACK

ACK

TCP/IP explained

FIN

ACK

ACK

DATA

DATA

ACK

Control packets

Data packets

bandwidth

RTT / latency

window size = 64 kBytes

Bandwidth-delay Product

BDP (bytes) = bandwidth (KBytes/sec) x RTT (ms)

Page 54: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

0,0005secSYN

SYN/ACK

ACK

TCP/IP explained

FIN

ACK

0,0005sec

0,0005sec

ACK

DATA

DATA

ACK

0,0005sec

0,0005sec

0,0005sec

0,0005sec

0,0005sec

0,0005sec

Control packets

Data packets

Page 55: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

0,0005secSYN

SYN/ACK

ACK

TCP/IP explained

FIN

ACK

0,0005sec

0,0005sec

ACK

DATA

DATA

ACK

0,0005sec

0,0005sec

0,0005sec

0,0005sec

0,0005sec

0,0005sec

Control packets

Data packets

Scenario #1

• bandwidth = 10Gbit/sec• RTT / latency = 1 ms (1/1000 sec)• window size = 64 kBytes

• data to transfer = 640 000 kBytes (1CD)

What is the time to deliver CD from point A to B?

Page 56: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

0,0005secSYN

SYN/ACK

ACK

TCP/IP explained

FIN

ACK

0,0005sec

0,0005sec

ACK

DATA

DATA

ACK

0,0005sec

0,0005sec

0,0005sec

0,0005sec

0,0005sec

0,0005sec

Control packets

Data packets

Scenario #1

• bandwidth = 10Gbit/sec• RTT / latency = 1 ms (1/1000 sec)• window size = 64 kBytes

• data to transfer = 640 000 kBytes (1CD)

What is the time to deliver CD from point A to B?

1 sec !!!

(640 000 kBytes) / (64 kBytes/ 1ms) = 1000 ms

Page 57: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

0,5sekSYN

SYN/ACK

ACK

TCP/IP explained

FIN

ACK

0,5sek

0,5sek

ACK

DATA

DATA

ACK

0,5sek

0,5sek

0,5sek

0,5sek

0,5sek

0,5sek

Control packets

Data packets

Page 58: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

0,5sekSYN

SYN/ACK

ACK

TCP/IP explained

FIN

ACK

0,5sek

0,5sek

ACK

DATA

DATA

ACK

0,5sek

0,5sek

0,5sek

0,5sek

0,5sek

0,5sek

Control packets

Data packets

Scenario #2

• bandwidth = 10Gbit/sec• RTT / latency = 1 sek• window size = 64 kBytes

• data to transfer = 640 000 kBytes (1CD)

What is the time to deliver CD from point A to B?

Page 59: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

0,5sekSYN

SYN/ACK

ACK

TCP/IP explained

FIN

ACK

0,5sek

0,5sek

ACK

DATA

DATA

ACK

0,5sek

0,5sek

0,5sek

0,5sek

0,5sek

0,5sek

Control packets

Data packets

Scenario #2

• bandwidth = 10Gbit/sec• RTT / latency = 1 sek• window size = 64 kBytes

• data to transfer = 640 000 kBytes (1CD)

What is the time to deliver CD from point A to B?

1000 sec !!!

(640 000 kBytes) / (64 kBytes/1000ms) = 1000 sec

Page 60: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

800km distance...

Page 61: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

800km distance...

storage data base

application / transactionsAIX Oracle

MySQLbackend daemonsfile transfers

server deployment

Linux

Page 62: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

800km distance...

just 20ms...

Page 63: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

co-operation

Page 64: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

co-operation

developers

Page 65: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

co-operation

developers infrastructure

Page 66: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

co-operation

developers infrastructure

Page 67: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

co-operation

developers infrastructure

Page 68: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Recipe for success

Disaster recovery ingredients

Page 69: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Recipe for success

Page 70: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Recipe for success

Page 71: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Recipe for success

Page 72: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Recipe for success

Page 73: Disaster Recovery in distant DC; people, hardware, latency - PLNOG 200909

Thank You, Q&A