oracle rac 12 - doag.org · pdf file11 12.1 node eviction •2 knoten cluster: knoten mit...
TRANSCRIPT
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 1
Oracle RAC 12.2 Mai 2017
Sebastian Solbach BU Core & Cloud Technologies Oracle Deutschland B.V. & Co KG
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 2
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 3
Cluster Neuigkeiten
• Stretch Cluster
• Cluster Node Weight
• Cluster Health Framework
• Cluster Ressource Gruppen
• Service Orientierter Buffer Cache, Multitenant Cache Optimierung
• Recovery Buddy
• Domain Service Cluster
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 4
12.2 RAC Installation
• Kein C++ Kompiler mehr notwendig
• Separation of Duty: SYSRAC
• Vereinfachte Cluster Installation
– Unzip & ./gridSetup.sh
– ASM Filter Driver Integration
– separate ASM Diskgroup für GI Management Repository
– Stretch Cluster Setup und Definition
– ASM immer notwendig
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 5
Stretch Cluster
• Installation ist Stretch Cluster Aware
– Cluster Setup im Extended Modus
– Knoten, Failgroups werden „Site“ Aware
• Neues „Extended“ Attribut für Flex Diskgruppen – Diskgruppen mit “Extended Redundancy”
sind Site-Aware
– Extended Diskgruppen unterstützen mehrere Voting Disks Failgroups
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 6
Stretch Cluster Setup bis 12.1
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 7
Voting Disk Konfiguration
Stretch Cluster Setup bis 12.1
Quorum VD3
VD1 VD2
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 8
Voting Disk Konfiguration
Stretch Cluster Setup 12.2
Quorum VD7
VD1 VD2 VD3 VD4 VD5 VD6
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 9
Konvertierung eines Cluster
• FLEX Diskgruppe muss neu angelegt werden
• Datendateien können bei 12.2 Online migriert werden
• Zuweisen der einzelnen Knoten zu Sites
http://docs.oracle.com/database/122/CWLIN/extending-a-cluster-after-
upgrading-grid-infrastructure.htm
rootcrs.pl -converttoextended -first -sites list_of_sites
-site node_site
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 10
Cluster Neuigkeiten
• Stretch Cluster
• Cluster Node Weight
• Cluster Health Framework
• Cluster Ressource Gruppen
• Service Orientierter Buffer Cache, Multitenant Cache Optimierung
• Recovery Buddy
• Domain Service Cluster
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 11
12.1 Node Eviction
• 2 Knoten Cluster: Knoten mit niedrigster Knotennummer überlebt
• Wichtig: Nicht darauf verlassen (da sich das mit 12.2 ändert)!
✔ 1 2
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 12
Node Weighting in Oracle RAC 12c Release 2
• Node Weighting betrachtet den Workload eines Knoten während des Fencings
• Beispiel: Der Knoten mit mehr Services (zum Fence Zeitpunkt) überlebt
Wenn alles gleich ist, den Knoten mit mehr “Arbeit” leben lassen
✔ 1 2
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 13
3 Knoten Cluster profitiert von “Node Weighting”,
wenn 3 gleich große SubCluster übrig bleiben (2 unterschiedlich große
Cluster sind nicht gleich)
“Secondary failure” Betrachtung damit sind die Cluster nicht gleich. Weitere Verbesserungen geplant
Fallback falls keine eindeutige Entscheidung
getroffen werden kann
Was heißt “gleich”?
✔
Public network card failure.
“Conflict”.
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 14
CSS_CRITICAL kann auf verschiedenen Ebenen
gesetzt werden um Resourcen als kritisch zu markieren.
CSS_CRITICAL geht vor falls keine technischen Probleme
vorliegen überlebt der Knoten mit der kritischen Resource
Fallback wenn nach Auswertung von
CSS_CRITICAL beide “gleich” sind.
CSS_CRITICAL – Manuelles Überschreiben
crsctl set server css_critical {YES|NO}
+ server restart
srvctl modify database -help |grep critical
… -css_critical {YES | NO}
Define whether the database or service is CSS critical
✔
Node eviction despite WL; WL
will failover. “Conflict”.
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 15
Cluster Neuigkeiten
• Stretch Cluster
• Cluster Node Weight
• Cluster Health Framework
• Cluster Ressource Gruppen
• Service Orientierter Buffer Cache, Multitenant Cache Optimierung
• Recovery Buddy
• Domain Service Cluster
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 16
Cluster Verification
Utility
ORAchk
Cluster Health
Monitor
Cluster Health Advisor
Trace File Analyzer
Hang Manager
Memory Guard
Quality of Service
Management Oracle 12c Autonomous Health Framework
Arbeitet im Hintergrund
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 17
Zieht Baselines und überprüft Best Practices
Oracle Cluster Verification Utility (CVU)
• Läuft automatisch
• Prüft O/S, GI und DB auf wichtige Best Practices mit “healthcheck”
• Daemon läuft alle 6 Stunden
• ASM Best Practices Check
• Anlegen von Baseline Sammlungen
• Benutzer definierte Checks / Disablen von Checks
• Neues Berichts Format
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 18
Best Practices und prüfen bekannter Probleme Oracle 12c ORAchk/EXAchk
• Proactiver Scan
• Email Benachrichtigung
(im Daemon Mode)
• Analysiert bekannte Probleme
• Berichte über den System Status
• Drill Down
• ORAchk Collection Manager :
Übersicht über mehrere Systeme
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 19
Cluster Health Monitor (CHM)
• Automatisch eingeschaltet
• Detaillierte Betriebssystem-Informationen
• Unterstützt Analyse von Kntoen Reboots
• Speichert alle Logs lokal
• Benutzerdefinierte “Pinned” Prozesse
• Listens to CSS und GIPC Events
• Kategorisierung nach Prozesstyp
• Unterstützt “Plug-Ins” (traceroute, netstat, ping, etc.)
• CSV Ausgabe für einfache Analyse
Metriken zur Diagnose von Cluster und Datenbanken
GIMR
ologgerd
(master)
osysmon
d
osysmond
osysmond
osysmond
12c Grid Infrastructure Management Repository
OS Data OS Data
OS Data
OS Data
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 20
Oclumon CLI oder Integration in EM Cloud Control Cluster Health Monitor (CHM)
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 21
Cluster Health Advisor (CHA)
• Automatisch angeschaltet
• Erkennt Knoten & Datenbank Performance Probleme
• Schnelle Warnungen & Empfehlungen
• Untersützt “Calibrierung” für bessere Analyse
• Integriert im EMCC Incident Manager
• Standalone GUI Tool
Erkennt Cluster & DB Probleme
OS Data
GIMR
ochad
DB Data
CHM
Node Health
Prognostics Engine
Database Health
Prognostics Engine
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 22
Trace File Analyser
TFA
File
Metadata
Collection
Repository
Node 1
TFA
File
Metadata
Collection
Repository
Node N
1
2
3
3
4
5
User/Daemon initiated
TFA initiated
1. Daemon initiates diagnostic collection
2. TFA signals collections on other nodes
3. Collections written to local TFA repositories
---------- Coming ----------
4. Local collections copied to TFA service in DSC
5. Analytics on the TFA service (Receiver)
TFA metadata
COMING Domain Services Cluster
Oracle Cluster Domain
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 23
Oracle 12c Hang Manager
• Hang Manager berücksichtigt nur Sessions, die Ressourcen blockieren, auf die andere Sessions warten
• Hang Manager erkennt Probleme über Layer hinweg (ASM, DB)
• Deadlocks & User Locks sind nicht vom Hang Manager verwaltet
• Warnungen über V$ Views
• Benutzerdefiniert Kontrolle über PL/SQL
Session
DIA0
EVALUATE
DETECT
ANALYZE
Hung?
VERIFY
Victim
QoS Policy
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 24
Oracle 12c Memory Guard
• Analysiert 1x pro Minute Memory Over-Commit
• Warnt wenn Server zum “Swappen” kommt
• Schützt Applikationen, indem keine neuen Sessions zugelassen werden
• Stopped Services Transaktional auf dem Server
• Wenn der “Stress” nachlässt werden Sessions wieder zugelassen
Schützt den Cluster vor Memory Engpässen
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 25
Oracle 12c Quality of Service Management
• Ab 12.1.0.2+ Grid Infrastructure
• Funktioniert auch mit 11.2 (RAC)
• Cloud Control Integration
• Clusterweites KPI Dashboard
• Measure, Monitor Only Modus
• Management Modus
SLA Überwachung für Services
Full EM Cloud Control Integration
Monitor and Alert
Define Policies
Evaluate and Report
Analyze and Aggregate
Classify and Measure
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 26
Cluster Neuigkeiten
• Stretch Cluster
• Cluster Node Weight
• Cluster Health Framework
• Cluster Ressource Gruppen
• Service Orientierter Buffer Cache, Multitenant Cache Optimierung
• Recovery Buddy
• Domain Service Cluster
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 27
Cluster Ressource Gruppen
• Gruppieren von Ressourcen im Cluster für das Failover
• Gemeinsame Abhängigkeiten
# Create resource group
crsctl add resourcegroup xag_group -type cluster_resourcegroup -attr
"SERVER_POOLS“
# Add resources to group
crsctl modify resource xag_vip -attr "RESOURCE_GROUP=xag_group"
crsctl modify resource xag_mp -attr "RESOURCE_GROUP=xag_group"
crsctl modify resource xag_app -attr "RESOURCE_GROUP=xag_group"
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 28
Cluster Activity Log
Node A
Oracle GI
Oracle RAC
Node B
Oracle GI
Oracle RAC
Cluster Activity Log
• Activity Einträge werden vom ologgerd (CHM) asynchron in die MGMTDB (GIMR) geschrieben
• Ergebnis: Clusterweites Log inkl. Reihenfolge der Events
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 29
CRSCTL Query CALOG Syntax
crsctl query calog -h
Usage:
crsctl query calog [-aftertime <after_timestamp>]
[-beforetime <before_timestamp>] [-duration <time_interval> | -follow]
[-filter <filter_expression>] [-fullfmt | -xmlfmt]
Lists the cluster activity log activities matching the specified criteria
Where
after_timestamp Time in format "YYYY-MM-DD HH24:MI:SS[.FF][TZH:TZM]"
OR "YYYY-MM-DD" OR "HH24:MI:SS[.FF][TZH:TZM]"
before_timestamp Time in format "YYYY-MM-DD HH24:MI:SS[.FF][TZH:TZM]"
OR "YYYY-MM-DD" OR "HH24:MI:SS[.FF][TZH:TZM]"
time_interval Time interval in format "DD HH:MM:SS"
filter_expression Refer to product documentation.
-follow Gives continuous output
-fullfmt Detail output format
-xmlfmt XML output format
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 30
CRSCTL Query CALOG Output
$ crsctl query calog -beforetime "2016-02-09 15:09:46.522-07:00"
2016-02-09 15:06:51.106-07:00 : Server 'tclust01' has been assigned to pool 'Free'. : 143198681110210633/0/1 :
2016-02-09 15:09:39.228-07:00 : Resource 'ora.net1.network' has been registered. : 143198681110610633/117/1 :
2016-02-09 15:09:40.933-07:00 : Resource 'ora.tclust01.vip' has been registered. : 143198697922810633/136/1 :
2016-02-09 15:09:41.827-07:00 : Resource 'ora.ons' has been registered. : 143198698093310633/143/1 :
2016-02-09 15:09:46.522-07:00 : Resource 'ora.ons' has been modified. : 143198698182710633/194/1 :
Format of output records is:
DATE & TIME (YYYY-MM-DD HH24:MI:SS[.FF][[+-]HH:MM]): Event text: ACTID
ACTID is a generated sequential identifier for related Clusterware events, use it to track related events
Such as,
crsctl query calog -filter "actid == 143198681110210633/0/1”
will only return events associated with that event.
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 31
Queries against Cluster Activity Log • crsctl query calog -duration "00 00:02:00“ << last two minutes >>
• crsctl query calog -duration "00 02:00:00“ << last two hours >>
• crsctl query calog –follow << like tail… >>
• crsctl query calog -aftertime "2016-02-16 17:49:12.397" –follow
• crsctl query calog –xmlfmt << in xml format >>
• crsctl query calog -filter "entity_name == foo“ << activity on foo >>
• crsctl query calog -filter "actid == 145567015239630019/49543/1"
<< activities related to this actid… heirarchy of actions >>
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 32
Cluster Neuigkeiten
• Stretch Cluster
• Cluster Node Weight
• Cluster Health Framework
• Cluster Ressource Gruppen
• Service Orientierter Buffer Cache, Multitenant Cache Optimierung
• Recovery Buddy
• Domain Service Cluster
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 33
Verbesserung Singleton Services
• Service orientierter Buffer Cache Zugriff ermittelt die Daten (auf Objekt Level) die vom Service zugegriffen werden
• Mastering auf dem Knoten des Services
Konsistente Performance nach Failover
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 34
• Cache Fusion verwaltet die Zuordnung Service zu Buffer Cache
– Analysiert welcher Service welche Blöcke in den Buffer Cache liest
• Statistiken:
– Mastern der Resourcen nur auf dem aktiven Knoten
– Pre-Warm Cache währen Service Relokierung (Planned Downtime)
Service-Oriented Buffer Cache Access
NodeA
Oracle GI
Oracle RAC
NodeB
Oracle GI
Oracle RAC
cons_1 cons_2
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 35
Pluggable Database und Service Isolation
• Verbessert Performance
• DLM Operationen von PDBs auf Services beschränkt
• Block Operationen basiert auf InMemory Block Trennung
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 36
• Single Domain (domain 0) auf CDB Ebene für alle PDBs
• Operationen wie PDB start, stop, reconfigurations basieren auf einzelner Domain
• Zugriffe auf die Resource Hash Table erhöht sich bei mehreren PDBs (>250)
Pdb-1
Pdb-2
Pdb-3
Pdb-4
Domain 0
Instance
SGA
Resource Hash
Table
Oracle Multitenant 12c Rel. 1 Implementierung
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 37
• Jede PDB hat eigene Domain ID
• GES & GCS Resources zwischen PDB-Spezifischen Domains aufgeteilt
• Mehr und konsistente Performance
– Eine “aktive” PDB beeinflusst nicht andere PDBs
– Kleinere Hash Table Größe in den Instanzen, die die PDB geöffnet haben
Oracle Multitenant 12c Rel. 2 Optimierung
Domain1 Resource Hash
Table
Resource Hash
Table
Resource Hash
Table
Resource Hash
Table
Domain 2
Domain 3
Domain 4
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 38
Cluster Neuigkeiten
• Stretch Cluster
• Cluster Node Weight
• Cluster Health Framework
• Cluster Ressource Gruppen
• Service Orientierter Buffer Cache, Multitenant Cache Optimierung
• Recovery Buddy
• Domain Service Cluster
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 39
• Detect: Festellen ob ein Knoten den Cluster verlässt oder beitritt
• Evict: Instanz aus dem Cluster entfernen (STONITH)
• Elect: Recovery Master Instanz (RM) benennen für Recovery
• Read Redo: RM liest Redo
• Apply Recovery: Anwenden des Redo für Recovery
Cluster Rekonfiguration
Detect
Evict
Elect Recovery
Read Redo
Apply Recovery
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 40
• Buddy Instances eliminiert die “Elect” Phase
• “Redo” ist bereits im Memory
• “Apply” geht schneller, da nicht mehr gelesen werden muss
Kürzere Brownout Zeit mit “Recovery Buddy”
Detect
Evict
Elect Recovery
Read Redo
Apply Recovery
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 41
1. Einfache Buddy Instance Zuteilung: ( 1 2, 2 3, 3 4, 4 1)
2. Recovery Buddies werden beim Instanz Start zugewiesen
3. RMS0 Prozess verwaltet die Redo Log Änderungen im Memory
4. Memory wird während des Recoveries verwendet
Recovery Buddy Instances
1. Inst1 ist Recovery Buddy für Inst2 2. Inst2 ist Recovery Buddy für 3 usw. 3. Recovery Buddy Neuverteilung beim
Hinzufügen/Verlassen von Knoten
Inst 1 Inst 2 Inst 3 Inst 4
Recovery Buddy 2
Recovery Buddy 3
Recovery Buddy 4
MyCluster
Recovery Buddy 1
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 42
• Recovery Zeit ist abhängig von Dirty Blöcken
• Recovery Zeit kann mit FAST_START_MTTR_TARGET angepasst werden: 1. “Select Target_mttr,Estimated_mttr
from gv$instance_recovery”
2. Set Fast_Start_MTTR_Target=
<value>
Geringerer Brownout
• Vendor Clusterware / NFS
– Instanz Rekonfiguration beruht auf der Oracle Clusterware. Zusätzliche Clusterware kann die Zeit verzögern
– Oracle Homes auf NFS können durch NFS “hangs” Recovery Zeiten vergrößern
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 43
Cluster Neuigkeiten
• Stretch Cluster
• Cluster Node Weight
• Cluster Health Framework
• Cluster Ressource Gruppen
• Service Orientierter Buffer Cache, Multitenant Cache Optimierung
• Recovery Buddy
• Domain Service Cluster
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 47
Domain Services Cluster
Cluster Domain
IO Service ASM
Service
Database Member Cluster
Uses ASM Service
Database Member Cluster
Uses IO & ASM Service of DSC
Trace File Analyzer
(TFA) Service
Mgmt Repository
(GIMR) Service
Application Member Cluster
GI only
Database Member Cluster
Uses local ASM
Shared ASM
Additional Optional Services
Rapid Home Provisioning
(RHP) Service
Private Network SAN NAS
Copyright © 2017, Oracle and/or its affiliates. All rights reserved. BU Core & Cloud Technologies 52
http://blogs.oracle.com/coretec
http://tinyurl.com/oratech-monthly
Nächste Ausgabe 07. Juni
@oraclebudb
@s2solbach