0% found this document useful (0 votes)
16 views46 pages

Martin Berger - Oracle Priva

The document provides an overview of the Oracle Private Cloud Appliance X9-2, detailing its architecture, benefits, and key features such as hardware specifications and management tools. It covers the transition from Oracle Exadata to the Private Cloud Appliance, including installation, patching, and VM provisioning processes. Additionally, it discusses the importance of automation and integrated teams in delivering tailored data platforms for clients.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views46 pages

Martin Berger - Oracle Priva

The document provides an overview of the Oracle Private Cloud Appliance X9-2, detailing its architecture, benefits, and key features such as hardware specifications and management tools. It covers the transition from Oracle Exadata to the Private Cloud Appliance, including installation, patching, and VM provisioning processes. Additionally, it discusses the importance of automation and integrated teams in delivering tailored data platforms for clients.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 46

Grüessech

1
Oracle Private Cloud Appliance X9-2
- There is a new kid on the Block!
... 75 Service Requests later...

Martin Berger
Trivadis – Part of Accenture
Agenda 1 Project

2 Machine

3 Launch

4 Patching & Upgrade

5 Management

6 VM Provisioning

7 Various Things

3
Martin Berger – Data Platforms
[email protected]

Tech Architect Associate Manager


Kestenholz / Jurasüdfuss / Switzerland
2 Junior-DBAs (9 & 10)
Fireman / 2nd in Cmd
E-Biker
Let’s jump into the cloud!

martinberger_ch www.martinberger.com

4
ASG DATA PLATFORMS 3 key benefits
1 Architecture expertise
WHY? We are the game changer for our from hands-on projects
client's data platform projects
2 Delivery of tailor-made
HOW? Maximum automation, maximum data platforms
efficiency, maximum quality!
3 Integrated Teams - Like
WHAT? We build innovative data platforms a Rowing team, perfect
based on our blueprints, assets and tools. alignment and
interaction.

Tools and Continuous


Expertise
Blueprints Optimization
Key enabler for the Tools and Blueprints Expert group for
implementation of are continuously modern data platforms
modern data platforms optimized to the from technical
at a high speed and customer and project’s implementation to
quality. needs. project management
and organization
1
Project

6
A federal office moves
• Hardware change from Oracle Exadata to Private Cloud Appliance X9-2
• Consolidation of the virtual and bare-metal application platform
• Application server: Oracle Weblogic
• Database: Oracle 19c
• POC in year 2022 on PCA X8-2 e.g., with JMeter transaction measurements

Weblogic on SLES Server


90 Applikationsserver
Oracle Private Cloud
Appliance X9-2
Oracle on Exadata
4 RAC One Node DB + ADG

7
2
Machine

8
Private Cloud Appliance X9-2 Factsheet
• 180 – 1,080 OCPUs- Intel Xeon Processors (Icelake - 2593.952 MHz)
• 3 – 18 TB Memory
• 100 TB – 8.4 PB combined Block, File und Object Storage
• NFS v3, v4.1, SMB 3.1/2.0
• Flex Shapes or Fixed Shapes
• Oracle Linux, Oracle Solaris, 3rd Party Linux und Microsoft Windows.
• Oracle ZFS Storage ZS9-2 Dual-Controller HA Cluster mit 2x 24-core 2.1 GHz Intel® Xeon®
• Network 100Gbit redundant
• KVM Virtualization Layer
• OVN (Open Virtual Networking) for Open vSwitch (OVS) – Software-defined Networking

9
Front and rear of of a newly racked PCA

10
Components - basic equipment

11
Data center network connection

12
Storage types

High Capacity – DE3-24C High Performance – DE3-24P


– 20x 18 TB, SAS-3, 3.5-inch, 7200 RPM HDDs
– 20x 7.68TB SAS-3 2.5-inch SSDs
– 2x read SSD accelerator
– 2x write SSD accelerator
– 2x write SSD accelerator

13
Enclave concept
• Separation of administration from the user
• Separate web based user interfaces

Tenancy Netzwerk
Rack-Provisioning Compute
Netzwerkeinstellungen Storage
Upgrade und Patching DNS
ASR Phone Home IAM
Exadata Networks Governance

https://adminconsole.<pca-name>.<domain> https://console.<pca-name>.<domain>

14
Service Enclave User Interface SEUI

15
Compute Enclave User Interface CEUI

16
From ZFSSA to the virtual machine

17
Just like in the Oracle Cloud?

Architecture and terms Network configuration in OCI CLI command line tool
• Compute Instance, File System, the Compute Enclave • Slow response time
Storage, Image etc. • Routing, Security List, Local- • Certificate used
Peering Gateway etc.

API for interaction with GUI


tools like Terraform • Style not quite up to date
• with reduced command set • Missing search functions
• Destroy actions clean up in the • Missing order functions
wrong order, e.g. with File
System Storage the subnets are
removed "from under the feet"

18
Prometheus – Grafana - Stack

Source: Oracle PCA X9-2 Monitoring and Alerting with Grafana 19


HA or not?
• Scenarios
– 1. Manual migration for Compute Node maintenance or patching
• When start the migrateVM command, running virtual machines are moved to other available
FDs. There is no virtual machine downtime.
– 2. Compute Node outage < 10min
• The virtual machines are restarted automatically as soon the Compute Node is available.
– 3. Compute Node outage > 10min
• According documentation: A compute node is considered failing when it has been
disconnected from the data network or has been in powered-off state for more than 10
minutes.
• Polling is done in 5min interval. After 2 attempts, internally the Compute Node is change to
FAIL state and agents in EVACUATING mode. When the evacuating starts, instances are flagged
in CEUI with state MOVING and then RUNNING on the new Compute Node. Evacuating results
in downtime of the virtual machines.

20
3
Launch

21
PCA provision – Installation Checklist

Day0 Compute Tenant


Setup in data
Node
center Configuration Configuration
Provisioning

22
Fault domain status display
• Display utilization of the fault domains:

PCA-ADMIN> getFaultDomainInfo
Command: getFaultDomainInfo
Status: Success
Time: 2023-05-18 00:15:18,204 UTC
Data:
id totalCNs totalMemory freeMemory totalvCPUs freevCPUs notes
-- -------- ----------- ---------- ---------- --------- -----
UNASSIGNED 0 0.0 0.0 0 0
FD1 1 984.0 392.0 120 40
FD2 1 984.0 744.0 120 48
FD3 1 984.0 775.0 120 84

23
HA or not?
• Scenarios
– 1. Manual migration for Compute Node maintenance or patching
• When start the migrateVM command, running virtual machines are moved to other available
FDs. There is no virtual machine downtime.
– 2. Compute Node outage < 10min
• The virtual machines are restarted automatically as soon the Compute Node is available.
– 3. Compute Node outage > 10min
• According documentation: A compute node is considered failing when it has been
disconnected from the data network or has been in powered-off state for more than 10
minutes.
• Polling is done in 5min interval. After 2 attempts, internally the Compute Node is change to
FAIL state and agents in EVACUATING mode. When the evacuating starts, instances are flagged
in CEUI with state MOVING and then RUNNING on the new Compute Node. Evacuating results
in downtime of the virtual machines.

24
4
Patching &
Upgrade

25
Patching and upgrade requirements
• Requires a CSI and ULN registration
• A local ULN mirror with activated channels
• Space for patches on the ULN mirror
• Via GUI or PCA-ADMIN CLIT
• The order in the patch and upgrade documents must be observed

Oracle recommends that systems on 3.0.1 code be patched to the


latest 3.0.1 ULN (301-b748945) prior to upgrading to 3.0.2.

26
Patching and upgrade information
• [PCA 3.x] Private Cloud Appliance Component Upgrade by Release Matrix (Doc ID
2907892.1)
• Not every component is patched in every release.
• The 3.0.2 ISO contains the latest version.Read the Manual...

27
Patching and upgrade infrastructure

root@<vm-pca-02>:~ [PROD]# uln-channel --list PCA-ADMIN> setupstreamUlnMirror ...


pca302_x86_64_mn
pca302_x86_64_hypervisor PCA-ADMIN> syncUpstreamUlnMirror
pca302_x86_64_containers
pca301_x86_64_oci
pca301_x86_64_mn
pca301_x86_64_hypervisor
pca301_x86_64_fw
pca301_x86_64_containers

root@<vm-pca-02>:~ [PROD]# ls
pca-3.0.2-b819070.iso

28
Patching – ULN Mirror Bug
• The local ULN mirror on the same PCA is not recognized correctly - an external ULN mirror is
required.
• Service request open.

29
Migrate before patch
• Move the virtual machines away first before patching the Compute Nodes.

-- evacuate
PCA-ADMIN> migrateVm id=c44901a6-3793-43bd-a3f8-3c7feab12a50
-- lock
PCA-ADMIN> provisioningLock id=c44901a6-3793-43bd-a3f8-3c7feab12a50
PCA-ADMIN> maintenanceLock id=c44901a6-3793-43bd-a3f8-3c7feab12a50
-- upgrade
PCA-ADMIN> upgradeCN hostIp=100.96.2.66 imageLocation="http://://<ULN-Mirror-
Hostname>:10001/yum/iso/pca-3.0.2-b819070.iso" isoChecksum="7a18a6..83e897de024859...
-- unlock
PCA-ADMIN> maintenanceUnlock id=c44901a6-3793-43bd-a3f8-3c7feab12a50
PCA-ADMIN> provisioningUnlock id=c44901a6-3793-43bd-a3f8-3c7feab12a50

30
5
Management

31
PCA-ADMIN
• Management of the service enclave
• e.g. patching, compute node provisioning

PCA-ADMIN> list ComputeNode


Command: list ComputeNode
Status: Success
Time: 2023-05-21 18:40:25,795 UTC
Data:
id name provisioningState provisioningType
-- ---- ----------------- ----------------
c44901a6-3793-43bd-a3f8-3c7feab12a50 pcacn001 Provisioned KVM
fb2342b9-ff8c-4a91-adb1-99102ecbe11e pcacn002 Provisioned KVM
159bb330-59ca-4e88-8d0b-2a4e4ec6e170 pcacn003 Provisioned KVM

32
OCI CLI
• Online and offline installation
• A certificate is required - must be initially "fetched" from the PCA.
• Not all queries known from the cloud are supported.
• Responds slowly, at least for us...

C:\Users\A60004958\.oci> oci iam group list --profile PCA01 --cert-bundle C:\Users\A60004958\.oci\ca-pca01.crt


{
"data": [
{
"compartment-id": "ocid1.tenancy.AK00958122.PCA01.p2jt2j7vxbq7u9gjdh6qf9z7ivrjkhinagzz8wsgc6yr2rkmloak00090146",
"defined-tags": {},
"description": "Default Administrator Group",
"freeform-tags": {},
"id": "ocid1.group.AK00958122.PCA01.oncxcjk14pkc8tic74hwkwkr33rgg5kzi35chzt6266oolbnca4e1s546j4b",
"inactive-status": null,
"lifecycle-state": "ACTIVE",
"name": "Administrators",
"time-created": "2023-02-27T11:34:56.305360+00:00"
},

33
Terraform
• Also requires the PCA certificate.
• Not 100% compatible.
• Not reliable, missing modules, wrong order when clearing, etc.
• Terraform Provider Region Variable:

variable "region" {
type = string
default = "pca01.<meine-domain>"
}

34
6
VM Provisioning

35
Components for starting an instance

Compartment Virtual Cloud Network Key Pair Image Shape


(VCN)
The root organizational Virtual version of a SSH key for server Template of a virtual A template that
unit & other conventional network, access. hard disk containing determines the number
organizational units. with subnets, routing the operating system of CPUs, the amount of
tables and gateways. and other software. RAM and other
Alternatives: Oracle resources.
PCA provided images, Flex shapes possible.
BYOI.

36
Minimum network configuration

37
A few minutes later

38
VM with a second network interface
• The 2nd vNIC must be configured in the same way as OCI at OS level with a script.
• This also applies to routing - but not for a 2nd IP address on the 2nd vNIC - no NMCLI
– https://docs.oracle.com/en-us/iaas/Content/Resources/Assets/secondary_vnic_all_configure.sh

root@host-01:secondary_vnic [PROD]# ./secondary_vnic_all_configure.sh


CONFIG ADDR SPREFIX SBITS VIRTRT NS IND IFACE VLTAG VLAN STATE MAC
VNIC
- 10.179.159.12 10.179.159.0 27 10.179.159.1 - 0 ens3 - - UP 00:13:97:89:4d:e6
ocid1.vnic.AK00958122.PCA01.zb36shd9ais09h4triqmxpg0k4eiz2e998kwc862ud4hr5wagpgebjy8jpv1
- 10.2.3.12 10.2.3.0 25 10.2.3.1 - 1 ens4 - - UP 00:13:97:60:4a:c4
ocid1.vnic.AK00958122.PCA01.mpb1iqo6npr1azbpk25m1yvn2hrmh4pnxzskhtipulmlvyzrqiz1l5pwh49v

root@host-01:secondary_vnic [PROD]# cat add_route.sh


#!/bin/bash
EXIST=$(ip route show 10.2.3.128/25 | wc -l)
if [ $EXIST -eq 0 ]
then
ip route add 10.2.3.128/25 via 10.2.3.1 dev ens4
fi

39
Console Connection – Port 5000

C:\> plink.exe -ssh -i <path and file for your SSH private key> -P 32222 -L 5000:localhost:5000
mlockcxjr0cabcdhxdfpdb24vdxdrya@<your pca management vip>
vnc@ocid1.instance.AK02134552.PCA01.cpyc0v4abczd647dfhsbs9fslraj4b38agmeoy37gtkdenry2p7zb5zu18f

40
7
Various

41
Automation Matters
• SSH key distribution, DNF updates etc.

42
My Oracle Support Experience - 75 SRs later
• In 95% of all cases, a support bundle is required when uploading.
• Zoom sessions bring more than SR ping-pong.
• The Oracle crew in the USA seems to be more technically savvy - time zone on Zoom calls!
• Meanwhile they "know" each other in the calls.
• SEV2 takes a long time, if SEV1 then only with 7/24.
• A lot is solved in the internal MySQL database.
• Blocker so far:
– Day0 configuration failed
– VMs no longer bootable after shape customization
– Local peering gateway routes no longer work after an LPG in the same subnet is deleted.
– DNS cannot be deleted.

43
Five tips to take away

01 02 03 04 05
Never use 3.0.1. Complete the Plan network, Work on bugs Reduce project
checklists in full order DNS consistently & risk and plan
and ask if entries, firewall, be patient. sufficient
anything is routing etc. reserves.
unclear.

44
“Gring abe u vou seckle”

*** 1997 - Anita Weyermann – World Championship bronze medal 1500m

45
Merci vöumou

46

You might also like