Thursday Oct 16, 2008

Aufzeichnungen Vorträge SourceTalk 2008

Einige Vorträge sind während den SourceTalk Tagen 2008 mit dem TeleTeachingTool aufgezeichnet worden. Seit kurzem stehen diese auf den Webseiten der Veranstaltung zur Verfügung. Darunter auch mein  Erfahrungsbericht "Portierung von Open HA Cluster auf OpenSolaris": ohne Video (12MB) und mit Video (53MB).

In meinem Blogeintrag vom letzten Jahr wird erklärt, wie man die Aufzeichnung auch unter Solaris anschauen kann. Das ttt.ksh Skript habe ich aktualisiert, der MP3 Ton wird damit korrekt abgespielt.

Meinen Präsentation kann man auch gerne als PDF runterladen.

Business Continuity and Disaster Recovery Webcast

In the past months I talked in various presentations about Open HA Cluster and Solaris Cluster. The emphasis has been set to give an introduction into the Solaris Cluster architecture and the fact that this product is now fully Open Source, describing the various possibilities to contribute and giving an overview about already existing projects.

Most talks started with a note that in order to achieve high availability for a given service, it is not just enough to deploy a product like Solaris Cluster. The same is true if you look for business continuity and disaster recovery solutions. Besides the service stack in the backend, it is not only necessary to analyze the infrastructure end-to-end to identify and eliminate single points of failure (SPOF), but also to have a close look at people (education), processes, policies and clearly defined service level agreements.

Thus I am happy to see a webcast hosted by Hal Stern about Business Continuity and Disaster Recovery, which gives a nice introduction about this holistic topic. More information can be found at a dedicated page about Sun Business Continuity & Disaster Recovery Services.

Start with a Plan Not a Disaster! :-)

Monday Sep 29, 2008

Building Open HA Cluster on OpenSolaris

The first source code tarball for project Colorado, to compile the Open HA Cluster core framework on the OpenSolaris binary distribution, is available. It also contains a set of updated scbld tools needed to run with KSH93 on OpenSolaris.

Have a look at the detailed instructions on how to get it compiled on OpenSolaris 2008.05.

The following parts have been disabled to compile:
  • dsconfig
  • agent_snmp_event_mib/mib
  • spm
  • adminconsole
and are thus not part of the created SVR4 packages, since the source code for those components relies on headers/libraries not being available for the OpenSolaris binary distribution.

Next steps: design on how to create/send IPS packages instead of building the SVR4 packages. Any help is welcome. Have a look at the Wiki page for the current plans.

Friday Sep 19, 2008

Einladung zum SourceTalk 2008

Vom 23. bis 25. September finden am  Mathematisches Institut der Universität Göttingen zum vierten mal die SourceTalk Tage 2008 statt. Am Mittwoch, 24. September ist eine Vortragsreihe OpenSolaris gewidmet. Darin werde ich um 11:15 Uhr einen Vortrag zum Thema  "Erfahrungsbericht - Portierung von Open HA Cluster auf OpenSolaris" halten.

Der Erfahrungsbericht handelt von den bisherigen Aktivitäten beim OpenSolaris Projekt Colorado. Einen Vorgucker kann man in meinem zugehörigen Blogeintrag nachlesen (in Englisch).

Zusammen mit meinen anderen Kollegen werden wir gegen 17 Uhr auch zu einer "Meet the Experts" Session für Fragen aller Art um das Thema Solaris, OpenSolaris und Open HA Cluster zur Verfügung stehen.

Man sieht sich in Göttingen!

Monday Sep 08, 2008

Project Colorado: Running Open HA Cluster on OpenSolaris

If you have ever asked yourself why Solaris Cluster Express (the Open HA Cluster binary distribution) is running on Solaris Express Community Edition and not yet on the OpenSolaris binary distribution, then you might be interested in project Colorado. This project is endorsed by the HA Clusters community group and has as its goal to provide a minimal and extensible binary distribution of Open HA Cluster that runs on the OpenSolaris binary distribution.

As always, the devil is in the details. Here are some of the reasons why this isn't just a "recompile and run" experience:

  • Package system did change:
    One of the big changes with the OpenSolaris binary distribution is the switch to use the image packaging system (IPS). As Stephen Hahn explains in some of his blogs, it is a key design criteria of IPS to not include something like the rich scripting hooks found within the System V packaging system.
    Have a look at our initial analysis on how and where those scripting hooks are currently used. Note that installation is one aspect, uninstall another. In the more modular world of network based package system this means dependencies to other packages need to be explicit and in fine granularity. And within the Cluster world, this needs to be in agreement across multiple nodes. Since we also deliver kernel modules and plan to deliver a set of optional functionality, this brings the challenge on where to put the tasks that have been within the scripting hooks for configuration and uninstallation.
    Further, if you know the details of the Open HA Cluster build steps (which is similar to the ON build steps), you know that building the packages via the pkgdefs hierarchy is an important element to later then also assemble the deliverables for the DVD image. Since we do not just want a 1:1 conversion of our existing packages into IPS packages, we need to come up with a mechanism to deliver the IPS packages into a network repository as part of the build step, since there is no delivery of a DVD image going forward.
  • Zones behaviour did change:
    The native zones brand type introduced in Solaris 10 did inherit the packages to install from the global zone. For Solaris Cluster this means that cluster related packages got automatically installed and configured when a zone got created. On OpenSolaris this behavour did change. Currently there is only the ipkg brand type available. This means that out of the box none of the ways Solaris Cluster integrates with zones works without various changes needed.
  • KSH93 vs KSH88:
    OpenSolaris did switch to KSH93 for /bin/sh and /usr/bin/ksh, the previous KSH88 shell is no longer available. Again the devil is within the details. KSH93 does deal e.g. with local vs. global variables differently. Some scripts required for building (like /opt/scbld/bin/nbuild) or to install Solaris Cluster (like /usr/cluster/bin/scinstall) do break with KSH93. The full set of impacted scripts needs to get determined.
  • The Sun Java Web Console (webconsole) is not part of OpenSolaris:
    The Java Web Console provides a common location for users to access web-based system management applications. The single entry point that the web console provides eliminates the need to learn URLs for multiple applications. In addition, the single entry point provides user authentication and authorization for all applications that are registered with the web console. All web console-based applications conform to the same user interface guidelines, which enhances ease of use.
    Those are all reasons why Solaris Cluster did choose to deliver its browser user interface (BUI) named Sun Cluster Manager using and leveraging the Sun Java Web Console framework. In addition it also uses the Web Application Framework (JATO) and Lockhart Common Components.
    Since those components are not available for the OpenSolaris binary distribution, this brings the challenge which management framework to use (and develop against) instead. Of course a substitution is not trivial and can be quite time consuming. And it is not sure if existing code can get reused.
  • Dependencies on encumbered Solaris code:
    Besides components that the OpenSolaris binary distribution did choose not to deliver anymore, there is the goal to create a freely redistributable binary distribution. This means OpenSolaris does also not deliver the Common Desktop Environment (CDE), which includes the Motif libraries. The adminconsole delivered with Solaris Cluster does use Motif and ToolTalk.
    The adminconsole tools need to get redesigned to use libraries available within OpenSolaris.
  • No SPARC support for OpenSolaris yet:
    The OpenSolaris binary distribution is currently only available for the i386 platform. Solaris Express Community Edition does provide also a distribution for SPARC. While this is not a strong inhibitor to run on OpenSolaris, it is nonetheless a reason why providing Solaris Cluster Express is still a requirement.
    The good news is that there are plans to provide SPARC support for OpenSolaris within future releases.
  • OpenSolaris Installer does not support network installations yet:
    While this not a direct problem, it becomes to one if you consider that developers for Open HA Cluster are distributed around the world and most engineers have only access to remote systems, without the possibility to perform an installation requiring keyboard and monitor.
    Again the good news is that there are plans to add support for automated network installations within future OpenSolaris releases.

Besides solving the above challenges, we also want to offer some new possibilities within Colorado. You can read the details within the umbrella requirement specification. There are separate requirement specifications to outline specific details for the planned private-interconnect changes, cluster infrastructure changes involving the weaker membership, enhancements to make the proxy file system (PxFS) optional, and changes to use iSCSI with ZFS for non-shared storage configurations.

You can provide feedback on those documents to the ha-clusters-discuss mailing list. There is a review scheduled with the Cluster Architecture Review Committee on 18 September 2008, where you are invited to participate by phone if you are interested.

Monday Jun 23, 2008

Solaris8 and 9 Container on Solaris Cluster

If you are still running applications on Solaris 8 using SPARC hardware and maybe even using Sun Cluster 3.0, then you should get a plan ready to upgrade to a more recent releases like Solaris 10 and Solaris Cluster 3.2 02/08.

As you might know the last ship date for Solaris 8 was 02/16/07, the end of Phase 1 support is scheduled for 3/31/09.

Sun Cluster 3.0 is also reaching its end of life as announced within the Sun Cluster 3.2 Release Notes for Solaris OS.

In case you can not immediately upgrade to a newer Solaris release, Sun recently announced the Solaris 8 Container, which introduces the solaris8 brand type for non-global zones on Solaris 10. The packages can be freely downloaded for evaluation and would require subscription for the RTU and support.

While the solaris8 brand type is NOT extending the support life for Solaris 8, it allows you a phased approach for migrating to Solaris 10 and leveraging new hardware platforms while the application still runs within a Solaris 8 runtime environment.

The Sun Cluster Data Service for Solaris Containers does support the solaris8 brand type for Sun Cluster 3.1 08/05 with Patch 120590-06 and for Solaris Cluster 3.2 with Patch 126020-02 and newer.

Before going through the virtual to physical (p2v) migration, the existing Sun Cluster 3.0 configuration and packages need to get removed. See the Sun Cluster 3.0 System Administration Guide for more details on how to achieve that. This also means that there is no cluster framework running within the solaris8 brand type zone. Therefore existing standard agents can not be used. However, the sczsh component of the HA Container agent can be used to manage an application running within that solaris8 branded zone.

Of course any migration should get carefully planned.

The same works and is true for the recent announced Solaris 9 Containers. Patch 126020-03 introduces support for the solaris9 brand type for the HA Container agent on Solaris Cluster 3.2.

Tuesday Jun 03, 2008

OpenSolaris Usergroup in Berlin: Open HA Cluster Vortrag

Meine Anwesenheit in Berlin wegen dem LinuxTag 2008 konnte ich glücklicherweise mit einem Vortrag bei der hiesigen OpenSolaris User Group verbinden. Am Mittwoch, 28.5.2008 um 19:30 Uhr, sollte der Vortrag stattfinden. Allerdings sorgten erstmal ein paar Missverständnisse dafür, daß es trotz pünktlicher Ankunft in der Tucholskystr. 48 gegen 19 Uhr zu einer viertelstündigen Verspätung kam :-( Durchaus nicht meine Art!

War aber erleichtert festzustellen, daß die Teilnehmer geduldig waren - und mich noch dazu bis etwa 21:30 Uhr haben vortragen lassen! Eine Stunde mehr als geplant - das entgegengebrachte Interesse war wirklich toll und hat zu interessanten Fragen und Diskussionen geführt.

Habe mich dann noch dem traditionellen Abendessen beim Inder um die Ecke angeschlossen, welches zu einer Fortsetzung der Gespräche und Austausch von Anekdoten rund um Open HA Cluster, OpenSolaris und die Wunder der IT bis etwa 24 Uhr eingeladen hat.

Insgesammt kann ich nur empfehlen die OpenSolaris User Group in Berlin als Interessierter oder Vortragender zu besuchen, nette und interessierte Leute, vom Anfänger bis zum alten Hasen alles dabei :-)

Anbei mein Vortrag zum runterladen. Mein Dank geht an Franz Timmer und Detlef Drewanz für diese Gelegenheit und an die Teilnehmer für das rege Interesse!

LinuxTag 2008: Hochverfügbarkeit mit Open HA Cluster

Sun Microsystems hatte einen Stand als Aussteller und Sponsor auf dem LinuxTag 2008 in Berlin. Dort gab es unter anderen auch einen Arbeitsplatz zu Open High Availability Cluster, an dem man mit Hartmut Streppel (Mi/Do), Heiko Stein (Fr/Sa), Eve Kleinknecht (Do/Fr/Sa) und mir (Mi/Do/Fr/Sa) alle Informationen und Fragen rund um das Thema Hochverfügbarkeit diskutieren und Demonstrationen live anschauen konnte. Vielen Dank an die Kollegen für die tolle und kompetente Unterstützung!

Am Samstag gab es dann innerhalb des OpenSolaris Track den Vortrag zu Hochverfügbarkeit mit Open HA Cluster, inclusive live Demonstration eines Serviceschwenk von HA PostgreSQL (Datenbank, IP, zpool (auf USB-Stick)) zwischen zwei Solaris Zonen, konfiguriert auf meinem Laptop als Single-Node Cluster (Solaris Express Community Edition 01/08 und Solaris Cluster Express 02/08). Eine prima Umgebung um sich mit der Technologie vertraut zu machen oder Agentenentwicklung zu betreiben.

Die gleiche Konfiguration kann man übrigens auch innerhalb einer VirtualBox installieren. Diese Option hatten wir auf dem Ausstellungsstand ebenfalls vorgeführt.

Anbei die Präsentation zum download.

Samstags gab es dann auch die Keynote von Ian Murdock, in welcher unter anderem die dritte Open Source Phase zu Open HA Cluster angekündigt wurde: ca. 2 Millionen Zeilen Quellcode des Solaris Cluster Core Framework! Ab jetzt ist also der komplette Quellcode von Open HA Cluster verfügbar! 

Es gibt zwei kleine Video von Terri Molini mit Eindrücken vom LinuxTag und von der LinuxNacht und Keynote.

Insgesammt hat sich der LinuxTag 2008 für mich gelohnt. Es gab viele neue Kontakte und interessante Gespräche. Nicht zuletzt hoffe ich das wir das Thema Open HA Cluster als nützlich und relevant darstellen konnten. 

Tuesday May 13, 2008

Einladung zum Linuxtag 2008

Vom 28. bis 31. Mai 2008 findet die 14. Auflage Europas führender Veranstaltung zu Linux und Open Source auf dem Berliner Messegelände unter dem Funkturm statt. Der LinuxTag 2008 zieht rund 10000 Messebesucher an, darunter IT Entscheider, Anwender, Entwickler und diverse Open Source Projekte und Gemeinschaften aus über 30 Ländern.

Mit über 80 Aussteller von freien Projekten gehört der LinuxTag zu den weltweit größten Messen für Open Source Software, und bietet Gelegenheit direkt mit den Entwicklern in Kontakt zu treten um aktuelle Trends zu diskutieren.

Sun Microsystems wird auf dem LinuxTag einen Stand mit mehreren Arbeitsplätzen zu diversen Open Source Projekten ausstellen. Darunter wird auch das Open High Availability Cluster Projekt sein. Dort gibt es Gelegenheit einen Überblick zu aktuellen Cluster Projekten zu bekommen, über jedes erdenkliche Cluster Thema von Interesse zu diskutieren, bis hin zu Live Demonstrationen oder Quelltext Analysen.

Hartmut Streppel und ich halten am Mittwoch, 28. Mai 2008, ein ganztägiges Tutorium über Open HA Cluster und Flying Containers.

Am Samstag, 31. Mai 2008, werde ich eine Präsentation zum Thema "Hochverfügbarkeit mit Open HA Cluster" innerhalb des OpenSolaris Themenblocks geben. Details findet man in der Agenda.

Hier ist eine persönliche Einladung von mir (vielen Dank an Deirdré Straughan für Ihre Geduld bei der Aufnahme):

Freue mich möglichst viele von Euch in Berlin zu sehen!

Monday Apr 21, 2008

Flying Container Tutorium bei den Frühjahrsfachgesprächen der GUUG

Wie bereits in meinem vorherigen Blog erwähnt, konnten Hartmut Streppel und ich ein ganztägiges Tutorium bei den Frühjahrsfachgesprächen 2008 der GUUG bestreiten. Hier nun der Verweis auf den zweiten Teil "Flying Container" von Hartmut:

  • Überblick Solaris Container mit Demo
  • Flying Container vs. Flying Service
  • Flying Container Setup mit Demo
  • Flying Container und Disaster Recovery mit Demo

Wednesday Apr 09, 2008

Visualizing Solaris Cluster Resource Group and Resouce dependencies/affinities

Solaris Cluster did improve a lot with the new command line interface and the updated Solaris Cluster Manager Browser User Interface (BUI) within the 3.2 and 3.2 02/08 release. The BUI has a Topology view for resource groups and resources in order to visualize the various dependencies and affinities the cluster framework offers.

It is also a good idea to document a cluster configuration carefully. For that purpose an online view within a BUI is not always helping. And sometimes you want to see all dependencies/affinities within one graph.

Fortunately the cluster framework does offer the scha_cmds(1HA) API, which makes it possible to retrieve all informations out of the cluster and use them e.g. in shell scripts. And there are already tools available as open source which do a great job in visualizing graphs. One tool I selected is Graphviz, which is available for a lot of platforms, including Solaris. The Companion  Software CD for Solaris does provide a ready to install package for SPARC and x86.

I developed a shell script which extracts the necessary informations out of the cluster framework and creates two files, one for the resource groups with dependencies and various affinities, and one for the resouces with the various dependencies. The files contain the output ready for the dot language consumed by Graphviz. Here you can see some example outputs, in this case converted into png files. You can click on the images to enlarge them for better view.

<br>Resource group dependencies graph for the evans cluster
Figure 1: Resource dependencies graph for the evans cluster

Resource dependencies graph for the evans cluster
Figure 2: Resource dependencies graph for the evans cluster

Resource dependencies graph for the geo-muc1 cluster
Figure 3: Resource group dependencies graph for the geo-muc1 cluster

Graph to visualize resource dependencies for the geo-muc1 cluster - click to enlarge
Figure 4: Resource dependencies graph for the geo-muc1 cluster

Let me know if you would find such a script useful! I plan to publish it under a freshly proposed OpenSolaris project for Open HA Cluster utilities - if I get the necessary approval that is :-)

Wednesday Apr 02, 2008

Detailed Deployment and Failover Study of HA MySQL on a Solaris Cluster

Krish Shankar from ISV engineering published a very nice and detailed blog illustrating the deployment process of MySQL on a Solaris Cluster configuration. It also focuses on regression and failover testing of HA MySQL, and explains in detail the tests that were performed.  Solaris 10 fully supports MySQL, and the HA cluster application agent for MySQL on Solaris Cluster.

Screencast on how to install a single node cluster available!

In addition to the online instructions on how to setup a single node cluster with Solaris Express and Solaris Cluster Express, there is now also a screencast available, which guides you through the necessary steps. So if you learn better through visual ways, check it out!

The screencast is also linked on the Open HA Cluster page. You will need a Shockwave Flash plugin for your browser to view it. The plugin installed with firefox on Solaris Express Developer Edition 01/08 works great to display it.

Thursday Mar 20, 2008

Open HA Cluster Tutorium bei den Frühjahrsfachgesprächen der GUUG

Am 13. März 2008 haben Hartmut Streppel und ich zusammen ein Tutorium über "Open HA Cluster" und "Flying Container" auf den Frühjahrsfachgesprächen 2008 an der Hochschule München abgehalten. Die Veranstaltung wird jährlich von der German Unix User Group organisiert.

Der erste Teil des Tutorium ist nun verfügbar, die 65 Seiten enthalten die Präsentation und Sprecherkommentare in Deutscher Sprache. Hier die Agenda im Überblick:

  1. Einführung in die Solaris™ Cluster Architektur
    • Core Cluster Framework
    • Data Services (Agenten)
    • Geographic Edition
  2. Einführung in das Open HA Cluster Projekt
    • HA Clusters Community Group
    • Open HA Cluster Agenten + aktuelle Projekte
    • Open HA Cluster Geographic Edition
    • Build Demo
  3. Einführung in die Template basierende Agentenentwicklung
PS: Der zweite Teil des Tutorium ist nun auch verfügbar.

Friday Feb 22, 2008

Solaris Cluster Express 02/08 available!

Solaris Cluster Express 02/08 is available for download. It is build to run on Solaris Express Developer Edition 01/08. Have a look at the Release Notes for more details. Two things I want to highlight:

  1. This combination (SCX 02/08 and SXDE 01/08) is a good fit to get started and engaged with the HA xVM project, since SXDE 01/08 enables to configure the Sun xVM hypervisor for x86-64.
  2. SCX 02/08 delivers the HA Informix agent to manage Informix Dynamic Server, which is the first data service developed within the HA Clusters Community Group with community participation.
Happy Clustering!

This Blog is about my work at Availability Engineering: Wine, Cluster and Song :-) The views expressed on this blog are my own and do not necessarily reflect the views of Sun and/or Oracle.


« July 2016