
Anthony's Blog: Using System Storage - An Aussie Storage Blog
Storwize V7000 Training
I was recently asked to travel to Taiwan to teach a Storwize V7000 class.
I have done a fair amount of travel with IBM, but this was my first visit to Taipei. I must say I loved the energy and enthusiasm of the place. While there I taught a new course: Storwize V7000 Implementation (SSE01) For those who are familiar with IT education, IBM Systems and Technology Group (STG) Education comes in 4 forms: 1) Lecture Only 2) Lecture with simulated labs 3) Lecture with remote remote labs 4) Lectures with local labs This particular course is the third type, meaning it consists of lectures combined with remote labs. This means we connected via VPN to actual machines located in Raleigh, North Carolina. We had access to multiple pods, where each pod has its own Storwize V7000, plus a DS3400, SAN switches and hosts running AIX and Windows. In addition IBM Taiwan secured a demonstration Storwize V7000 for us to have in the classroom (to see and touch and do initial config on). This was the first time the course had been offered in Taiwan and everyone was very excited to be part of it. The agenda looked something like this: Day 1: • Unit 1 - Introduction to the IBM Storwize V7000 • Unit 2 – Enclosures and RAID Arrays * Labs: initial config, accessing the GUI and CLI, configuring internal and external storage and defining attached hosts Day 2: • Unit 3 – Fabric Zoning • Unit 4 – Thin provisioning and Volume Mirroring • Unit 5 – Data Migration Facilities part 1 * Labs: Accessing storage from AIX and Windows hosts via fibre channel and iSCSI, Easy Tier Part 1, migration part 1 Day 3: • Unit 5 – Data Migration Facilities part 2 • Unit 6 – Easy Tier • Unit 7 – Managing IBM Storwize V7000 * Labs: Migration part 2, Thin provisioning, RAID options, Easy Tier part 2 By the conclusion of the course the students had had many hours enjoying the new GUI, which I thought was really important. Hands on operation of a system is the best way to learn. Those that already knew SVC found the course quite easy, but everyone felt that they had learnt a lot. The chance to use Easy Tier and do a live migration was really important. I certainly enjoyed teaching the course and look forward to teaching it again! For those who live in Australia, yes we are going to offer the course locally. As soon as I have dates I will share them with you.
|
||
All good things to those who wait
Its been a while since my last blog entry... but don't worry (if you were). I have not given up on my blog.
![]() I took three weeks off for vacation: to recharge my batteries and reconnect with my family. It was a great break which sadly had to come to an end. I have three rules when I go on leave:
The first few days were hard ( without meaning to belittle the concept, its a bit like battling an addiction), but by the second week I had stopped thinking about work altogether and really started to enjoy myself. Not that I don't enjoy working for IBM (I love it), but we all need to take regular breaks to stay sane. Of course there is a penalty for ignoring your email for 3 weeks..... about 1000 emails to go through on return. The good news is... I am now ready for take off. Lets have a great 2011! . And if your feeling like this guy, maybe its time for a break! |
||
Visio Cafe updates still cooking along
The demand for Visio stencils is very strong, which says something about how popular Microsoft Visio is.
Its good because it means people are documenting their environments. I am being asked for ETAs for stencils that are not yet in the IBM Disk zip file found on Visio Cafe. The most recent update on Jan 17, 2011, added more nSeries stencils as can be seen here or as reproduced below:
I am getting a lot of requests right now for the Storwize V7000 stencils. The good news is that I have them. I am unsure why they are not on Visio Cafe yet, so I am posting them here as a zip file. Just right click on the link and save as, then unzip and your good to go. . // Edit Feb 3, 2011 - I changed the file to one with correct IBM naming // . Enjoy. |
||
IBM Storage Site Updates as of February 7, 2011
Thanks again to Rob Jackard of the ATS Group for supplying me with his excellent summary of support site updates.
Here are some major points to be aware of.
AIX / Misc:(2011.01.27) IBM TechDoc- Power Systems SAN Multipath Configuration Using NPIV. http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/WP101761 (2011.01.19) SDDPCM path selection may fail even though non-preferred paths are available. http://www-01.ibm.com/support/docview.wss?uid=ssg1S1003753
DS3000 / DS4000 / DS5000:(2011.02.04) IBM Retain Tip# H193288- DS3000, DS4000, DS5000 controllers will reboot every 825 days.http://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=MIGR-5075377(2011.02.02) IBM Retain Tip# H201983- Recommended drive firmware upgrade – IBM Disk Systems.NOTE: Disk Systems that may be effected- DS4000 series, DS5000 series, DS3950 depending on installed disk drive models.http://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=MIGR-5086967(2011.02.02) IBM Retain Tip# H196488- DS3500/DS3950/DS5000 systems not working with Brocade on 8 Gbps host ports.NOTE: Brocade FC switches running firmware 6.2.0e or above are effected. May need to change the fillword setting from ARB to IDLE.http://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=MIGR-5083089(2011.02.02) Copy Services User’s Guide – IBM System Storage DS Storage Manager v10.70.http://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=MIGR-61173
DS8000 / DS6000:(2011.01.28) IBM TechDoc- DS8000 Host Ports and Installation Sequence.http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/TD105671N series:(2011.01.27) Data ONTAP 7.3.5 Gateway Publication Matrix.https://www-304.ibm.com/support/entdocview.wss?uid=ssg1S7003440(2011.01.27) IBM System Storage N series Data ONTAP Matrix.https://www-304.ibm.com/support/entdocview.wss?uid=ssg1S7001786(2011.01.27) IBM System Storage N series System Manager 1.1R1 Publication Matrix.https://www-304.ibm.com/support/entdocview.wss?uid=ssg1S7003021(2011.01.27) IBM System Storage N series FRU (Field Replaceable Unit) lists.https://www-304.ibm.com/support/docview.wss?uid=ssg1S7001720(2011.01.27) Data ONTAP 7.3.5 Filer Publication Matrix.http://www.ibm.com/support/docview.wss?uid=ssg1S7003439SAN:(2011.02.02) IBM Retain Tip# H196488- DS3500/DS3950/DS5000 systems not working with Brocade on 8 Gbps host ports.NOTE: Brocade FC switches running firmware 6.2.0e or above are effected. May need to change the fillword setting from ARB to IDLE.http://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=MIGR-5083089(2010.11.07) Recommended port fillword setting for 8Gbps Brocade switches with 8Gbps SAN Volume Controller or Storwize V7000.http://www-01.ibm.com/support/docview.wss?rs=591&uid=ssg1S1003699SONAS:(2011.01.17) Intel has reported Page Fault or Corrupted Data using 64-Bit application in 64-Bit NOS. NOTE: IBM is working with their suppliers to qualify and release a new BIOS which corrects this issue. https://www-304.ibm.com/support/docview.wss?uid=ssg1S1003746SVC / Storwize V7000:(2011.02.04) IBM TechDoc- EPIC and IBM Storwize V7000 – Solution Overview and Performance Benchmark.http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/WP101831(2011.02.04) SAN Volume Controller Node HDD Failures can Result in Host Multipathing Driver Failures.NOTE: This mechanism was introduced by APAR IC74194 in V5.1.0.9 PTF release. This APAR will also be included in a future V6.1.0.x PTF release.http://www.ibm.com/support/docview.wss?uid=ssg1S1003757 (2011.02.04) IBM SAN Volume Controller Code V5.1.0.9.http://www-01.ibm.com/support/docview.wss?uid=ssg1S4000955(2011.02.04) SAN Volume Controller and Storwize V7000 Software Upgrade Test Utility V5.7.http://www-01.ibm.com/support/docview.wss?uid=ssg1S4000585(2011.02.03) IBM TechDoc- SVC Global Mirror- A practical review of important parameters.http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/WP101848(2011.02.03) Cache Destage Issue in SVC V6.1.0.4 and V6.1.0.5 Code May Result in an Incorrect Report that Host Data Has Been Written Correctly.NOTE: This issue is resolved in the V6.1.0.6 PTF release.http://www-01.ibm.com/support/docview.wss?uid=ssg1S1003756(2011.02.03) IBM TechDoc- SVC 4.3.1.7 on multiple Virtual SAN environment.http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/WP101847(2011.01.27) IBM TechDoc- SVC / Storwize V7000 Performance Monitor svcmon.http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/PRS3177(2011.01.27) IBM TechDoc- IBM Easy Tier on DS8000, SVC and Storwize V7000 Deployment Considerations Guide January 2011.http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/WP101844(2011.01.25) IBM SAN Volume Controller Code V6.1.0.6.http://www-01.ibm.com/support/docview.wss?uid=ssg1S4000954(2011.01.25) IBM Storwize V7000 Code V6.1.0.6.http://www-01.ibm.com/support/docview.wss?uid=ssg1S4000953(2011.01.25) Storwize V7000 Node Canisters May Shut Down or Reboot Unexpectedly During Normal Operations. NOTE: This issue has been fully resolved by APAR IC74088 in the V6.1.0.6 release. http://www-01.ibm.com/support/docview.wss?uid=ssg1S1003748
(2011.01.19) IBM TechDoc- DS8000 with SVC Best Practices.http://www-03.ibm.com/support/techdocs/atsmastr.nsf/WebIndex/PRS4465SSPC / TPC / TPC-R:(2011.01.27) TPC 4.2 Install with DB2 9.7, fails dbSchemaInstall on AIX and Linux. http://www-01.ibm.com/support/docview.wss?uid=swg21456804
(2011.01.25) IBM System Storage Productivity Center Flash for Version 1.3. http://www-01.ibm.com/support/docview.wss?uid=ssg1S1003416
(2011.01.24) Unstable TPC 4.1 Device Server on AIX 5.3. NOTE: AIX APAR IZ52907 corrects a problem in AIX 5.3, where the "getgrent_r" routine causes heap corruption due to an invalid "free" condition. http://www-01.ibm.com/support/docview.wss?uid=swg21421812
XIV: (2011.01.21) IBM XIV Storage System Planning Guide. http://www-01.ibm.com/support/docview.wss?uid=ssg1S7003187
(2011.01.20) IBM XIV Storage System Product Overview. NOTE: This document previously was called the IBM XIV Storage System Theory of Operation. http://www-01.ibm.com/support/docview.wss?uid=ssg1S7002919
(2011.01.20) IBM XIV Management Tools (XIVGUI, XIVTop, XCLI), version 2.4.4 for all platforms. http://www-01.ibm.com/support/docview.wss?uid=ssg1S4000950
(2011.01.20) IBM XIV XCLI (only) for Linux / AIX / Solaris / HPUX, version 2.4.4. http://www-01.ibm.com/support/docview.wss?uid=ssg1S4000951
(2011.01.12) Intel has reported Page Fault or Corrupted Data using 64-Bit application in 64-Bit NOS. NOTE: The new BIOS level will be available with the XIV 10.2.4 release. http://www-01.ibm.com/support/docview.wss?uid=ssg1S1003752 ========================
|
||
XIV and QoS - Using new Performance Classes
I have been exploring some of the new features added in XIV firmware version 10.2.4.
Today I look at QoS (Quality of Service). This new feature allows you to restrict how much IO (in IOs per second or IOPS) or throughput (in megabytes per second or MBps) an individual host can generate. We do this by creating a new construct called a Performance Class. We can create up to four different Performance Classes and assign different hosts to each of those classes. You can spot the new menu item under Hosts and Clusters: ![]() The new panel contains a list of all your hosts and a new button at the top of the GUI panel called Add Performance Class: . ![]() When you create the Performance Class, you can set either an IOPS or a Bandwidth Limit. It is also possible in a single Performance Class to set both an IOPS limit AND a Bandwidth Limit. In these examples we set just a bandwidth limit. One small quirk is that the limits will be rounded to a multiple of the number of active interface modules. So don't be surprised if the numbers you enter are not the numbers that then appear. In the example below I enter 100 (for 100 MBps). However when the class got created, the value was rounded down to 96 MBps( since I have 6 active interface modules). . ![]() To prove a simple point, in this example we have created three Performance Classes, all of which limit bandwidth. You can see by their names the limit they will impose on any host moved to that Performance Class. . ![]() The exercise I performed used an AIX LPAR with an Oracle workload generator, that generates a constant workload of 150 MBps. The first step was to add the host to the 96 MBps Performance Class. ![]() Then the fun began. Monitoring of the performance of the LPAR was done with XIV Top. We moved the LPAR between performance classes to see the effect on throughput of each class. All of this was done concurrently with no host interruption. You can see from the output of XIV Top, that as the performance class was changed, the throughput was gradually throttled back (or allowed up) to that level. . ![]() . At the end of the process we then removed the LPAR from its Performance Class, returning it to an unrestricted state. This effectively allowed it to move back up to 150 MBps. . ![]() So why is this important? Some clients had a concern that non-production hosts (such as test and development servers), got an equal share of the XIV performance pie. In general this is not as issue, as the grid architecture of the XIV works very well with competing IO from multiple sources. However with the advent of very high performance machines, it is not outside the realms of possibility for an individual server to generate over 80,000 IOPS or over 1,000 MBps. I have certainly achieved this during benchmarking. If you spin-up several of these runaway hosts simultaneously, you could saturate the grid and impact more deserving hosts. So adding this feature makes sense. What is even more sensible? it is added at no extra cost via a non-disruptive code update. |
||
Some more thoughts on XIV QoS
After a chat with my one of our business partners, I thought I would add some more info about XIV and QoS:
|
||
Chickens and Eggs (and XIV )
In my last post I talked about the versions of XIV code and XIV Management GUI needed for QoS.
It leads to the question of how to match the Management GUI version to the XIV code version.... which goes with which? Many storage products have management software that is separate from the software that runs inside the box (more commonly known as the firmware). So for XIV, is there a best practice? What if I have multiple XIVs.... do I need them all to be on the same firmware version? The good news is that you can simply use the highest available version of Management GUI, regardless of what code versions the XIVs are on. In other words, if you see a new version of XIV GUI on the download site.... just upgrade to it. Check out the screen capture below. This was taken with version 2.4.4 of the Management GUI (which is the latest at February 2011): The XIV on the left is running the GA XIV code, version 10.0.0.a (which came out well over two years ago). So I am able to run one of the oldest versions of XIV code, with the latest Management GUI. This is very good. But why is the right hand machine greyed out? To mix things up. the XIV on the right is running a brand new development (and thus un-released) version of code. Because I am running a (relatively) older GUI version against a (relatively) newer XIV code version, the GUI is protecting us from a potential mismatch. This means the GUI is always backward compatible, but is not always forward compatible. ![]() Now speaking of greyed out, we saw a bug on previous versions of the GUI code, where a read-only user id would be confronted with a similar sight. The machine would be greyed out and thus unmanageable. The work around was to use the older 2.4.2b build 10 version of the GUI. The good news is that 2.4.4 contains the fix for this bug (so its another reason to upgrade). . That is all for now. Don't forget you can always logon as p10demomode to get a demo mode view of the XIV GUI. |
||
I am moving to Wordpress!
I think it had to happen eventually.
After much consideration I am moving to WordPress. Want to know why? You can find out here: |
||
Which tech related podcasts do you listen to?
Which tech related podcasts do you listen to?
Check out my latest blog post on Wordpress:
|
||
XIV VAAI Support – whats the story?
After seeing some tweets from @SFoskett regarding XIV support for VAAI, I thought I would supply an update with the information that I have to hand.
IBM added VAAI support in XIV code level 10.2.4. However a code fix has since been written for a VAAI related issue (I don't have the details), which means that we are delaying the official support till the release of 10.2.4a code (which should be early March). In the meantime we are also waiting for the release of the VMWare certified driver for VAAI and XIV, which should come out at the same time. So there are two dependancies. As soon as both things are available, I will write a new post confirming this. I have some very keen customers lined up to try it out, so I plan to blog about the results as soon as my clients have had some run time to generate meaningful stats. |
||
A story of hazardous waste
After 3.5 years of reliable service, the 19" LCD monitor on my sons computer died... and
would not power back on. Warranty long since expired and replacement LCDs being relatively cheap, I replaced his monitor with a 22" LCD and he happily updated his Facebook status to suit.
Except there was a problem.... what to do with the dead monitor? I had three choices:
But there was a further problem. This lovely sticker on the back made no mention ofROHS and made even more disturbing mentions of mercury!
What I found was this site at Sustainability Victoria, which took me to this site which told me all about a program called ByteBack. One trip to Officeworks in Dandenong later and my dead LCD was off to be recycled at no charge to myself. Not only was my shed less cluttered, but I might even have helped the environment. I would be curious to know if other people have been able to find similar programs in their locations? If so... please let know, lets spread the word! |
||
Want my XIV Visios? You got em!
One of the more common themes I blog about is the demand for Visio Stencils. In this regard I have some more good news.
Firstly the VisioCafe site has an updated IBM-Disk stencil set with the following additions: IBM-SystemStorage-Disk.vss - Added EXP5060 Front, Front Open and Rear Views Secondly, I have started using the collections function on the IBM developerWorks site to share my files with the public. To this end I have posted a collection of all the standard Visio documents I use to build XIV Solution Designs. You can find them here. They tend to use two basic building blocks, which are the Fibre Channel patch panel and the iSCSI patch panel. For instance this is what the iSCSI patch panel Visio diagram looks like: And this is an example of what a Fibre Channel Visio diagram looks like. Your free to use any of my diagrams for any purpose you like. If you have suggestions, comments or donations, please feel free to share. The link again is here. |
||
XIV Training anyone?
I am starting the week in Adelaide having travelle here to teach an XIV course:
SSA0AU XIV Technical Training. I really enjoy teaching, particularly when the students are coming from a non-IBM background. It gives me the chance to better learn how IBM's products compare to our competitors, because the experiences and view points come from real end users. It also helps me to reconfirm my knowledge and understanding of our own products. The course consists of a day of lectures and a day of labs (using the XIV Labs inMontpellier France). Here is the course layout. Day 1
Day 2
The idea is to teach all the concepts on day one and then let the students hit real machines in a remote lab environment on day two. The hands on part is always the best bit as far as I am concerned (learning by doing always beats learning by listening). Students who have never touched the XIV GUI always enjoy this part. A bigger challenge is when you have a student who already has lots of hands on experience. In those cases I work to consolidate what they have already learned. I am curious, how often do you wait so long to do a course, that there was not much left to learn by the time you actually got to do it?
|
||
Is my iPhone really helping me?
Shock horror, I am starting to question the value of my iPhone.....
Actually... I am pondering whether the iPhone is the ideal social media tool I thought it was. So whats the problem? The problem is that while using the iPhone for this purpose I rarely interact, I never create content, I rarely contribute to content. By this I mean I almost never comment on blogs and I hardly ever twitter. This is quite simply because I hate using the keyboard. More and more I leave both twitter and feed readers to when I have time to actually interact via a real keyboard. So I am curious... does anyone else feel the same way? Are there better devices out there for interaction? Or is it just my fat fingers and slow brain getting in the way? |
||
Why I use a storage cloud provider
Things have been pretty revolting lately, and I am not talking about Tunisia or Egypt or Libya (thought actually they could equally apply to my story).
What I am talking about is mother nature, and she is pretty angry with us right now. In the last few months Australia and New Zealand have seen massive floods in Queensland, Victoria and Western Australia, destructive cyclones hitting Queensland and Western Australia, ferocious bush fires in Western Australia and most recently, a massive earthquake in New Zealand. The personal loss of life and of property have been shocking and tragic. Each of these events have reminded me how quickly everything we hold dear can be taken away in an instant... by an event over which you have no control. Which leads me to storage clouds.... If something can be stored electronically, then it can be stored in a cloud. A cloud that is hopefully well backed up, and far away from your own personal location. And no this is not an advertisement... its a suggestion.... Given the events of the last few months, I have started using a storage cloud provider to protect my photos, my music and my insurance information. I looked for cloud storage providers who:
I considered the following uses:
Let me give you an example of a document I would never want to have to replace.... My son is practicing to get his drivers license. In Victoria you need 120 hours of driving experience recorded in a log book. This log book needs to be filled in every time he drives the car. If the log book is lost... those 120 hours would need to be driven again. I cannot tell you how hard it is to find 120 hours of driving opportunities (and I heartily support the 120 hours scheme!). Even if you did feel inclined to create fake entries to recreate the book (which is illegal), frankly creating 120 hours of fake driving log entires would be very hard work. To make things worse... where I am storing this booklet? In the car of course (which is the most convenient place to store it). So what happens if the car is stolen? There goes the logbook.... So the plan I work on is that every time a page is filled up, I scan that page as an image stored on my laptop. The image goes into a folder that is automatically backed up to the cloud. Yes it does depend on my being diligent, but the actual process of copying the file somewhere else is automatic. Now I have 3 copies... the original, the scanned image on my laptop and a third (automatically created) copy way off in the cloud somewhere. As for personal recommendations: 1) Get 2 GB free on Dropbox. This is a great point solution and a great way to dip your toes in. Are there others? Yes there are... Mozy is one I have seen recommended. There is alsoAmazon S3. I am sure there are plenty more.... Have there been issues with storage cloud providers? A quick search reveals stories like: Flikr deleted a users data and Carbonite lost data due to hardware failure. Still... I have no plans to store my ONLY copy of data in the cloud. For me its a backup medium... not a primary storage location. Are you convinced? Oh... and my son? He is on 89 driving hours... 31 to go.... |