Archive for January, 2016

Cool tool: the Isilon Self-Service Platform utility

Risa Galant

Risa Galant

Principal Technical Writer at EMC Isilon Storage Division
Risa Galant

Latest posts by Risa Galant (see all)

Want to have more control over troubleshooting issues on your Isilon cluster? How about being able to resolve common issues yourself, bypassing the support queue? Well, you’re in luck! The Isilon Self-Service Platform utility (SSP) allows you to do just that. Use the SSP utility to perform first-line troubleshooting on your cluster. You’ll be using the same utility that EMC Isilon Technical Support Engineers and Field Representatives use to prevent and troubleshoot a wide range of known issues that occur on Isilon clusters.
Note: The SSP utility runs only on Windows platforms.

How the SSP utility works

The SSP utility analyzes a cluster log set: it does not run live on the cluster. You first collect the cluster log files (the *.tgz files), then run the SSP utility. The utility runs a series of checks on the cluster log files and generates a diagnostic report of the current health of the cluster. You can run checks for specific categories such as pre-upgrade or health check, or choose specific checks for a custom run.

After the analysis completes, the Overall Health screen presents a summary of the basic health of your cluster. It includes high level details about the cluster and the names and status of the individual nodes on the cluster. Summary results are color-coded: red for critical, orange for needs attention, and green for okay. And to help you resolve issues, the report provides links to the relevant documentation.

You can run the SSP utility using its command line interface or the SSP GUI.

Getting the Self-Service Platform utility

Go to the Isilon Self-Service Platform Info Hub and download and unpack the SSP .zip file. The .zip file contains the SSP utility executable file as well as the user guide.

Collecting the cluster log files

Before running the SSP utility, you must collect the cluster log files. You can use the isi gather command or the OneFS WebUI to collect the files. After they’re collected, copy them to a convenient location that the SSP utility can access.  When you run the SSP utility, you specify the location of the log files, a location for the results files, and choose the check(s) that you want to run. The SSP utility places the generated results files for each test in a separate folder in the results location that you specified.

KB article 304468 explains how to collect the log files, complete with a video demonstration.

Running the SSP utility

To run the SSP utility, double-click the Isilon_Self-Service_Platform.exe file. The main screen appears:

SSP Utility Main Screen
(Click the image to see a larger version.)

Click the Help button in the upper left to view the SSP utility’s FAQ page.

Enter the log location, your service request (SR) number, and the output path for the generated diagnostic report. After you click next, you can choose the checks to run. You’ll see a screen with tabs for each of the check categories: pre-upgrade, post-upgrade, health, firmware, and custom, as the following figure shows.

SSP Utility Pre-Upgrade Checks Screen
(Click the image to see a larger version.)

The checks include:

  • Cluster level checks such as overall health of the cluster, patches installed, InfiniBand configuration and upgrade service status
  • Cluster configuration checks such as whether or not ESRS is enabled, file sharing configuration, priority of all routable gateways and whether any gateways share the same priority
  • Node level checks such as boot flash drive problems, NVRAM battery status, mismatched nodes, node health, amount of free space, and uptime
  • Node status checks such as the DMI log, device errors, kernel open files, NIC status, netstat connections, and the cluster’s var/crash partition
  • Disk level checks such as disk load for all disks in each node, node drive bay health, drive errors, and any errors reported in the idi.log file

You can select the checks to run based on category, such as pre-upgrade check, or use the Custom tab to select only those checks you’re interested in:

SSP utility Select Tests screen
(Click the image to see a larger version.)

After you select the checks you want, click the Run button. You’ll see a progress indicator at the bottom of the screen and the SSP utility’s UI is disabled for the duration of the run.

The report results output structure

After the run completes, the SSP utility presents a summary of the results, similar to the following figure (with identifying information redacted).

SummaryScreen
(Click the image to see a larger version.)

You’ll find the results in the folder you specified for the output. The generated results output file structure is similar to the following.

SSP Utility Results File Structure
(Click the image to see a larger version.)

Go get it!

The SSP utility is a great way to take charge of troubleshooting and performing preventative health checks to resolve or avoid common, known issues on your cluster. Check out the Isilon Self-Service Platform Info Hub for the latest download and documentation. And for information about collecting log files (complete with a video demonstration), see KB article 304468.

Let us know!

Let us know what you think. If you have feedback for us about this or any other Isilon technical content, email us at isicontent@emc.com. And thank you!

New and popular EMC Isilon technical content for December 2015

Risa Galant

Risa Galant

Principal Technical Writer at EMC Isilon Storage Division
Risa Galant

Latest posts by Risa Galant (see all)

Check out EMC Isilon’s latest and most popular customer-focused technical content. Each month we highlight newly published technical content developed for Isilon customers, as well as the Isilon Top 10’s:

  • Top 10 Knowledgebase articles
  • Top 10 Customer Support documents
  • Top 10 Isilon Info Hubs

New customer-focused Isilon technical content

Here are links to customer-focused technical content that was published in December. Enjoy!

CONTENT TYPE

LINK AND DESCRIPTION

Cluster Talk Podcast Episode 10: Blinky lights as prank or breakthrough source of data bandwidth, the Hadoop Roundup, solving a bi-directional user mapping problem, highlights from OneFS.NEXT, and why happiness in December was all about the Hamiltons.
Cluster Talk Podcast Podcast: Cluster Talk Hub: Looking for all the Cluster Talk podcasts? You’ll find them here.
Video Cool Commands: Why can’t I access my Windows file?
Video Cool Commands: Why can’t I access my UNIX file?
Video ID.TV video playlist: Looking for every Isilon video ever, including the Cool Commands videos? You’ll find them here on the ID.TV YouTube channel.
Isilon Community (ECN) Blog InsightIQ – Lesser known facts
Isilon Community (ECN) Blog Gotchas of Isilon for VDI file storage & Home Directories
Isilon Community (ECN) Blog SMB throughput benchmarking with fio.exe
Isilon Community OneFS and cluster quorum
Isilon Community (ECN) Blog NFS throughput benchmarks with fio
Info Hub Customer Troubleshooting: How to Add a Drive to a Node
Info Hub Customer Troubleshooting: How to Diagnose and Troubleshoot a Battery Failure
Info Hub Customer Troubleshooting: Troubleshoot a Bit Error Rate (BER) Alert on your Isilon Cluster
Info Hub Customer Troubleshooting: InsightIQ does not Display Data Correctly in the File System Reporting Page
Info Hub Customer Troubleshooting: Troubleshoot Problems with InsightIQ Upgrades
Info Hub Customer Troubleshooting: Troubleshoot Problems with Adding or Importing a Cluster for Monitoring in InsightIQ
Info Hub Customer Troubleshooting: Troubleshoot Problems with Accessing or Logging in to the InsightIQ Web Administration Interface
Info Hub Customer Troubleshooting: InsightIQ does not Display Data Correctly on the Live Performance Monitoring Page
Info Hub Customer Troubleshooting: Troubleshoot Problems with Installing or Setting up InsightIQ
Info Hub Customer Troubleshooting: Troubleshoot Windows Active Directory Authentication
Info Hub Customer Troubleshooting: Troubleshoot Your LDAP Authentication Provider
Info Hub Customer Troubleshooting: Troubleshoot Windows File System Permissions for your Isilon Cluster
Info Hub Customer Troubleshooting: Troubleshoot Identity Mapping
Info Hub Customer Troubleshooting: Troubleshoot Problems with your NIS Authentication Provider
Info Hub Customer Troubleshooting: Troubleshoot your SmartConnect Configuration
Info Hub Current OneFS Patches
Info Hub InsightIQ

Top 10 Knowledgebase Articles

Next up: December’s top 10 most-viewed Knowledgebase articles, including ETAs and ESAs. You’ll need to login to view them.

  1. ETA 209918: Isilon OneFS: Nodes that have run for more than 248.5 consecutive days may restart without warning which may lead to potential data unavailability
  2. Isilon OneFS: Nodes that have run for more than 248.5 consecutive days may restart without warning which may lead to potential data unavailability (209418)
  3. ESA-2015-175 (211190)
  4. ESA-2015-155 (208181)
  5. ESA-2015-148 (211191)
  6. ESA-2015-181 (211192)
  7. UPDATE: ETA 208773: Isilon OneFS: When used in conjunction with Isilon OneFS, specific versions of the BXE 10GigE network interface card may experience data inconsistency (208773)
  8. UPDATE: ETA 202452: Isilon OneFS: Nodes that have run for 497 consecutive days may restart without warning
  9. ESA-2015-164 (210195)
  10. OneFS: How to reset the CELOG database and clear all historical events (16535)

Top 10 customer-focused technical documents

And now, December’s most popular customer-focused technical documents.

  1. Current Isilon Software Releases
  2. Isilon Supportability and Compatibility Guide
  3. Current Isilon OneFS Patches
  4. Insight IQ 3.2 Installation Guide
  5. Isilon OneFS 7.2.1.0 – 7.2.1.1 Release Notes
  6. Isilon Product Availability
  7. OneFS Upgrade Planning and Process Guide
  8. InsightIQ 3.2.2 Release Notes
  9. Isilon OneFS 7.2.0.0 – 7.2.0.5 Release Notes
  10. X410 Installation Guide

Top 10 Isilon Info Hubs

We got December’s most popular Info Hubs right here.

  1. Isilon Uptime
  2. OneFS Upgrades
  3. Hadoop
  4. OneFS 7.2 Documentation
  5. Current OneFS Patches
  6. OneFS API
  7. Cloudera with Isilon
  8. OneFS 7.2.1 Documentation
  9. InsightIQ
  10. Customer Troubleshooting

Tell us what you want to know! Contact us with questions or feedback about this blog at isi.knowledge@emc.com. To provide documentation feedback or to request new content, contact isicontent@emc.com.

Check out the latest Customer Troubleshooting Guides

Risa Galant

Risa Galant

Principal Technical Writer at EMC Isilon Storage Division
Risa Galant

Latest posts by Risa Galant (see all)

Happy New Year! To celebrate, we’ve published a host of new EMC Isilon Customer Troubleshooting Guides.  These guides provide step-by-step troubleshooting instructions to help you solve issues that may affect Isilon clusters, or to walk you through the steps needed to gather important data to help EMC Customer Support solve your problem quickly.

This latest set of guides covers topics including Isilon hardware, authentication and permissions, Networking/SmartConnect configuration, and InsightIQ.

Visit our Customer Troubleshooting Info Hub for the latest list of published guides. More guides are coming soon!

The new guides are:

Check out these troubleshooting resources and let us know what you think. Email us at isicontent@emc.com with your feedback. And thank you!