Archive for December, 2014

New “Hacker How-To” series on EMC Isilon social media

Kirsten Gantenbein

Kirsten Gantenbein

Principal Content Strategist at EMC Isilon Storage Division
Kirsten Gantenbein
Kirsten Gantenbein

If you follow EMC Isilon on Twitter (@EMCIsilon), you probably noticed our new Hacker How-To series. This six-week series highlighted videos, whitepapers, and blog posts on the following topics:

  • Disaster Recovery OneFS modules: SyncIQ, SnapshotIQ, and SmartLock
  • Site Prep and Planning: Cable management, node selection, and cluster installation
  • Multiprotocol support in OneFS
  • Role-based access control in OneFS 7.1
  • InsightIQ overview and installation

The series was popular, so we’re planning a new series in March! We want to know which topics you’d like to see next. We also want to know how you’d like to receive this information.

Here are two questions to help us make the next Hacker How-To most useful to you.

Start a conversation about Isilon content

Have a question or feedback about Isilon content? Visit the online EMC Isilon Community to start a discussion. If you have questions or feedback about this blog, contact us at isi.knowledge@emc.com. To provide documentation feedback or request new content, contact isicontent@emc.com.

[display_rating_result]

Multitenancy for Hadoop data on an EMC Isilon cluster

Kirsten Gantenbein

Kirsten Gantenbein

Principal Content Strategist at EMC Isilon Storage Division
Kirsten Gantenbein
Kirsten Gantenbein

The process of analyzing big data within big organizations can be complicated. There can be many data sets to analyze, some which are stored in silos or contain secure information. And there can be many different Hadoop users accessing these data sets, each with different permissions and credentials. So how can organizations effectively manage multiple data sets and Hadoop users?

In EMC® Isilon® OneFS®, you can take advantage of multitenancy to tackle this issue. Multitenancy creates secure, separate namespaces on a shared infrastructure so that different Hadoop users (or tenants) can connect to an Isilon cluster, run Hadoop jobs concurrently, and consolidate their Hadoop workflows onto a single cluster. OneFS 7.2 supports several Hadoop distributions and HDFS 2.2, 2.3, and 2.4. The OneFS HDFS implementation also works with Ambari for management and monitoring, Kerberos authentication, and Kerberos impersonation.

The white paper, “EMC Isilon Multitenancy for Hadoop Big Data Analytics,” highlights how to set up access zones for multitenancy and manage Hadoop data in an Isilon cluster.

How Hadoop works in Isilon

The Apache Hadoop analytics platform comprises the Hadoop Distributed File System, or HDFS, a storage system for vast amount of data, and MapReduce, a processing paradigm for data-intensive computation analysis.

EMC Isilon serves as the file system for Hadoop clients. This enables Hadoop clients to directly access their datasets on the Isilon storage system and run data analysis jobs on their compute clients. OneFS implements server-side operations of the HDFS protocol on each node in the Isilon cluster to handle calls to the NameNode and to manage read/write requests to DataNodes.

EMC Isilon Hadoop Deployment

To configure an Isilon cluster for Hadoop, you first need to activate a HDFS license in OneFS. Contact your account team for more information. Then visit our EMC Hadoop Starter Kits to learn how to deploy multiple Hadoop distributions, such as Pivotal, Cloudera, or HortonWorks, on your Isilon cluster.

Access zones for multitenancy

Access zones lay the foundation for multitenancy in OneFS. Access zones provide a virtual security context that segregates tenants and creates a virtual region that isolates data sets. Each access zone encapsulates a namespace, HDFS directory, directory services, authentication, and auditing. An access zone also isolates system connections for further security.

The following procedures for managing and securing data sets are covered in “EMC Isilon Multitenancy for Hadoop Big Data Analytics.”

  • Provide multiprotocol support – Learn how you can store data by using existing workflows on your Isilon cluster and access it through SMB, NFS, OpenStack Swift, and HDFS protocols, instead of running HDFS copy operations to move data to Hadoop clients.
  • Manage different data sets – Learn how you can use SmartPools for managing different data sets based on customized policies.
  • Associate network resources with access zones – Understand how virtual racking works in Isilon and how you can configure SmartConnect in OneFS to manage connections to data on your Isilon cluster.
  • Secure access zones – Review how role-based access control and directory services with access zones in OneFS are used to authenticate users assigned to each zone.

Hadoop information hubs

You can find a rich array of information about Isilon and Hadoop. Visit our online Isilon Community on the EMC Community Network for InfoHubs, which serves as a single location for all of our Hadoop-related content. The Hadoop InfoHub contains links to general information about Isilon and Hadoop. The Cloudera with Isilon InfoHub contains links to information about deploying the Cloudera distribution for Isilon.

Start a conversation about Isilon content

Have a question or feedback about Isilon content? Visit the online EMC Isilon Community to start a discussion. If you have questions or feedback about this blog, contact us at isi.knowledge@emc.com. To provide documentation feedback or request new content, contact isicontent@emc.com.

[display_rating_result]

Top EMC Isilon support documents for November 2014

Kirsten Gantenbein

Kirsten Gantenbein

Principal Content Strategist at EMC Isilon Storage Division
Kirsten Gantenbein
Kirsten Gantenbein

To those celebrating the Thanksgiving holiday, I hope it was a good one!

Below are the most viewed EMC® Isilon® KB articles and recently published content in November.

Most viewed knowledgebase (KB) articles

Check out November’s top 10 most viewed KB articles.

  1. Best practices for NFS client settings (90041)
  2. OneFS 6.5 and later: How to safely shut down an Isilon cluster prior to a scheduled power outage (16529)
  3. How to upload files to Isilon Technical Support (16759)
  4. OpenSSL Multiple Vulnerabilities in EMC products (188456)
  5. OpenSSL Heartbeat Vulnerability (Heartbleed) in EMC products (185965)
  6. How to configure Windows DNS for a SmartConnect zone (183530)
  7. Troubleshooting performance issues (88844)
  8. How to reset a node to factory defaults (16696)
  9. How to reset the CELOG database and clear all historical events (16586)
  10. Active Directory clients cannot connect to the cluster after the machine account password is changed (169843)

Most viewed product documentation

We’re working on our analytic reports. Check back next month.

New Isilon support content

Check out these links to new support content published in November (login to the EMC Online Support site is required for all content except videos). For Syncplicity Panorama users, there is a new Isilon tech note available. We also published InfoHubs on our EMC Community Network (ECN) Isilon Community website. These InfoHubs contain links to all the OneFS 7.2 documentation that you need.

CONTENT TYPE

TITLE AND LINK

Video Technical Demo: EMC Isilon SmartConnect
Video Technical Demo: SMB Share Provisioning and Administration in EMC Isilon OneFS 7.1.1
Document Syncplicity Panorama with Isilon Storage Tech Note
Documentation InfoHub on Isilon Community (ECN) OneFS 7.2 Documentation
Documentation InfoHub on Isilon Community (ECN) OneFS 7.2 Node Equivalency (Compatibility)
Release Notes (updated) Isilon Drive Firmware Package 1.8 Release Notes
Release Notes (updated) Isilon Drive Support Package 1.2 Release Notes
KB Article SyncIQ fails with error: SyncIQ failed to take a snapshot on source cluster. Unable to delete snapshot from previous run (194408)
KB Article (updated) How to determine if an Isilon cluster is in a window of risk for data loss (16701)
KB Article OneFS 7.1.1: Troubleshooting Pivotal Hadoop for OneFS issues (193150)

 

Tell us what you want to know! Contact us with questions or feedback about this blog at isi.knowledge@emc.com. To provide documentation feedback or request new content, contact isicontent@emc.com.

 

[display_rating_result]