For help or query regarding Sizer Basic or Collector , please reach out to our experts on the Sizer community at the link below:
https://next.nutanix.com/sizer-capacity-solution-planning-148
Just another WordPress site
For help or query regarding Sizer Basic or Collector , please reach out to our experts on the Sizer community at the link below:
https://next.nutanix.com/sizer-capacity-solution-planning-148
These questions are here to assist with ensuring that you’re gathering necessary information from a customer/prospect in order to put together an appropriate solution to meet their requirements in addition to capturing specific metrics from tools like Collector or RVTools.
This list is not exhaustive, but should be used as a guide to make sure you’ve done proper and thorough discovery. Also, it is imperative that you don’t just ask a question without understanding the reason why it is being asked. We’ve structured these questions with not only the question that should be asked, but why we are asking the customer to provide an answer to that question and why it matters to provide an optimal solution.
Questions marked with an asterisk (*) will likely require reaching out to a specialist/Solution Architect resource at Nutanix to go deeper with the customer on that topic/question. Make sure you use the answers to these questions in the Scenario Objectives in Sizer when you create a new Scenario. These questions should help guide you as to what the customer requirements, constraints, assumptions, and risks are for your opportunity.
This is a live document, and questions will be expand and update over time.
REVISION HISTORY
4/21/25 – 1st Revision – Mike McGhee
1/5/21 – 1st Publish – Matt Bator
Why ask? This question helps us understand the use case, any current expectations and what the competitive landscape may look like as well as an initial idea of the size / scale of the current solution.
Why ask? If we’re sizing into an existing cluster we need to understand current hardware and current workload. For licensing purposes adding Files to an existing cluster means the Unified Storage Pro license. A common scenario has been to add storage only nodes to an existing cluster to support the new Files capacity. If sizing into a new cluster we can potentially dedicate this cluster to Files and Unified Storage.
Why ask? We need to understand protocol to first validate they are using supported clients. Supported clients are documented in the release notes of each version of Files. Concurrent SMB connections also impact sizing with respect to the compute resources we need for the FSVMs to handle those clients. Max concurrent connections are also documented in the release notes of each version.
It also helps us validate supported authentication methods. For SMB, we require Active Directory where we support 2008 domain functional level or higher. There is limited local user support for Files but the file server must still be registered with a domain. For NFS v4 we support AD with Kerberos, LDAP and Unmanaged (no auth) shares. For NFS v3 we support LDAP and Unmanaged.
Why ask? Every FSVM has an expected performance envelope. There is a sizing guide and performance tech note on the Nutanix Portal which give a relative expectation on the max read and write throughput per FSVM and max read or write IOPs per FSVM.
Throughput based on reads and writes are integrated into Nutanix Sizer and will impact the recommended number of FSVMs. This may also impact the hardware configuration,including choice of NICs, leveraging RDMA between the CVMs, or iSER supported since the Files 5.0 release via a performance profile. Also the choice of all flash vs. hybrid.
Why ask? Seeing data from an existing solution can help validate the performance numbers so that we size accurately for performance.
Why ask? When sizing for storage space utilization the application performing the writes could impact storage efficiency. Backup, Video and Image data are most commonly compressed by the application. For those applications we should not include compression savings when sizing, only Erasure Coding. For general purpose shares with various document types assume some level of compression savings.
Why ask? If the customer has existing performance data, it’s good to understand if they are expecting equivalent or better performance from Files. This could impact sizing, including going from a hybrid to an all flash cluster.
Why ask? Concurrent SMB connections are a required sizing parameter. Each FSVM needs enough memory assigned to support a given number of users. A Standard share is owned by one FSVM. A distributed share is owned by all FSVMs and is load balanced based on top level directories. We need to ensure any one FSVM can support all concurrent clients to the standard share or top level directory with the highest expected connections. We should also be ensuring that the sizing for concurrent connections is taking into account N-1 redundancy for node maintenance/failure/etc.
Why ask? Files has a soft (recommended) limit of 100 shares per FSVM. We also leverage Nested shares to match an existing environment if there are more shares needed. Files currently supports 5,000 nested shares since the 4.4 release.
Why ask? This indicates a large number of top level directories making a distributed share a good choice for load balancing and data distribution.
Why ask? Distributed shares cannot store files in the share root. If an application must store files in the root then you should plan for sizing using standard shares. Alternatively, a nested share can be used.
Why ask? Nutanix Files is designed to store millions of files within a single share and billions of files across a multi-node cluster with multiple shares. To achieve speedy response time for high file and directory count environments it’s necessary to give some thought to directory design. Placing millions of files or directories into a single directory is going to be very slow in file enumeration that must occur before file access. The optimal approach is to branch out from the root share with leaf directories up to a width (directory or file count in a single directory) no greater than 100,000. Subdirectories should have similar directory width. If file or directory counts get very wide within a single directory, this can cause slow data response time to client and application. Increasing FSVM memory up to 96 GB to cache metadata can help improve performance for these environments especially if designs for directory and files listed above are followed.
Why ask? Nutanix supports standard shares up to 1PiB starting with the Files 5.0 release (prior to compression.) And top level directories in a distributed share up to 1PiB. These limits are based on the volume group supporting the standard share or top level directory. We need to ensure no single folder or share (if using a standard share) surpasses 1PiB.
12. Largest number of files/folders in a single folder?
Why ask? Nutanix Files is designed to store millions of files within a single share and billions of files across a multi-node cluster with multiple shares. To achieve speedy response time for high file and directory count environments it’s necessary to give some thought to directory design. Placing millions of files or directories into a single directory is going to be very slow in file enumeration that must occur before file access. The optimal approach is to branch out from the root share with leaf directories up to a width (directory or file count in a single directory) no greater than 100,000. Subdirectories should have similar directory width. If file or directory counts get very wide within a single directory, this can cause slow data response time to client and application. Increasing FSVM memory to cache metadata and increasing the number of vCPUs can help improve performance for these environments especially if designs for directory and files listed above are followed.
Why ask? Core sizing question to ensure adequate storage space is available with the initial purchase and over the expected timeframe.
Why ask? Understanding the expected active dataset can help with sizing the SSD tier for a hybrid solution. Performance and statistical collection from an existing environment may help with this determination.
Why ask? Change rate influences snapshot overheads based on retention schedules. Nutanix Sizer will ask what the change rate is for the dataset to help with determining the storage space impact of snapshot retention.
Why ask? Helps to determine if data reduction techniques like dedup and compression are effective against the customers data. Files does not support the use of deduplication today, so any dedup savings should not be taken into account when sizing for Files. If the data is compressible in the existing environment it should also be compressible with Nutanix compression.
Why ask? Block size can impact storage efficiency. A solution which has many small files with a fixed block size may show different space consumption when migrated to Files, which uses variable block lengths based on file size. For files over 64KB in size, Files uses a 64KB block size. In some cases a large number of large files have been slightly less efficient when moved to Nutanix Files. Understanding this up front can help explain differences following migrations.
Why ask? Nutanix Files uses two levels of snapshots, SSR snapshots occur at the file share level via ZFS. These snapshots have their own schedule and Sizer asks for their frequency and change rate under “Nutanix Files Snapshots.” The schedule associated with SSR and retention periods will impact overall storage consumption. Nutanix Files Snapshots increase both the amount of licensing required and total storage required, so it’s important to get it right during the sizing process.
Why ask? Data Protection snapshots occur at the AOS (protection domain) level via the NDSF. The schedule and retention policy are managed against the protection domain for the file server instance and will impact overall storage consumption. Sizer asks for the local and remote snapshot retention under “Data Protection.”
Files supports 1hr RPO today and will support near-sync in the AOS 5.11.1 release in conjunction with Files 3.6. Keep in mind node density (raw storage) when determining RPO. Both 1hr and near-sync RPO require hybrid nodes with 40TB or less raw or all flash nodes with 48TB or less raw. Denser configurations can only support 6hr RPO. These requirements will likely change so double check the latest guidance when sizing dense storage nodes. Confirm that underlying nodes and configs support NearSync per latest AOS requirements if NearSync will be used.
Why ask? If the customer needs active/active file shares in different sites which represent the same data, we need to position a third party called Peer Software. Peer performs near real time replication of data between heterogenous file servers. Peer utilizes Windows VMs which consume some CPU and memory you may want to size into the Nutanix clusters intended for Files.
Files 5.0 introduced an active/active solution called VDI sync, specific for user profile data. The solution supports activity against user specific profile data within one site at a time. If the user moves to another site, the VDI session can follow and localize access for that user.
Why ask? Nutanix is working to integrate with three main third-party auditing vendors today, Netwrix (supported and integrated with Files), Varonis (working on integration) and Stealthbits (not yet integrated). Nutanix Files also has a native auditing solution in File Analytics.
Along with ensuring audit vendor support, a given solution may require a certain amount of CPU, Memory and Storage (to hold auditing events). Ensure to include any vendor specific sizing in the configuration. File Analytics for example could require 8vcpu 48GB of memory and 3TB of storage.
Data Lens is a SaaS offering in the public cloud, so you will need to ensure the customer is comfortable with a cloud solution.
Why ask? Files supports specific Antivirus vendors today with respect to ICAP integration. For a list of supported vendors see the software compatibility matrix on the Nutanix Portal and sort by Nutanix Files:
https://portal.nutanix.com/page/documents/compatibility-interoperability-matrix/software
If centralized virus scan servers are to be used you will want to include their compute requirements into sizing the overall solution.
Why ask? Files has full change file tracking (CFT) support with HYCU, Commvault, Veeam, Veritas and Storware. There are also vendors like Rubrik who are validated but do not use CFT. If including a backup vendor on the same platform, you may need to size for any virtual appliance which may also run on Nutanix.
Why ask? Less about sizing and more about implementation. Prior to Files 3.5.1 Files could only support distributed shares with DFS-N. Starting with 3.5.1 both distributed and standard shares are fully supported as folder targets with DFS-N.
Files 5.1 introduced a native unified namespace to combine different file servers into a common namespace.
Why ask? Files supports tiering which means automatically moving data off Nutanix Files and to an S3 compliant object service either on-premises or in the cloud. In scoping future requirements, customers may size for a given amount of on-premises storage and a larger amount of tiered storage for longer term retention.
These questions are here to assist with ensuring that you’re gathering necessary information from a customer/prospect in order to put together an appropriate solution to meet their requirements in addition to capturing specific metrics from tools like Collector or RVTools.
This list is not exhaustive, but should be used as a guide to make sure you’ve done proper and thorough discovery. Also, it is imperative that you don’t just ask a question without understanding the reason why it is being asked. We’ve structured these questions with not only the question that should be asked, but why we are asking the customer to provide an answer to that question and why it matters to provide an optimal solution.
Questions marked with an asterisk (*) will likely require reaching out to a specialist/Solution Architect resource at Nutanix to go deeper with the customer on that topic/question. Make sure you use the answers to these questions in the Scenario Objectives in Sizer when you create a new Scenario. These questions should help guide you as to what the customer requirements, constraints, assumptions, and risks are for your opportunity.
This is a live document, and questions will be expand and update over time.
Why ask? It helps us understand the customer’s maturity level when it comes to application deployment and could uncover some of the competitive infrastructure. See some of the possible competitive or other products we may be able to work with or integrate with.
Why ask? Gives us the opportunity to discuss our SNOW plugin. Also helps understand which front end they will use for the Calm implementation.
Why ask? Helps us understand which providers they may consume with Calm. Helps us understand which services are still on-prem and available as a target for AOS. May help position Beam. Also helps us understand if they have a Microsoft EA which may force their spend to go to Azure.
Why ask? It helps uncover their current pain points and possibly competitive landscape. (This would typically be asked when talking to the Infrastructure Team) If the process is already well documented/defined, the hardest part of the implementation is already done.
Why ask? Helps understand the competitive landscape as well as integration points that will need to be solved
Why ask? It helps us estimate the size of the deal for licensing
Why ask? Helps understand their current place on the journey to cloud native apps. If they are still investigating, we have an option to position Karbon. If they are using another product already, we may be able to provide the infrastructure for that environment.
Why ask? Helps us understand which providers they may consume with Calm. Helps us understand which services are still on-prem and available as a target for AOS. May help position Beam. Also helps us understand if they have a Microsoft EA which may force their spend to go to Azure.
Why ask? Gives us the opportunity to discuss our SNOW plugin. Also helps understand which front end they will use for the Calm implementation.
Why ask? Helps us understand the integrations needed for a successful implementation.
Resources:
Glossary of Terms: https://github.com/nutanixworkshops/calmbootcamp/blob/master/appendix/glossary.rst
xPert Automation team page: http://ntnx.tips/xPertAutomation (Internal Only)
LinkedIn Learning – DevOps Foundations Learning Plan: https://www.nutanixuniversity.com//lms/index.php?r=coursepath/deeplink&id_path=79&hash=2ce3cb1f946cc3770bd466853e68ee36ddbcf5e1&generated_by=19794
Udacity+Nutanix: Hybrid Cloud Engineer Nanodegree
Calls to action/next steps:
1. Create a SFDC opportunity, quote a Calm+Services bundle, add a DevOps resource request
2. Test Drive: Automation
3. Calm bootcamps (+Karbon, +CI/CD, etc.) (Internal Only)
These questions are here to assist with ensuring that you’re gathering necessary information from a customer/prospect in order to put together an appropriate solution to meet their requirements in addition to capturing specific metrics from tools like Collector or RVTools.
This list is not exhaustive, but should be used as a guide to make sure you’ve done proper and thorough discovery. Also, it is imperative that you don’t just ask a question without understanding the reason why it is being asked. We’ve structured these questions with not only the question that should be asked, but why we are asking the customer to provide an answer to that question and why it matters to provide an optimal solution.
Questions marked with an asterisk (*) will likely require reaching out to a specialist/Solution Architect resource at Nutanix to go deeper with the customer on that topic/question. Make sure you use the answers to these questions in the Scenario Objectives in Sizer when you create a new Scenario. These questions should help guide you as to what the customer requirements, constraints, assumptions, and risks are for your opportunity.
This is a live document, and questions will be expand and update over time.
Why ask? This question helps us understand the use case, any current expectations and what the competitive landscape may look like
Why ask? It helps us understand how serious the customer is about migrating and the drivers : usually cost and helps create a pipe-line.
Why ask? To determine which environment is easier to go after as a starting point.
Why ask? Helps us articulate Nutanix Value for Relational Database Workloads.
Why ask? Helps us articulate a Disaster recovery/backup strategy.
Why ask? Whether using third party DR tools ( Zerto/Actifio/SRM) or native database replication. Whether using third party backup ( Commvault/VEEAM/VERITAS ) or native tools
Why ask? Helps us identify transactional, OLTP, vs Analytical, OLAP/DWH ( latency sensitivity )
Why ask? Beyond 30 TB , hyperconverged virtualizing may not be beneficial. Need to understand use case
Why ask? Accurate sizing
Why ask? Determine if there are potentially any mission critical workloads
Why ask? Inventory purposes and Era only supports a single SQL Server instance on the same host.
Why ask? Inventory purposes and also helps identify which databases are considered critical for AG (Always On Availability Groups) etc. , databases reside in an instance.
Why ask? Inventory sizing purposes.
Why ask? Different SQL Server versions have different features, limitations etc and also different CU cumulative update levels. SQL Server stopped issuing service packs in SQL Server 2016 everything now is a CU format. External SQL Server Edition and Version Comparison.
Why ask? Different Windows versions have different features, limitations and update levels that may affect SQL Server, also driver versions etc.
Why ask? This can help differentiate which licensing model the customer is using and why.
Why ask? This can help determine if shared storage is used such as a SQL Server Failover Cluster Instance (FCI), or a SQL Server Always On Availability Group (AG) which does not require shared storage. Also is there any multi site replication being used either as a physical storage layer or logical SQL Server layer.
Why ask? Inventory sizing purposes for baseline.
Why ask? Inventory sizing purposes for baseline.
Why ask? Inventory sizing purposes for baseline.
Why ask? Inventory sizing purposes for baseline.
Why ask? Inventory sizing purposes for baseline helpful in determining expectations with regard to latency.
Why ask? Inventory sizing purposes for baseline.
Why ask? The number of I/O service requests to use as a baseline for their current workload.
Why ask? The response time requirement to use as a baseline for their current workload.
Why ask? The throughput requirement to use as a baseline for their current workload.
Why ask? This helps determine what their workload profile is like and how it will affect our platform (reads are local, writes incur node replication cost) as a baseline.
Why ask? This helps determine what their workload profile I/O size is related to bandwidth .
Why ask? This helps determine what SQL Server is waiting on to process transactions, where there may be a bottleneck.
Why ask? This helps narrow the focus and develop a relationship with the customer. It also assists in focusing on how Nutanix can help alleviate those specific pain points and gives information about how the solution can be shown to resolve those particular pain points.
Why ask? Oracle licensing is expensive and customers want to make the best use of their entitlement when replatforming and not spend more $$ on new licensing when doing a new solution. Customers are also looking forward to reducing their Oracle License overhead .
Why ask? There may be possibilities to eliminate some Options by using Nutanix Features such as Compression, Encryption, Replication
Why ask? When inventorying an Oracle DB environment, you can use the Automatic Workload Repository (AWR) report to gather detailed inventory and performance statistics for an Oracle Database. Nutanix has an AWR script that can be run to capture the necessary information and is able to be downloaded from within the Sizer Tool. When adding a Workload select Import, then click the AWR tab and you will see the AWR SQL Script download link. Once run, you can then upload the output using the Upload File option.
Why ask : To find out customer operational efficiency for provisioning . Era can help improve this from weeks to hours.
Why ask : Customers make multiple “full copies” of PROD for test-dev dev/test and use up to 5-10 times the space they need . Era will help in creating space optimized clones of database with “rapid speed”
Why ask: Customers using traditional techniques to refresh a copy of a database from a RMAN backup , takes multiple hours and is usually done once a month . With Era , they can clone everyday or multiple times a day in minutes.
Why ask : Oracle patching is a huge pain point in large Oracle environments. Era provides a unique way to do “fleet patching” which will help save 100’s of man hours spent in traditional patching
Why ask : Migration is an involved process and a lot of planning and time is required for migration.
Era provides an easy method to “replicate & migrate” databases (Same version) for same-endian formats. ( Linux->Linux or Windows ->Linux)
Why ask: customers are looking to reduce their software licensing cost of database replication and will look for opportunities to replicate using infrastructure (nutanix replication) . era enables cross-cluster replication including replicating to a NTNX cluster in AWS cloud in an upcoming release 2.0