PSRemotely – Framework to Enable Remote Operations Validation
Before we get started with what is PSRemotely, here is some background.
As part of my work in an engineering team, I am tasked with writing scripts which will validate the underlying infrastructure before the automation (using PowerShell DSC) kicks in to deploy the solution.
Below are the different generic phases which comprise the whole automation process:
- Pre-deployment – getting the base infrastructure ready, the bare minimum required for automation. For example – network configuration on the nodes is needed.
- Deployment – deployment of the solution leveraging PowerShell DSC.
- Post-deployment – scripts/runbooks configuring or tweaking the environment.
What I meant by validating underlying infrastructure above, is that the compute and storage physical hosts/nodes have a valid IP configuration, connectivity to the AD/DNS infrastructure etc. the key components that we required to be tested and validated to get confidence in our readiness to deploy the engineered solution on top of it.
Note – Our solution had scripts in place that would configure the network based on some input parameters and record this in a manifest XML file. After the script ran, we would assume that everything is in place. These assumptions at some points cost us a lot of efforts in troubleshooting.
In short, initial idea was to have scripts validating, what the scripts did in an earlier step. So it began, I started writing PowerShell functions, using workflows (to target parallel execution on nodes). This was a decent solution until there were requests to add validation tests for entirely everything in the solution stack e.g. DNS configuration, network connectivity, proxy configuration, disks (SSD/HDD) attached to the storage nodes etc.
Phew! It was a nightmare maintaining it.
Rays of hope: Pester, PoshSpec, and Remotely!
We went into looking at how to use some of the open source PowerShell modules into helping us perform operations validation. At this time in community, Pester was gaining traction for the operations validation.
Using Pester
We moved away from using standalone scripts for the operations validation and started converting our scripts into Pester tests. It is not surprising to see that many operations people find it easier to relate to using Pester for Ops validation, since we have been doing this validation for ages manually. Pester just makes it easy to automate all of it.
For example, in our solution each compute node gets three NIC cards, pre-deployment script configures them. If we had to test whether the network adapter’s configuration was indeed correct, it would look something like below using Pester:
|
|
Using PoshSpec & Pester
PoshSpec added yet another layer of abstraction on our infrastructure tests by adding yet another DSL.
Below is how our tests started looking with usage of Pester and PoshSpec.
Note – For validation of IPv4 Address, another keyword named IPv4Address was added to PoshSpec which would essentially call Get-NetIPAddress and spit out the IPv4 address assigned on the NIC interface with specified alias.
|
|
By using Pester and PoshSpec to write tests, it sure made maintaining these tests easy, but we still have a problem at
hand. How do we target our above tests to all the nodes in the solution?
Remotely??
At some point Ravi was tinkering with this particular PowerShell module and suggested to take a look at it. It was promising to begin with as he added support for passing Credential hash to Remotely. We would have to specify a hash table with the computer name as key and credential as value to Remotely and it would take care of connecting to those nodes, executing the script block in the remote runspace. At this point things started falling in place for what we had in mind. Our tests started looking nice and concise:
|
|
Soon we realized that the Assertions above e.g. {Should Be ’10.10.10.1’} are to be dynamically created by reading the manifest XML file which drives the whole deployment. It contains what is the expected configuration on the remote nodes.
We wanted our tests to be generic so that we could target them to all nodes part of the solution. We were looking to have our tests organized like below, where of course a node-specific details e.g. $ManagementIPv4Address etc. would be read from the manifest file and created on the fly either on the local machine or remote node :
|
|
The above syntax looks quite descriptive and decouples the validation tests and environment details too.
But there were some downsides to the above approach.
- Requires us re-writing our existing tests to accommodate keyword Remotely for executing script block on remote and running assertions locally.
- Remotely connects each time to all the nodes to run each PoshSpec based ops validation tests. Results in lot of overhead to run a large number of validation tests.
- Trouble passing environment specific data to the remote nodes e.g. in the above tests passing the expected IPv4 address to the remote node.
- For running Pester/PoshSpec tests on the remote nodes, these modules need to be present on the remote node, to begin with.
The existing Remotely framework was meant to execute script block against a remote runspace but it was not specifically built to perform operations validation remotely.
Enter PSRemotely
After trying to integrate Remotely with Pester/PoshSpec based tests, we had a general idea on what we needed from a framework/DSL, if it was to provide us with the capability of orchestrating operations validation remotely on the nodes. Below are some of those features we had in mind along with the arguments for these to be implemented:
Target Pester/PoshSpec based operations validation tests on the remote nodes.
Allow specifying environment data separately from the tests, so that same tests could be applied across on nodes.
We decided on the ability to use DSC Style configuration data here for specifying node specific environment details.
Easier debugging on the remote nodes, in case tests fail.
If something failed on the remote node during validation, we should be able to connect to the underlying PowerShell remoting session and debug issues.
Allow re-running specific tests on the remote nodes.
In case a test failed, performing a quick remediation action and validating that specific test passed it is a good to have feature when you have lot of tests in your suite.
Self-contained solution.
Have the framework bootstrap the remote nodes with required modules version (Pester & PoshSpec) under the hood. Remote nodes might not have internet connectivity here.
Allow copying required artifacts to remote nodes.
For our solution, we require a manifest file with details about the deployment to be copied on each node.
Use PowerShell remoting as underlying transport mechanism for everything.
Return bare minimum JSON output, if everything passes. If a test fails then return the error record thrown by Pester.
And, PSRemotely was born!
So after a lot of discussions with Ravi, I finally got insight on how the remote operations validation DSL should look like:
|
|
Configuration data, can be generated separately or specified from a .psd1 or .json file
|
|
PSRemotely tests, specify CredentialHash and Configuration data to the PSRemotely
|
|
After having a clear idea on the features required in the framework and how we wanted the DSL to look like, I started working on it. This post has set up the context on why we began working on something entirely new from scratch.
Join me in the second post where I try to explain how to use PSRemotely to target remote nodes for operations validation.
Share on: