VMware

How To: VMware Cloud Foundation 9.0 and 9.1 Offline Depot – VMUG Advantage VCP-VCF Licenses

This process works for both VCF 9.0 and 9.1 (including 9.0.1 and 9.0.2), but VCF 9.1 does things a little differently, and VMware’s documentation on setting up an offline depot has lots of stuff us home-labbers don’t really use, need, nor want:
https://techdocs.broadcom.com/us/en/vmware-cis/vcf/vcf-9-0-and-later/9-1/lifecycle-management/binary-management-for-vmware-cloud-foundation/set-up-an-offline-depot-web-server-for-vmware-cloud-foundation.html

But setting up the offline depot isn’t all that bad…

Read on to see how I did it.

Script to generate JSON file from Excel Workbook for VMware Cloud Foundation

As you know, I prefer to use command line or API to do things. It’s faster, repeatable, and consistent.

The problem I have with VMware Cloud Foundation is the API or PowerVCF module won’t let you use the vcf-ems-deployment-parameter.xlsx parameter workbook, you must supply a JSON file.

I have another script I plan on releasing that is a fully automated deployment of VCF from start to finish, including monitoring ESXi deployment in the hardware OEM’s tooling, then deploys CloudBuilder, and leverages this portion to generate the JSON file, but I digress. That’s for another post.

In order to use this script you must have an existing CloudBuilder appliance running and know the admin & root passwords. It is a first edition, you must enter/change the variables to suit your environment. It doesn’t do any validation, will continue on error.

I’ve added the file to my GitHub VCF Preparation repo as generate-json.ps1 here: https://github.com/ThepHuck/VCF_Preparation/blob/main/generate-json.ps1

Thanks & happy scripting!

Host prep scripts for deploying & redeploying VCF

Hello! Long time, no scripting! I’ve been blowing through VCF, deploying, redeploying, and built some scripts to help me with this. Sharing is caring, read on to see what I’ve done…

How to bypass BAD PASSWORD: it is based on a dictionary word for vCenter VCSA root account

Today I am midway through setting up my lab and realized the reason VMware Cloud Foundation (VCF) is failing is because I set the wrong password in my JSON file for the root account on my vCenter appliance.

No big deal, right? Just SSH in and change it. I tried, and got this:

New password:
BAD PASSWORD: it is based on a dictionary word
passwd: Authentication token manipulation error
passwd: password unchanged

New password:

BAD PASSWORD: it is based on a dictionary word

passwd: Authentication token manipulation error

passwd: password unchanged

The bypass was actually easy. Presumably you’re already SSH’d in as root, so you just need to edit /etc/pam.d/system-password

# Begin /etc/pam.d/system-password

# use sha512 hash for encryption, use shadow, and try to use any previously
# defined authentication token (chosen password) set by any prior module
password  requisite   pam_cracklib.so   dcredit=-1 ucredit=-1 lcredit=-1 ocredit=-1 minlen=6 difok=4 enforce_for_root
password  required    pam_pwhistory.so  debug use_authtok enforce_for_root remember=5
password  required    pam_unix.so       sha512 use_authtok shadow try_first_pass
# End /etc/pam.d/system-password

# Begin /etc/pam.d/system-password

# use sha512 hash for encryption, use shadow, and try to use any previously

# defined authentication token (chosen password) set by any prior module

password requisite pam_cracklib.so dcredit=-1 ucredit=-1 lcredit=-1 ocredit=-1 minlen=6 difok=4 enforce_for_root

password required pam_pwhistory.so debug use_authtok enforce_for_root remember=5

password required pam_unix.so sha512 use_authtok shadow try_first_pass

# End /etc/pam.d/system-password

Remove enforce_for_root from the first line with pam_cracklib.so. Save the file, no need to restart any services, and retry passwd.

New password:
BAD PASSWORD: it is based on a dictionary word
Retype new password:
passwd: password updated successfully

New password:

BAD PASSWORD: it is based on a dictionary word

Retype new password:

passwd: password updated successfully

After that, I re-added enforce_for_root to the file and clicked RETRY back in VCF and all things are happy once again.

How to create an NSX CLI user, API user & set up NSX Plugin for vROps

TL-DR: See below for details on these commands

Create a local user in the NSX Manager’s CLI, then use the API to grant CLI privileges to that user.

Here’s how using a linux machine:
ssh admin@[nsxmanagerIP] enable config t user vrops-readonly password plaintext notrealpassword user vrops-readonly privilege web-interface
Log out of the NSX Manager (type exit) and stay logged into the linux machine.
Create cli-auditor.xml that contains this (replace brackets with greater/less than):
[?xml version="1.0" encoding="ISO-8859-1" ?] [accessControlEntry] [role]auditor[/role] [resource] [resourceId]globalroot-0[/resourceId] [/resource] [/accessControlEntry]
Add the user as an auditor in the NSX Manager as a CLI user:
curl -i -k -u 'admin:password' -H "Content-Type: application/xml" -X POST --data "@cli-auditor.xml" https://nsxmanagerip/api/2.0/services/usermgmt/role/vrops-readonly?isCli=true
Add your domain/vCenter user as an auditor in the NSX Manager (NOT as a CLI user):
curl -i -k -u 'admin:password' -H "Content-Type: application/xml" -X POST --data "@cli-auditor.xml" https://nsxmanagerip/api/2.0/services/usermgmt/role/[email protected]?isCli=false

Details for creating the NSX CLI user for vROps

Here’s the error

While building a new environment for my lab, I ran across an interesting thing yesterday.

I looked at my cluster’s VSAN health and saw this error:

It’s complaining that my hosts don’t have matching Virtual SAN advanced configuration items.

If you click on that error, you’ll see at the bottom where it shows comparisons of hosts and the advanced configurations:

It shows VSAN.DomMaxLeafAssocsPerHost and VSAN.DomOwnerInflightOps as being different between a few of my hosts. Looking at the image above, you’ll see node 09 has values of 36000 and 1024, respectively, while the other nodes 10-12 show 12000 and 0.

I immediately went to the host configuration advanced settings in the web client, searched VSAN and don’t see either of those. I even checked through PowerCLI and can’t see those:

SRM Connection or Session Limit Reached
Scroll down for the Update

Have you ran into one of these errors before:

     [exec] AxisFault
     [exec]  faultCode: ServerFaultCode
     [exec]  faultSubcode: 
     [exec]  faultString: fault.drextapi.fault.ConnectionLimitReached.summary
     [exec]  faultActor: 
     [exec]  faultNode: 
     [exec]  faultDetail: 
     [exec]     {urn:srm0}SrmFaultConnectionLimitReachedFault:<connectionLimit>10</connectionLimit>
     [exec] fault.drextapi.fault.ConnectionLimitReached.summary

[exec] AxisFault

[exec] faultCode: ServerFaultCode

[exec] faultSubcode:

[exec] faultString: fault.drextapi.fault.ConnectionLimitReached.summary

[exec] faultActor:

[exec] faultNode:

[exec] faultDetail:

[exec] {urn:srm0}SrmFaultConnectionLimitReachedFault:<connectionLimit>10</connectionLimit>

[exec] fault.drextapi.fault.ConnectionLimitReached.summary

     [exec] AxisFault
     [exec]  faultCode: ServerFaultCode
     [exec]  faultSubcode: 
     [exec]  faultString: dr.fault.SessionLimitExceeded
     [exec]  faultActor: 
     [exec]  faultNode: 
     [exec]  faultDetail: 
     [exec] 	{urn:srm0}MethodFaultFault:<vim25:reason>Invalid fault</vim25:reason>
     [exec] dr.fault.SessionLimitExceeded
     [exec] 	at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
     [exec] 	at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
     [exec] 	at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)

[exec] AxisFault

[exec] faultCode: ServerFaultCode

[exec] faultSubcode:

[exec] faultString: dr.fault.SessionLimitExceeded

[exec] faultActor:

[exec] faultNode:

[exec] faultDetail:

[exec] {urn:srm0}MethodFaultFault:<vim25:reason>Invalid fault</vim25:reason>

[exec] dr.fault.SessionLimitExceeded

[exec] at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)

[exec] at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)

[exec] at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)

Or in the GUI:Lost connection to remote SRM server. Unable to login. The maximum number of SRM users has been reached.RJ, from RJApproves.com, & I had been plagued by these messages for weeks, maybe even months. Well, we finally got it all figured out!

Keep reading for the fix!

VMware Site Recovery Manager & Active Directory – Part 2 – Domain Controllers in test environment

On May 25th, I published this post covering some scenarios on how to use Site Recovery Manager & Active Directory. Michael White from VMware responded with some good info. He had an awesome suggestion of using a script to cold clone a DC daily to use for testing.

Let’s take a look at some ways we can get this done:

To include Active Directory or not to include Active Directory, that is the question.

I’ve been reading a lot around VMware’s Site Recovery Manager and considerations surrounding Active Directory. Most of what you will read says ‘NEVER’ protect AD with SRM, only use native AD replication, especially since SRM & vCenter at your Recovery Site require AD to be running anyway.

But what if you have multiple domains for different uses? This is where the lines become blurred. Think about this for a second:

One AD environment (single forest/domain, no trusts) where vCenter & SRM live, call it infrastructure AD
A second AD environment (also single forest/domain, no trusts) for your application servers, call it application AD
You have infrastructure AD at both sites, SRM & vCenter authenticate accordingly
Protected site has application AD
Recovery site has nothing

Now here is where I say ‘why wouldn’t you protect AD with SRM?’ In a true disaster, the protected site is gone, no AD exists anywhere, so using SRM to bring them up on the recovery site makes sense. Is my logic flawed?

However, if I had my application AD living at both sites, using native replication, I agree 100% in not including your Domain Controllers in your SRM Recovery Plan. This leads to my concern…

Testing vs Planned vs Unplanned

This post will cover testing only. I’ll write a follow-up covering planned & unplanned failovers later.

To me, the only way to really test your DR plan (in this instance, your SRM Recovery Plan) is to not have anything different between them.

Site Recovery Manager 5.1 installation fails connecting to database – dbmanager could not initialize vdb connection

I recently ran into an issue when installing SRM and thought I’d share. I didn’t get a screenshot, but the error was something like this:

Failed to Initialize – dbmanager could not initialize vdb connection – odbc error

If you click skip from there, it’ll fail to create the tables, and eventually get to the point where you’ll have to roll back.

As it turns out, it was due to a c0mp73x”P@s$w0rd! that caused the problem. I’m not sure what characters killed it, but going to a less complex pAs5w0rd worked fine. ODBC worked fine, user & permissions were set up properly, it just came down to SRM not being able to handle the special characters. What’s strange is a similarly complex password works for vCenter.

Hope this helps, have fun out there!

ThepHuck

VMware

How To: VMware Cloud Foundation 9.0 and 9.1 Offline Depot – VMUG Advantage VCP-VCF Licenses

Script to generate JSON file from Excel Workbook for VMware Cloud Foundation

Host prep scripts for deploying & redeploying VCF

How to bypass BAD PASSWORD: it is based on a dictionary word for vCenter VCSA root account

How to create an NSX CLI user, API user & set up NSX Plugin for vROps

TL-DR: See below for details on these commands

Details for creating the NSX CLI user for vROps

VMware Virtual SAN Health failed Cluster health test

Here’s the error

VMware Site Recovery Manager 5.1 Connection & Session Limits – Updated

SRM Connection or Session Limit Reached
Scroll down for the Update

Keep reading for the fix!

VMware Site Recovery Manager & Active Directory – Part 2 – Domain Controllers in test environment

VMware Site Recovery Manager & Active Directory – Part 1 – Testing Recovery Plans with Active Directory

To include Active Directory or not to include Active Directory, that is the question.

Testing vs Planned vs Unplanned

This post will cover testing only. I’ll write a follow-up covering planned & unplanned failovers later.

Site Recovery Manager 5.1 installation fails connecting to database – dbmanager could not initialize vdb connection

VMware

TL-DR: See below for details on these commands

Details for creating the NSX CLI user for vROps

Here’s the error

SRM Connection or Session Limit Reached ***Scroll down for the Update***

Keep reading for the fix!

To include Active Directory or not to include Active Directory, that is the question.

Testing vs Planned vs Unplanned

This post will cover testing only. I’ll write a follow-up covering planned & unplanned failovers later.

SRM Connection or Session Limit Reached
Scroll down for the Update