National Coding Week – Golang, TiDB, and AI, Oh my!

This week is National Coding Week. The theme for 2023 (at least according to codingweek.org) is Artificial Intelligence. I heard about this on a LinkedIn post from a former colleague who was taking this as an opportunity to learn Go.

I just learned that this week is National Coding Week. Now I have a flimsy excuse to start taking baby steps learning Go! #nationalcodingweek

As it happens, Go is used quite a lot at my new employer, PingCAP, in their development of TiDB (a modern distributed SQL database) so I thought that making a few “experimental” changes to TiDB would be a good way to learn a new language and scratch the AI itch at the same time.

Deciding on a Project

For my project I wanted to accomplish the following:

Learn some Go programming
Learn a bit more about TiDB
Do something AI related

After seeing a recent blog post by Daniël van Eeden about extending TiDB with custom functions, I thought it would be interesting to follow that example and add some custom functions to TiDB. I have been looking at some of the capabilities that are made available in vector databases, and thought it might be worthwhile to add some experimental functions that would allow users to calculate the distance between vectors.

These kinds of functions are useful for users who are storing vector embeddings and want to be able to calculate the distance between them. Measuring the distance between vectors can be used to calculate similarity between two vector embeddings persisted in a database, or it could also compare persisted vector embeddings with a vector embedding generated by a search query.
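To make the search-query use case concrete, here is a minimal, standalone sketch of ranking stored embeddings against a query embedding by cosine similarity. It is not part of the TiDB change: the Embedding type, the RankBySimilarity function, and the sample vectors are all hypothetical names and data chosen for illustration.

```go
package main

import (
	"fmt"
	"math"
	"sort"
)

// Embedding is a hypothetical type pairing a document ID with its vector.
type Embedding struct {
	ID     string
	Vector []float64
}

// cosine returns the cosine similarity of two vectors. To keep the sketch
// short it returns 0 for mismatched lengths or zero-magnitude vectors.
func cosine(a, b []float64) float64 {
	if len(a) != len(b) {
		return 0
	}
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// RankBySimilarity returns the IDs of the stored embeddings, ordered from
// most similar to least similar to the query vector.
func RankBySimilarity(query []float64, stored []Embedding) []string {
	type scored struct {
		id    string
		score float64
	}
	results := make([]scored, 0, len(stored))
	for _, e := range stored {
		results = append(results, scored{id: e.ID, score: cosine(query, e.Vector)})
	}
	// Highest similarity first; stable so ties keep their stored order.
	sort.SliceStable(results, func(i, j int) bool {
		return results[i].score > results[j].score
	})
	ids := make([]string, len(results))
	for i, r := range results {
		ids[i] = r.id
	}
	return ids
}

func main() {
	stored := []Embedding{
		{ID: "a", Vector: []float64{1, 0}},
		{ID: "b", Vector: []float64{0, 1}},
		{ID: "c", Vector: []float64{1, 1}},
	}
	// The query points almost along the first axis, so "a" ranks first.
	fmt.Println(RankBySimilarity([]float64{1, 0.1}, stored)) // prints [a c b]
}
```

A linear scan like this is fine for a handful of vectors; it is the straightforward approach rather than an optimized one.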

While there are definitely more state-of-the-art approaches to optimizing performance and scaling for these kinds of vector similarity searches, for this experimental effort I have stuck with the more straightforward cosine similarity and dot product functions.

Getting Started

To get started with the project I relied on the detailed write up from Daniël van Eeden, as well as TiDB’s excellent developer guide.

I followed the getting started guide to set up my IDE (Visual Studio Code) on my Mac and get started.

It was remarkably easy to get the TiDB database up and running following the guide so I won’t replicate that documentation here.

Learning by Doing

My previous programming experience was primarily in Java so adjusting to some of the syntactic differences between the two was interesting. Taking a look at some of the existing examples in the TiDB code base was very helpful in figuring it out, as well as being able to dive into some of the Go documentation as needed.

The first thing I did was follow the example from the TiDB developer documentation, as these custom functions will be compiled into TiDB. The initial changes I made were to update functions.go so that TiDB would recognize the function names in SQL statements. At this point I also updated builtin.go to point to the function implementations I was going to write (as new builtin functions, compiled into TiDB).

Next, I set about defining the functions themselves. This helped me to learn a bit about how functions and methods are defined in Go. I particularly liked the ability to define multiple return values (somewhat reminiscent of returning multiple values as tuples in Haskell) to help with handling error cases. This was an interesting change of pace from Java and its use of exceptions to handle some of these cases.
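As a small illustration of that pattern, here is a minimal sketch of a function returning both a value and an error, with the caller checking the error explicitly instead of catching an exception. SafeDivide is a hypothetical helper made up for this example; it is not part of the TiDB change.

```go
package main

import (
	"errors"
	"fmt"
)

// SafeDivide is a hypothetical helper that returns a result and an error,
// rather than throwing an exception as Java code typically would.
func SafeDivide(a, b float64) (float64, error) {
	if b == 0 {
		return 0, errors.New("division by zero")
	}
	return a / b, nil
}

func main() {
	// The caller inspects the error value right where the call happens.
	if result, err := SafeDivide(1, 0); err != nil {
		fmt.Println("error:", err)
	} else {
		fmt.Println("result:", result)
	}
}
```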

I decided to extract the calculation of the cosine similarity into a function that operates directly on float64 slices. I originally did this because I was thinking of importing the functions from a library (which would likely have operated on float64 values rather than the internal TiDB types), but after an initial investigation it seemed that these functions were easy enough to implement directly (for the purposes of this experimental project), so I just went ahead and did that:

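// Cosine returns the cosine similarity of two equal-length, non-empty vectors.
func Cosine(a []float64, b []float64) (cosine float64, err error) {
	if len(a) != len(b) {
		return 0.0, errors.New("invalid vectors: two arrays of the same length were expected")
	}
	if len(a) == 0 {
		return 0.0, errors.New("invalid vectors: two non-zero length arrays were expected")
	}

	sum := 0.0 // dot product of a and b
	s1 := 0.0  // squared magnitude of a
	s2 := 0.0  // squared magnitude of b

	for i := range a {
		sum += a[i] * b[i]
		s1 += a[i] * a[i]
		s2 += b[i] * b[i]
	}
	// A zero-magnitude vector would otherwise divide by zero and yield NaN.
	if s1 == 0 || s2 == 0 {
		return 0.0, errors.New("invalid vectors: two non-zero magnitude vectors were expected")
	}
	return sum / (math.Sqrt(s1) * math.Sqrt(s2)), nil
}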
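The dot product function is not shown in this post. A minimal sketch that follows the same conventions might look like the following; the name DotProduct and the error messages are assumptions for illustration and may not match what is in the actual commit.

```go
package main

import (
	"errors"
	"fmt"
)

// DotProduct is a hypothetical counterpart to Cosine: it returns the sum of
// the element-wise products of two equal-length, non-empty vectors.
func DotProduct(a, b []float64) (float64, error) {
	if len(a) != len(b) {
		return 0, errors.New("invalid vectors: two arrays of the same length were expected")
	}
	if len(a) == 0 {
		return 0, errors.New("invalid vectors: two non-zero length arrays were expected")
	}

	sum := 0.0
	for i := range a {
		sum += a[i] * b[i]
	}
	return sum, nil
}

func main() {
	// 1*4 + 2*5 + 3*6 = 32
	result, err := DotProduct([]float64{1, 2, 3}, []float64{4, 5, 6})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(result) // prints 32
}
```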

I also needed to decide how to store the vector embeddings in the database. To make life easy on myself I decided to forgo adding a custom type and instead use the JSON data type that is available in TiDB. The functions would operate on JSON arrays of numbers. To do this I used some of the useful capabilities exposed in TiDB types to convert from the JSON type to a slice of float64 in Go:

// AsFloat64Array converts a binary JSON array of numbers into a []float64.
func AsFloat64Array(binJson types.BinaryJSON) (values []float64, err error) {
	if binJson.TypeCode != types.JSONTypeCodeArray {
		return nil, errors.New("invalid JSON array: an array of numbers was expected")
	}

	arrCount := binJson.GetElemCount()
	values = make([]float64, arrCount)
	for i := 0; i < arrCount; i++ {
		elem := binJson.ArrayGetElem(i)
		// Note: fakeSctx is not defined in this excerpt.
		values[i], err = types.ConvertJSONToFloat(fakeSctx, elem)
		if err != nil {
			// Stop at the first element that cannot be converted.
			return nil, err
		}
	}
	return values, nil
}
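For readers who want to try the same conversion outside of TiDB, here is a minimal standalone sketch that uses Go's standard encoding/json package rather than TiDB's BinaryJSON type. ParseVector is a hypothetical name, and this is not how the TiDB change itself does the conversion.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// ParseVector is a hypothetical helper that decodes a JSON array of numbers,
// such as "[1.0, 2.0, 3.0]", into a []float64 using only the standard library.
func ParseVector(s string) ([]float64, error) {
	var values []float64
	if err := json.Unmarshal([]byte(s), &values); err != nil {
		return nil, fmt.Errorf("invalid JSON array of numbers: %w", err)
	}
	return values, nil
}

func main() {
	values, err := ParseVector("[1.0, 2.0, 3.0, 4.0 ,5.0]")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(values) // prints [1 2 3 4 5]
}
```

One caveat of this sketch: the JSON literal null decodes to a nil slice without an error, so a caller that needs to reject it should check the length of the result.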

All the changes I made (including some basic test cases) are available on GitHub (see initial commit).

Phase 1 Complete

All of the changes I have made now enable me to easily calculate the cosine similarity and dot product of two JSON arrays in TiDB using SQL. Using the MySQL command line client (TiDB is wire compatible with MySQL) I can run SQL like the following:

mysql> SELECT 1 AS r, x_cosine_sim('[1.0, 2.0, 3.0, 4.0 ,5.0]','[1.0, 2.0, 3.0, 4.0, 5.0]') AS result   
    -> UNION
    -> SELECT 2 AS r, x_cosine_sim('[1.0, 2.0, 3.0, 4.0 ,5.0]','[-1.0, -2.0, -3.0, -4.0, -5.0]') AS result
    -> UNION
    -> SELECT 3 as r, x_cosine_sim('[1.0, 2.0, 3.0, 4.0 ,5.0]','[-1.0, 2.0, 3.0, 4.0, -5.0]') AS result
    -> ORDER BY r ASC;
+------+---------------------+
| r    | result              |
+------+---------------------+
|    1 |                   1 |
|    2 |                  -1 |
|    3 | 0.05454545454545454 |
+------+---------------------+
3 rows in set (0.00 sec)
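The third row is the least obvious result, so here is a short standalone check of the arithmetic behind it. The dot product of the two vectors is 3 and each has a squared magnitude of 55, which gives 3/55, or roughly 0.0545. SimilarityParts is a hypothetical helper written just for this check.

```go
package main

import (
	"fmt"
	"math"
)

// SimilarityParts is a hypothetical helper that returns the dot product and
// the squared magnitudes of two equal-length vectors, so that the pieces of
// the cosine similarity formula can be inspected separately.
func SimilarityParts(a, b []float64) (dot, normA2, normB2 float64) {
	for i := range a {
		dot += a[i] * b[i]
		normA2 += a[i] * a[i]
		normB2 += b[i] * b[i]
	}
	return dot, normA2, normB2
}

func main() {
	a := []float64{1, 2, 3, 4, 5}
	b := []float64{-1, 2, 3, 4, -5}

	dot, na, nb := SimilarityParts(a, b)
	// dot = -1 + 4 + 9 + 16 - 25 = 3; both squared magnitudes are 55.
	fmt.Println(dot, na, nb) // prints 3 55 55

	// Cosine similarity is dot / (|a| * |b|), which is approximately 3/55.
	fmt.Println(dot / (math.Sqrt(na) * math.Sqrt(nb)))
}
```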

Reflections…

For some people (like myself), having a small project to work on is a useful tool for learning a new programming language. While the code I have shared is not professional quality (and could definitely be improved and made more idiomatic), it achieved its goal of helping me get more familiar with the Go language and its available tooling. Thanks to Chris and Daniël for the indirect inspiration for this project!

Managing Data at Scale in VMware and Hybrid Cloud Environments

Thanks to the VMware User Group I was recently able to share some of my thoughts on managing data at scale across VMware and Cloud environments. In the session I shared some stories covering how operators were managing data using VMware capabilities like vSphere’s DRS and Storage Policies, as well as concepts like Rubrik’s SLA Domains. I covered some interesting topics and customer stories, including:

Imperative and declarative automation approaches
Policy driven management
Application of machine learning to data security
Managing data across edge, core, and cloud

If this sounds like your kind of thing then watch the webinar on Managing Data at Scale in a VMware and Hybrid Cloud Environment on-demand.

VMworld 2018 Session Recommendations

VMworld 2018 is just a few short weeks away. Many of you reading this post have no doubt already filled out your schedule, but for those who have procrastinated, here are a few sessions that I am looking forward to. To make it interesting I’m limiting my recommendations to one per day. While at the show, I fully believe you should also take advantage of mingling with others in the community and browsing the show floor to get a sense of the innovation that is happening around the ecosystem.

Sunday – Demystifying vSAN Management for the Traditional Storage Administrator [HCI1475QU]

As a fan of vSAN and having listened to Pete Koehler on many topics, I’m sure this will be a great session for anyone looking to get a handle on how vSAN differs from traditional storage.

Monday – Application modernization with VMware Cloud on AWS [HYP2145BUS]

I don’t think I’m going to be able to watch this one live due to other commitments but will be eagerly watching the replay. I’ve presented with Wen before and also watched Aarthi present so I know this will be a great session for anyone attending.

Tuesday – VMware NSX for Service Providers: A Technical View [HYP2406BU]

Service provider networking is an interesting beast. If networking is your thing then this promises to be an interesting session. You can always trust Ray to get into the details, and I expect Tina to bring the service provider perspective into the mix.

Wednesday – Confluent Platform: Introduction and Deployment on PKS [CODE5593U]

There are a lot of excellent sessions happening on Wednesday. One that is a little outside my usual area is this one on running Confluent on top of Pivotal Container Service (PKS). It should be an interesting change from the usual VMworld topics.

Wednesday Bonus – Ransomware Threat Recovery Using Rubrik Polaris [SAI3712BUS]

I’m going to cheat and share another session on Wednesday, just because I know it’s going to be cool. It covers one of the latest capabilities from Rubrik (my employer) and is presented by a couple of excellent presenters. Promises to be enlightening!

If you’d like to learn more about Polaris before this session, check out the Polaris announcement blog post.

Thursday – Architecting at the Tactical Edge with VMware vSAN and vRealize [HCI1691BU]

I’ve had a bit of an inside view into what has been happening behind the scenes for this session. It’s going to be interesting to hear about some of the more challenging aspects of this project, and how they were addressed. Promises to be an informative and interesting session with some good presenters!

Other Sessions

If the sessions above aren’t enough to fill your schedule, there are several more excellent sessions being presented at VMworld this year. Here are a few of my favorite speakers. Any of their sessions should be worth your time if your tastes skew a bit more technical:

Rebecca Fitzhugh – has an awesome array of presentations this year, all of which will no doubt be amazing
Duncan Epping – let’s just say he knows how to present and is not shy of addressing both the technical details and high level perspectives
Christian Dickmann – enjoy listening to his thoughts on simplifying operational management
Cody Hosterman – if vSphere storage is your thing, you’ll be at home
Christos Karamanolis – always interesting to listen to his forward looking thoughts

There are of course many other great presenters, but hey this list is getting long already!

If you’re attending VMworld this year have a great time! If you want to connect with me at the conference feel free to reach out to me on Twitter @BenMeadowcroft.

Thoughts on Product Management

On my last day at VMware I was pulled aside by Glenn Sizemore, who interviewed me for the “career day” episode of the vSpeaking podcast. Glenn asked me a few questions about my role, and I thought it would be helpful to write up my responses and add a bit more detail for people who are interested in the Product Management role.

How would I describe Product Management?

Product Management for enterprise software is about building the right products, products that provide real, demonstrable value to the customer. As a Product Manager you have to be able to get to grips with some of the underlying business challenges faced by customers. Keeping everyone aligned on that north star ensures that, as a team, we spend our effort building something that customers both value and are willing to pay for.

How did I decide to get into Product?

I trace my Product Management roots back to my time at a small startup company in the UK called Mobysoft. When I joined as the first full-time employee the company was much smaller than it is now. Being involved at an early stage gave me the opportunity to take on a lot of responsibility, build the engineering team, and develop a new service called RentSense.

In the early days of developing the new service I got to work closely with customers to understand the challenges they were facing and ensure that the product my team were developing was going to hit the mark. That was my first experience on the Product side of the fence and I definitely wanted more.

I decided at this point that I was interested in transitioning my career from engineering to Product Management. I made the move to the USA to pursue my MBA and began my transition into Product Management.

3 things I love about Product Management?

First, I love the satisfaction of seeing something through from beginning to end. Being able to work with customers to identify their needs and then work on bringing to market technology solutions to solve those challenges and close that loop is hugely satisfying for me.

Second is the people. I get to work with some incredibly intelligent peers across multiple disciplines. As a former engineer I always appreciate being able to work with high caliber engineers and I have been incredibly honored to have worked with some exceptionally talented people during my times at AWS, VMware, and now Rubrik. Being able to share the context of the customers’ pain points with the engineering teams is one of the things that I think Product Managers absolutely need to do. Ensuring that customer empathy is baked into the product throughout its execution is how good products are forged.

Third, as a self-confessed data geek, I love the opportunity to dive into data. Direct customer interaction is critical to gathering insights, but the qualitative insight has to be married to quantitative analysis. Without this combination it’s all too easy to fall into the trap of building a great solution for just one customer and being a consultancy versus a Product company.

Something people don’t tell you before taking the job?

Probably the biggest surprise for me was the opportunity to work collaboratively across many different teams. Not just cross-functionally with the teams that were involved in delivering the same product, but also with teams across the company working on a variety of different initiatives. Ensuring that as a PM you remain focused is critical, but being open to working with adjacent teams (both within and outside the company) can bring a lot of leverage in delivering value to customers.

Some Thoughts on VMware Cloud on AWS Stretched Clusters

Companies are considering a variety of migration strategies as they look to leverage the cloud. For VMware Cloud on AWS (VMC), migration is one of the key use cases that VMware has promoted (alongside Disaster Recovery). A key benefit touted by VMware for their offering is the ability to re-host applications without having to re-platform or re-architect. However, this is not without caveats when it comes to availability and resiliency.

For a customer migrating to the cloud, delivering the right level of resiliency and availability is a key concern. On AWS the Availability Zone is a key building block for designing available architectures. For customers who are willing to re-architect their application, designing the application to ensure resiliency in the face of an AZ loss is critical, as well as ensuring customers are eligible for AWS SLA credits in the event of an EC2 outage! But what options are available for delivering multi-AZ availability when pursuing a re-host migration strategy?

For VMware Cloud on AWS, which delivers this re-host capability, this has also been one of the most significant limitations of what is currently available. When customers provisioned a new SDDC it could only be placed within a single Availability Zone (AZ). The combination of vSphere HA, vSAN’s erasure coding, and VMC’s auto-remediation of failed hosts ensured that failures of the individual bare metal EC2 instances could be handled well. However, there remained an issue of protecting against failures of entire Availability Zones.

With the unveiling of a technology preview of their new stretched clustering capability, VMware is presenting a differentiated offering. Stretched networking, by NSX, and stretched storage, from vSAN, combine with vSphere’s HA to provide a platform that is resilient to AZ failure, without having to re-architect or re-platform your application to take advantage of multiple Availability Zones. On the vSAN side, the increased costs of mirroring the storage are now offset by the introduction of deduplication and compression support. More details were shared during VMware’s recent Cloud Briefing event and I also spoke about VMware’s plans here during my VMC storage deep-dive session at VMworld.

It will be interesting to see how VMware’s customers evaluate this new offering when it moves out of tech preview status and into General Availability.

VMware Site Recovery VMworld 2017 Session

During VMworld 2017 I shared a tech preview, with GS Khalsa, of the VMware Site Recovery service that’s now available as an add-on to VMware Cloud on AWS. While we’ve already made several enhancements to the service, over and above what you’ll see in the tech preview, I think it still illustrates many of the exciting new options available with VMware Site Recovery today!

Check out the session online

VMware Cloud on AWS Storage Deep-Dive

At VMworld earlier this year I presented a deep-dive on vSAN storage for VMware Cloud on AWS with Matt Amdur. This was an interesting topic as we’d had to deliver some enhancements to vSAN for deployment onto the Amazon EC2 Bare Metal instances. Now that they’ve been released there are a few more public details that can be shared!

At VMworld we covered a few key topics including the host and cluster configuration on EC2 Bare Metal instances, how we operate the storage in AWS a little differently from how on-premises customers would operate it, and a few peeks into the unique features delivered for VMware Cloud on AWS, along with a look into our plans.

At the recent re:Invent conference, AWS launched their new EC2 Bare Metal instances. VMware was an early customer of this instance type and worked collaboratively with AWS to ensure the new bare metal platform was a good platform for running ESXi and vSAN. With the launch of the solution, AWS was more open about sharing details of their platform. Check out the session by Aaron Blassius and Matt Wilson sharing details on the new platform we are using for VMware Cloud on AWS.

VMware Cloud on AWS – Disaster Recovery and other use cases

I was lucky enough to be able to share some details about the new VMware Site Recovery service at the AWS re:Invent conference alongside Wen Yu. In the VMware Cloud on AWS technical deep dive and native service integration session we covered some key use cases customers are looking to address with VMware Cloud on AWS including:

Disaster Recovery
Database Migration
Securing web/content management

Check out the session (recorded the day before the official launch) on YouTube.

Launch Often! VMware Cloud on AWS

At VMworld, back in August, the first version of VMware Cloud on AWS was launched. Now three months later we’re doing it again! As the Product Manager owning the storage and disaster recovery initiatives it’s been a great experience to work with the joint VMware and AWS teams as we delivered the storage platform for VMware Cloud on AWS (built with vSAN), and are now delivering new Disaster Recovery (DR) capabilities with VMware Site Recovery.

Delivering improved resiliency and DR options has been an important focus for VMware Cloud on AWS. This new capability allows customers to protect their mission-critical workloads running on-premises to VMware Cloud on AWS, or vice-versa. We also support protection between VMware Cloud on AWS SDDCs. This enables customers to protect workloads across different AWS Availability Zones, or even between AWS Regions with the newly announced support for US East (N. Virginia).

It’s also been a great experience to work closely with some of our forward-looking customers as we’ve been developing VMware Cloud on AWS. Listen to one of these early customers share their view of the collaboration between VMware and AWS, and the new capabilities we’re delivering.

More details on the VMware Site Recovery solutions can be found on the VMware Cloud Services site: