Trending Posts

mint screenshot

Make Money Online: Documenting 10 Years of Failure

By John Ward / March 20, 2015 / 65 Comments
This is the history of my experience trying to make money online over the past 10 years or so. This is by far the longest post I've ever personally written and it's more of an autobiography than a blog post....
pancakeswap prediction bot

I Made an Automated Pancakeswap Prediction Bot

By John Ward / October 14, 2022 / 0 Comments
A few weeks ago I got the idea to try to automatically bid on Pancakeswap's Prediction game. So I decided to try to build a Pancakeswap Prediction bot to take on the task. I did this just to learn about...
IBM Watson Avatar Logo

IBM Watson Explorer

By John Ward / April 6, 2015 / 1 Comment
I'm going to talk a little bit about IBM Watson Explorer (WEX). A few people have contacted me about what I do at my day job as a Watson Explorer Consultant. Since this is my personal site I don't usually...
2022

My 2022 Recap and 2023 Plans

By John Ward / January 13, 2023 / 0 Comments
I haven't been posting to my blog that often, but I wanted to recap 2022 and lay out some of my plans for 2023. Overall, 2022 was a pretty good year for me, and I made some progress on business...
watson explorer vs elastics earch

Watson Explorer vs Elasticsearch for Enterprise Search

By John Ward / May 12, 2020 / 0 Comments
Are you interested in IBM Watson Explorer vs Elasticsearch? Recently, I had to do some comparisons between IBM Watson Explorer and Elasticsearch for a project. I spent some time going through the features of both platforms and found some interesting...
what is ibm watson

What is IBM Watson

By John Ward / July 24, 2020 / 0 Comments
There is a lot of confusion about what exactly IBM Watson is? I'm going to try to clear that up a little bit in this blog post. I'll go into the history of IBM Watson and what IBM is doing...

One AdSense Change Dramatically Increased My Earnings…

By John Ward / March 9, 2015 / 12 Comments
... and I have no idea what it is. I used to blog actively on my tutorial site, TeamTutorials. In it's prime the site would see over 100k page views per month. Even in the prime the site barely met...

Are Products the Road to Prosperity?

By John Ward / May 13, 2015 / 1 Comment
A few weeks ago a wrote a somewhat popular post about my past experiences trying to make money online. I went through the ups and downs of working as an affiliate promoting other people's products. At the conclusion of the...
xrp

What’s the Future of XRP

By John Ward / September 20, 2021 / 0 Comments
The XRP token was a fast-growing cryptocurrency until the U.S. Securities and Exchange Commission (SEC) filed a lawsuit against the parent company, Ripple,  back in December of 2020. This left many people scrambling to dump their XRP holdings and several...
IBM Watson explorer connector list

IBM Watson Explorer Connector List

By John Ward / March 4, 2020 / 0 Comments
IBM Watson Explorer Foundational Components include many out-of-box connectors for various types of data sources including web, filesystems, shares, databases and content management systems. This allows Watson Explorer (WEX) to be a very versatile tool for ingesting and indexing from...

Regular Expression Converter for Watson Explorer Engine

Sometimes it’s useful to extract data from a Watson Explorer content node using regular expressions. In this post, I’ll show you how to extract data using a regular expression and create a new content node for that specific data.

To start off we will use the default example-metadata collection. We will attempt to extract any 3 digit number from the snippet content to make the regex easy. You can do much more advanced regular expressions if necessary.

First go to the example-metadata collection and click “test-it”

Then click on “Test-it” next to the first result:

Now scroll down and look at the output of the ” Create Metadata from Content” converter:

In the output, you will see the snippet content has the number 500 in it.


We will make a converter that will extract any 3 digit number into a new content. First, add a new converter:

Select the Regex entity extraction converter and click Add.

In the converter configuration, in the list of entities node names enter “my-regex-node” and the target node of “snippet”. Then click OK.

Now on the sidebar of WEX click the + next to XML.

enter the following names:

Now update the xml node to include your regular expression like below. Note that my regex is “[0-9]{3}” to match 3 digits. Save the node.

<entities name="my-regex-node">
  <entity name="regex-rule" weight="-1">
    <regex>[0-9]{3}</regex>
    <replace>viv:str-to-mixed(viv:current-string())</replace>
  </entity>
</entities>

Return to the collection and do a test-it, as we did above, down to that same first result. If you look at the converter trace you will see the regex converter is running.

Click on the 910 output to see your new content node:

Now you can use the new “regex-rule” content in your search application.

Published by

John Ward

I've been in working in the tech space since about 2004. I've spent time working with Artificial Intelligence, Machine Learning, Natural Language Processing, and Advertising technology.