May 29, 2014

After working with the raw materials to install Hortonworks Data Platform HDP on Windows in an Azure IaaS VM, I came across the automated install tool from Hortonworks.

http://hortonworks.com/blog/automated-install-hdp-2-1-hadoop-windows/

You get it from github and run the exe included.  It uses a config file to describe where all the install files are such as python, java, etc. and it goes through and installs them all.  It then runs smoke tests to see if everything is working.

An easy way to get a cluster up and running!


May 14, 2014

I was working with Power Query for Excel and needed to parse a tab separated file.  Unfortunately there isn’t a TSV option directly in Power Query.  I couldn’t find much information on how to tackle this so I’ll post here.

The first thing I ended up doing was importing the tab separated file as a CSV.  This added three steps to Power Query.  The first step looked like this. 

= Csv.Document(File.Contents(“<FILE LOCATION>”),null,{0,42,53},null,1252)

If you look at the documentation for Csv.Document, you’ll see that the third parameter is the delimiter.  Unfortunately something like “/t” doesn’t work in PQ so I had to figure out what it is.

If you use #(tab), that is what you need to “escape” tab and send the value to the function.  You’ll need to have quotes around it like this:

= Csv.Document(File.Contents(“<FILE LOCATION>”),null,”#(tab)”,null,1252)

I found that this messed up the next steps so I ended up creating a blank query and adding this as the source step.  This set up the FirstRowAsHeader step and the ChangedType step.


Announcement

November 14, 2011

Well – I’ve been so busy lately that I haven’t had much time to blog, but I wanted to make an announcement here that I’m taking a job at Microsoft. I’ll be starting in a couple weeks as a TS for the Data Platform out of the North Central District. I’m really excited and hopefully I’ll be doing a little more blogging here as a result.


Management Studio Find and Replace

July 8, 2010

Some pretty cool stuff…

Did you know that SQL Server Management Studio makes a nice little regular expression (regex) editor and tester?  Go to find and replace (Ctrl-H) and down at the bottom of the dialog box is a check box names “Use”.  Check that box and choose Regular Expression.  Now to the right of the box where you type the text, there is a glyph that will fly out a regular expression character helper.  You can type in text in Management Studio and test a regular expression against it.

You can also leverage this regular expression functionality to do a multi-line find and replace.  You can substitute in the old C# (C++) syntax \n for a new line.  I needed to replace a block of text with a multi-line block of text and this worked beautifully!

Don’t forget you can use Management Studio to search in files too.  A nice GREP tool too!  Hit the glyph just to the right of Quick Replace or Quick Find at the top and you’ll get the option to search in files.


Silverlight PivotViewer Released

June 29, 2010

Now this is cool!

http://www.silverlight.net/learn/pivotviewer/

Mark