Sep 26 2013

PASS Votes

I’ve already made my own intentions clear. I’m casting one of my votes for Allen Kinsel. But, I get three votes. Some of you may be asking yourselves, why didn’t recommend two other candidates?

Honesty time.

First, no one else asked me to. I’m actually glad of this. If I received requests for help from more than three of the candidates, I would be very hard pressed to choose. Yes, I ultimately have to, but I don’t have to publicly reject anyone. At this point, it looks like I’m ready to vote for every other candidate. That’s a good thing from my standpoint, maybe not so much from theirs. However, if anyone wanted a little help, they should have asked. That’s a good policy with life in general.

Second, I want Allen on the board. Rather than tell you three people to vote for and watch votes possibly dissipate, I want you focused on one thing, Allen… and two other people. This is pure selfishness on my part, but hey, look at that, I’m the Scary DBA. I get to be selfish on my own blog.

Third, I’m kind of stuck. I know where my second vote it is going (and no, I’m not going to tell you), but I am frankly agonizing a little about my third vote. Look at this list. If you know these people as I do, you’d be stuck too.

Fourth, I’m actually considering not casting a third vote because of how this EMEA/Open/US thing is laid out. It makes me think we could see someone win the vote, but not get the seat and that kind of sucks. Discussion on it here. That might be something to address in the future. It feels wonky, but maybe that’s just me. Note, I said considering. Nothing is carved in stone here and I get three votes, pretty sure I’ll use them all. Think of it like Chicago, vote early & often.

Finally, like every single election ever held, to a degree, this is a popularity contest. And I honestly hate that. I hate it about all elections (traditionally, prettier people win, and just how messed up is that), but especially about this one because these are all my friends and I’m going to see friends win and see friends lose. Trust me when I tell you, the friends who lose will be very unhappy. Just be glad that the vote results will be announced next week, not at the Summit the way they used to be. Great googly moogly that made for some really depressing (if, later, fun-filled) evenings (if you weren’t there, you weren’t there). It was much simpler to point to one friend out of many and give you good reasons to vote for him (that’s Allen Kinsel by the way in case anyone lost the thread).

At this point, voting is open. If you met the qualifications you’ve received your email. Go get it done.

Thank you to everyone involved in this. Thanks to the Nominations Committee. It’s a pretty horrible job, but you stepped up and took care of business. Thanks to each and every person who volunteered to run for the board. You’re all better and braver than I.

Oh, and if you are eligible and you don’t vote, please do not let me hear you complaining about how PASS is run. This is your moment to make a difference. If you don’t take it, you just gave up the right to complain about the outcome.

 

Sep 23 2013

PASS Votes and So Should You

To put it bluntly, you need to go cast a vote for Allen Kinsel for the PASS Board.

In his own words, here’s why Allen is running.

Why am I supporting Allen? First, he’s a friend. But ignore that. This isn’t, or at least shouldn’t be, a popularity contest. It’s about getting what you want, what you need, out of the PASS organization. I absolutely believe that Allen has the best interests of this organization at heart. He wants to make a difference for the better. More than that, I’ve seen him working as a volunteer within the organization where he does make a difference, does make things better. He brings passion to the job. If you’ve ever volunteered to help run any kind of organization, you’ll know that one of the single most important things that you can apply is passion. I also think that Allen has a vision for where he wants this organization to go. Even more than that, I think Allen’s vision is the right one. So we’re talking drive, passion, a proven track record and a vision.

What else can I say to convince you? He likes puppies. There. You can vote for Allen Kinsel, lover of puppies.

Or, don’t listen to me at all. Instead, listen to all these people.

Here are the candidates for office. I am actually friends or acquaintances with every one of them. They are all good people. You can vote for three of them. Make one of those votes for Allen Kinsel.

Vote opens on the 25th and closes on the 30th. You are eligible to vote if you are a member of PASS.

Aug 26 2013

Make the PASS Summit Work For Your Employer

logo_headerIt was pointed out to me that since PASS is such a huge networking event, any employer would be crazy to send a good employee to the event. They’ll just come back and hand in their two weeks notice. You know, that’s entirely possible. But, let’s not confuse networking with job hunting. Funny enough, while I did get my latest job while at the PASS Summit, it wasn’t through the personal network that I had built up over the years of going to, and speaking at, the Summit.

I use that network as an extensive knowledge base. If I have a question about Availability Groups, I have at least three different people I can reach out to. If I get stuck on some internals question, I have other individuals I speak with. I know who to talk to if I get stuck in PowerShell. Think about it. How much more valuable does that make me to an employer? They’re not just hiring me. They’re hiring my network. But that’s only part of how you want to convince your boss to send you to the PASS Summit. Let’s go over a few items that will make it easier for you to convince your employer it absolutely is in their best interest to send you to the PASS Summit.

My Knowledge Base

Maybe this one is obvious, but you should talk to your boss about the addition of more skills to your skill set, an improvement of your overall knowledge and your worth to the company. There is a ton of excellent learning opportunities at the Summit covering the entire length, breadth and depth of SQL Server and it’s attendant products. These sessions are lead by some of the most knowledgeable and skilled people in the industry. Further, they’re practically slavering at the bit to have you ask your difficult question so that they can exercise their skills and expand their knowledge. You can learn more, faster, at the PASS Summit than almost anywhere. That’s going too help your employer because you will be a better employee

Our Current Problem

Just about every year in the 6-8 weeks leading up to the PASS Summit, I would start collecting questions. What particular pain points are we experiencing with the products that I need to grab 10 minutes with a Microsoft engineer to talk about. Oh, didn’t I mention that fact? Yeah, the guys who built the product are frequently at the Summit (although more are there when it’s in Seattle). You can take your immediate problems straight to these people. Further, there’s likely to be an MVP or MCM standing near by who might be able to help out too. Or, you can try the Customer Advisory Team (CAT) who always have a number of representatives there. In short, you can get pretty close to premier support without wasting a premier support ticket.

Our Future Direction

Your company needs to make decisions about future direction. You’ve seen the marketing hype. Now, what do the people who are working with the newest stuff every day have to say? Can you get more information by attending sessions that are not put on by Microsoft on emerging technologies? Yes, frequently, but not always, you can. The PASS Summit is the place to see this. Microsoft doesn’t just develop things and then toss them over the fence to see what works (mostly). Instead, they have companies and individuals working with them all the time to develop new directions for the product. Those people and organizations are frequently at the Summit, displaying new stuff on the vendor floor or giving presentations about the new directions they’re taking the technology. You can get a better understanding if your company’s plans are going to work well going into the future. Even if the plan is best summed up as “We’ll sit on SQL Server 2000 until it rots around our ears.” Others are doing it too. Find out how it’s working out for them.

Our Team Skill Set

Most companies are not going to want to send all of the database development team or DBA team, or development team away for a week. Instead, they’ll send one or two people from each team (maybe less). So your team loses out, right? Wrong. Two things. First, coordinate. Make sure that you cover as many sessions as you possible can. Don’t overlap. When I was working on a team heading to the Summit we would divide up sessions to make sure things got covered that the company needed or that we needed as individuals. While I may want to see speaker X do her session on indexing again, my co-worker has yet to see it, so I’ll send them. And make sure you have a couple of sessions picked for a time period because the session you’re in could be a bad choice. If a session isn’t for you, for any reason, just walk out. Second, teach. You just spent a week getting data dumped into your brain. Teach it to your team. We made a pact that anyone who went off to an event had to present 2-3 sessions to the team from that event. You can even purchase the event DVD and show sessions to your team in meetings.

NOTE: This is not to say, steal these slide decks to become your internal training resource, unattributed to the original presenter. That is a bad thing.

My Retention

Who do you want to work for? The employer that says, “Heck no you can’t go to the PASS Summit. You’ll meet people and figure out that our company stinks and you’ll try to get a new job, or you’ll learn more and be more valuable and we’re not about to raise your pay.” Or, the employer who says, “Yeah, sure you can go this year. Let’s document what you’re going to learn and how it’ll help the company.” OK, it’s not going to be that easy. You may have to agree not to leave the company for a year or something afterwards. Be cautious about exactly what kind of strings get attached, but also be aware of the fact that the company is investing in you and would probably expect to get something for that investment. Just be sure it’s fair to both you and them.

I get it that some employers are smaller and just can’t foot the bill for this. See if they’ll meet you part way. You pay for the trip and lodging and they pay for the Summit, or vice versa. It can also be about timing. You’ve got a major software release that’s going to prevent you from going. I almost missed a Summit myself because of this. It’s just not always possible, but a good employer will find a way to make it possible, occasionally. If there is literally no support, of any kind, ever, you’re either working for a not-for-profit or, maybe, the wrong company.

I’ll Be On Call

Be on call. Carry the laptop with you. Keep your phone charged (ABC = Always be charging). Don’t enjoy the evening festivities too much (and yes, there are parties at the PASS Summit). Be a responsible employee. I’ve had to walk out of great sessions because of calls from the office. I missed half a day because of a failed deployment. But I was online and available, not falling off the face of the planet just because I was at the Summit. Make the commitment to be available as needed by your employer.

My Notes

Take lots and lots and lots of notes. You can type them into OneNote or EverNote or whatever. Or you can scribble them into your tablet or onto notepads. Anything that works. But write stuff down. Write lots of stuff down. Write down what you’re thinking about the information as well as details said by the speaker that may not be visible on slides or in code. Write down what you talked about with that lady from that vendor on the back of their card. Take notes while talking to the Microsoft engineer or CAT member. Then, turn the notes over to your employer. They act as an additional knowledge base about the event. It’s one more resource that you’re bringing back to your team.

Our Swag

Bring home a t-shirt or two for those people who couldn’t go. If there’s a particularly cool piece of swag, give it to the boss or have it as a raffle at the team training event for the best question. Share the stuff you get as well as the information you get. A friend of mine and I once collected 56 t-shirts and a stack of other swag (and had a heck of a time getting it all back on the plane) which we then spent almost two weeks handing out in the office to our team, development teams, managers and systems people, etc. It made us look good and cost us nothing but a little time on the vendor floor. It’s silly, but it works. If nothing else, it shows the boss that you’re thinking about your team and the company while you’re away.

My/Our Network

I talked about it at the beginning of this blog post. Network. That means not being “that person.” That person is the one who comes to the event, shows up for all the sessions, doesn’t ask questions or talk to a single person all day, then leaves and goes to their hotel room (and then usually goes home saying “Wow, that was a waste of my time”). There are large numbers of opportunities to network. Waiting in line to register, turn and talk to someone. Ask questions of the presenter during their session AND follow-up afterwards (although, let them get unplugged and out of the way of the next speaker). Go to the vendor floor where you should talk to the vendors as well as others. Attend the First-Timers event. Go to the Birds of a Feather lunch. Wear a kilt on Day 2 of the Summit (SQL Kilt Day, you’re reading the words of the founder of the event). Attend the Women in Technology Luncheon. Look up and track down all the places where people are getting together and talking. Go to them. Get together. Talk.

I’m an introvert (people laugh when I say it, but it’s true). I recharge with alone time, not at parties. I get it. But the PASS Summit is not recharge time. If you’re not almost literally crawling into the venue on Friday, you’re doing it wrong. The flight home should be the most relaxing plane flight you’ve ever had because you’ll pass out before take-off and wake up when the wheels touch down.

Take the time and trouble to begin to build your network. And remember, a network is not a series of authors or MCMs or MVPs that you can call. It’s a collection of people, some may be presenters/authors/etc., but the best are probably doing the same job you do but for a different organization. Talk to everyone. Build that network.

Conclusion

And that’s all I’ve got. Here is a different view from the PASS organization and another from Steve Jones. Yes, the learning and the networking should be enough for any employer, but these things aren’t always immediately valuable. So, try out some of the other strategies and approaches I mentioned. Explain to the boss this is what you’ll be doing. Come up with a written plan. Then execute that plan at the Summit. Your career is in your hands. You have to decide how and where you’re going to expand it. The PASS Summit is a unique opportunity to do just that, but you may need to convince the boss.

In case it worked, here’s where you go to register. And don’t forget about the free event SQL in the City taking place in Charlotte before the Summit starts.

Aug 21 2013

Speaking this Fall

I’ve been at home through the whole month of August, restingworking hard, but it’s time to get on the road again.

First up, it’s off to San Diego for two events. On Friday, September 20th, Red Gate (my Lord and Master) is hosting a free afternoon session of training. Steve Jones (b|t) and I will be walking through several sessions oriented towards the DBA and the skill sets they need to get things done. It’s a free event, training done the Red Gate way, so register here, get some free training, a little networking and finish up with a frothy beverage.

Then it’s off to SQL Saturday San Diego the next day, Saturday, September 21st. As I write this, they haven’t posted the schedule, so I’m not sure what session I’ll be doing, but come on down anyway, especially if you’re in the area for SQL in the City the day before (or vice-versa).

October, things heat up quite a bit. We’re taking SQL in the City on tour through the US again. Events will be in Pasadena on the 9th, Atlanta on the 11th, and Charlotte on the 14th. These are all-day events with a number of different topics relating to development, deployment and DBA work. And, it’s all done the Red Gate way. We regularly fully register these events and have to put people on waiting lists. So, two things. If you want to go, register now. If you’re registered and can’t go, please let us know so we can give your ticket to someone else. I love these events. They’re great for networking, learning, and we get a lot of input from people who are using, or even just interested in, Red Gate tools. I’m presenting three sessions, one on backups, one on database deployment, and another one… that I’m working on, OK? If you’re in Charlotte for the PASS Summit, we’ll be there Monday, so swing on by.

That leads directly into the PASS Summit 2013, October 15-18. What to say about PASS. Biggest, best, SQL event. Period. SQL Family, SQL Kilt, Pre-Cons, Sessions, Karaoke, Networking, Parties, Vendors. You know the drill. I have a full day pre-con on Tuesday before the Summit begins all about Azure and relational databases. I’m presenting three times during the summit, a spotlight session on Tuning Windows Azure SQL Databases, a session with Dandy Weyn (b|t) about being the DBA of the Future, and a group session with a bunch of dear friends on Lessons on Working from Home. I know this is an expensive proposition, but if you can go, you should. It really does build your skill set and your network.

Things slow down a little in November. Right now I’m speaking twice in Dallas. Once at the SQL Saturday there and once at a pre-conference seminar for that SQL Saturday event. This time we’re talking all about query tuning. If you’re in the area, please register.

And that’s not all. There’s more to come.

 

Aug 15 2013

Azure and Your MSDN Account

AstonI’ve heard over and over again that the reason people don’t want to learn Azure, to explore it, to understand where it’s applicable to them and their business (and where it’s not, because, it does have well-defined limitations) is because they don’t want to pay for it.

Fine.

Don’t.

Do you have an MSDN account? What? You do? Then you have access to a credit in Azure and an account with a spending limit. Your credit level is dependent on what level of MSDN you currently subscribe to, but who cares. You not only will NOT pay anything to play in Azure, you will NEVER pay anything. You can’t go beyond your limit.

Want some more incentive (not that you should need any now)? Cool. Set up your Azure account with MSDN and you can win an Aston-Martin V-8. Well, you could, but I’m entered too, so I’m going to win (I work for a UK-based company, the car is red, I’ve been working with Azure for a long time… come on, it’s mine). Although, you never know.

If you do get registered for Azure and then want to learn more, faster, especially the relational aspects of Windows Azure SQL Database and SQL Server on Azure VMs, I am putting on an all day pre-conference seminar on exactly those topics at this year’s PASS Summit. Register here.

We can look at pictures of my new Aston-Martin.

Aug 12 2013

You should NOT attend the PASS Summit

logo_headerIf you asked me, prior to today, if I would type or say those words, I would have laughed right at you.

But then, I saw this question on Ask SQL Server Central. It’s from a college student, not yet twenty-one, who was considering paying his (assuming it’s a guy since their handle is ‘Eagle Scout’) own way to the Summit and wondered if it would be worth it. It pains me to say that I suggested that he not do it.

Don’t get me wrong. I think the PASS Summit is probably the single greatest resource you have to advance your career. Where else can you go to get that broad a choice in training? Where else can you go to get that many of the leaders of our industry, specializing in all aspects of SQL Server and the Microsoft stack, sharing their knowledge? Where else can you expect to have extended networking opportunities with those same leaders and all our peers? Easy answer right? Nowhere. It’s a unique place and a unique opportunity. So I’m nuts for telling this guy not to go, right? I don’t think so.

This case is unique. We’re talking about someone who is not yet employed in this industry, still in school, paying their own way out to this event. Not just paying for the event, but for the travel, the lodging, the food… It’s not a small expenditure. In fact, it carries a pretty hefty price tag. Someone in his situation, I couldn’t, in good conscience, recommend it.

What about the rest of you? I’ve heard people say that it costs too much, that it’s not worth it, that you can get the same information for free online, that there are better teachers elsewhere. So you don’t really need to spend the money either, right?

Right. Don’t go. Don’t spend that money. Seriously…

That way, those of us who recognize the unique value this conference offers will have a major leg-up on the rest of you in the job market. Not that there aren’t jobs enough for everyone. But the better jobs, the exciting and interesting jobs, those are going to go to people who are investing in their future by learning more, networking more and striving to achieve more. In short, the PASS Summit attendees who show up and take part. I can’t tell you how many people I know that are currently working at what they consider to be their dream job that point to the PASS Summit as the single biggest factor in landing that job. I do.

Stay home. Read a few blog posts. Don’t bother networking through events like SQL Saturday or your local user group. Certainly don’t travel to Charlotte and sit through sessions by industry leaders in order to ask them engaging and pertinent questions that will immediately help you improve on your company’s bottom line by improving the speed, safety or accessibility of its data. Save that money. Save that time. After all, you’re already in your dream job. Right?

Aug 07 2013

24 Hours of PASS Streaming Available

If you missed the 24 Hours of PASS Summit 2013 Preview, you missed some excellent sessions. I watched a few, but not all. But now, thanks to the wonders of modern technology, I can go back and catch the ones I missed. You can too.

I’d like to call out my session, Be a Successful DBA in the World of Cloud and On-Premises Data. I know that my fellow DBAs and database developers are largely dismissive of Azure. I get it. But I really think you’re missing out on this. It’s another excellent tool in your toolbox that you need to start taking advantage of. Little things like getting a quick and easy installation of SQL Server 2014 or Windows Server 2012 R2 up and running. Helpful things like the ability to quickly prototype applications and databases. The possibilities just keep expanding in and around this tool set. You can take advantage of the tools you’re working on mastering too.

Please, check out my presentation from 24HOP. I think I address many of the issues that are keeping people from exploring and understanding this new and growing technology. And, if you want to get a leg up on learning it, I’m putting on an all day pre-conference seminar at this year’s PASS Summit in Charlotte. Check out and register for Thriving as a DBA in the World of Cloud and On-Premise Data.

Jul 01 2013

Getting Started With SQL Server 2014 the Easy Way

You know you want to at least take a look at the new Client Technology Preview (CTP) of SQL Server 2014. I don’t blame you either. I want to spend hours swimming through it too. But, you’re thinking to yourself, “Heck, I’d have to download the silly thing, provision a new VM, walk through the install… Nah. Too much work.” I don’t blame you. I found myself on the road the day the software was released, so I was going to attempt to do all that work on a hotel wireless system. In short, I was going to have to wait, no options. Or were there? Actually, there is a much easier option. Azure Virtual Machines.

And no, it’s not that I can simply get a Windows Azure VM ready to go faster than I can a local one (and, depending on just how I set up and maintain my local servers, that might be true). No, it’s that I can immediately get a copy of SQL Server 2014, no download required. It’s that I can, within about five (5) minutes have a server up and running with SQL Server 2014 installed and ready to go. How? Microsoft maintains a gallery of images for quick setups of Azure Virtual Machines. A couple of those images include SQL Server 2014.

VMIcon

To get started on this, and not pay a penny, you need to make sure that you pass the MSDN permissions listed at that link. I know that some people won’t, and I’m sorry. However, get your MSDN subscription set up and link it to an Azure account, then you’re ready to go. Throughout this post, I’ll refer to paying for Azure, if you’re running through MSDN, just insert, “using up my credits” for “paying” and it should all make sense.

First, click on the Virtual Machines icon.

VMNewThis will show a list of VMs on your account, if any. We’re going to add one, so we’ll click on the little plus sign in the lower left corner of your screen.

Clicking on the New button gives you options. Reading the screen you can tell that you have a list of different services that you can add; Compute, Data Services, App Services, Networks and Store. By default, if you’ve opened this listing from the VM list, you’re going to already have Compute selected. That provides a second list of options; Web Site, Virtual Machine, Mobile Service and Cloud Service. Again, if you’ve opened these options from the VM list you’re going to have the Virtual Machine selected. If not, make sure that is what gets selected. The final two options you have are Quick Create and From Gallery. For our purposes we’re going to use the Gallery, but let me first tell you what the difference here is. Your licenses for SQL Server, Windows Server, and most Microsoft products (so far as I know) are transferable between Azure and your on-premises machines. This means you can create an empty virtual machine on Azure and then load your software on to it. You don’t pay additional licensing fees. But, you can also use the images on the Gallery. Here you can set up a VM for whatever is listed and you get those machines and their software for additional cost, but no additional license required. In short, you can pay a little bit more to get access to SQL Server or what have you without having to buy an additional license. It’s a great deal.

VMOptions

Worry about paying for it all later. We’re going to click on the From Gallery selection. This opens up a new window showing all the different possibilities you have for your VMs. You can install anything from Ubuntu to Sharepoint to several different flavors of SQL Server. You can even add your own HyperV images to this listing (although that does mean paying for licensing on any created VMs). Scroll down until you see SQL Server 2014 CTP1. On my listing currently, there are two copies. One that runs on Wndows Server 2012 and one that runs on Windows Server 2012 R2. If you want a Start button on your screen, pick the second one. You’ll then be walked through the wizard to get this thing created. Click on the right arrow at the bottom of the screen after selecting a VM.

VMGallery

Now you need to supply a machine name. It needs to unique within your account. You’ll also have to pick the size of machine you want. This, and the size of the data you store, is what you pay for. You’ll need to decide how you want to test 2014, small or large. For my simple purposes, exploring 2014, I’m going with Medium. That currently means 2 cores and 3.5gb of memory. You can go all the way up to 8 cores and 56gb of memory, but you will be paying for that, just so we’re clear. You also have to create a user and password for the system. Strict password rules are enforced, so you’ll need a special character and a number in addition to your string.

vmStep2

You need to configure how this machine will behave on the network. You need to supply it with a DNS name, your storage account, and your region. I would strongly recommend making sure that your servers and your storage are all configured for exactly the same region. Otherwise, you pay extra for that extra processing power. Also, you may see somewhat slower performance.

VMStep3

Finally you have to, if you want to, add this server to an Availability Group. For our test purposes we’ll just leave that set to None. But, you can make this a part of an AG in Azure or with a mixed hybrid approach as an async secondary with your on-premises servers. Oh yes, the capabilities are pretty slick. I would suggest also leaving PowerShell remoting enabled so that you can take advantage of all that will offer to you in terms of managing your VMs and the processes running within them.

VMStep4

VMCreatingClick on the check mark and you’re done. You’ll go back to the VM window and at the bottom of the screen you’ll see a little green icon indicating activity. It will take about five minutes for your VM to complete. While it’s running, you can, if you choose, watch the process, but it’s a bit like watching paint dry. You’ll see the steps it takes to create your machine and provision it with the OS and SQL Server version you chose.

Once it’s completed, you’ll have a VM with a single disk, ready to go. But, you need to connect to it. Remember that user name and password? We’re going to use that to create a Remote Desktop connection to the server. When the process is completed, the server will be in a Running state. Click on that server in the Management Portal and click on the Dashboard selection at the top of the screen. This will show you some performance metrics about the machine and, at the bottom, give you some control over what is happening. The main thing we’re looking for is the Connect button.

VMConnectClick on that button. You will download an RDP file from the Azure server. Open that file (and yes, your system may give you security warnings, click past them) and you’ll arrive at a login screen, configured for your Azure account. That’s not what you want. Instead, you’re going to click on “Use another account.” Then, in that window type in your machine name and user name along with the password. Once you click OK, you’ll be in an RDP session on your SQL Server 2014 CTP1 VM. Have fun!

VMRDP

Remember, you can stop the VM when you’re not using and you stop paying for it (or, using up your MSDN credits). Just go to the dashboard and use the “Shut Down” option at the bottom of your screen.

If you found this useful and you’d like to learn a lot more about the capabilities of using Azure within your environment, I’d like to recommend you sign up for my all day pre-conference seminar at PASS 2013 in Charlotte. I’ll cover this sort of thing and one heck of a lot more about the future of being a DBA working in the hybrid environment of Azure and on-premises servers.

Nov 09 2012

PASS Summit Day 3: Dr. David Dewitt

Two quick points, I’m putting this blog together using the Surface.. ooh… and this isn’t a keynote, but a spotlight session at the Summit. Still, I thought I would live blog my thoughts because I’ve done it for every time Dr. Dewitt has spoken at the Summit.

Right off, he has a slide with a little brain character representing himself.

But, we’re talking PolyBase, and futures. This is basically a way to combine hadoop unstructured nosql data with structured storage within SQL Server. Mostly this is within the new Parallel Datawarehouse. But it’s coming to all of SQL Server, so we need to learn this. The information ties directly back to what was presented at yesterday’s keynote.

HDFS is the file system. On top of that a framework for executing distributed fault-tolerant algorithms. Hive & Pig are the SQL languages. Sqooop is the package for moving data and Dr Dewitt says it’s awful and he’s going to tell us why.

HDFS was based on a google file system. It supports 1000s of nodes and it assumes hardware failure. It’s aimed at small numbers of large files. Write once, read multiple times. The limitations on it are caused by the replication of the files which makes querying the information from a datawarehouse more difficult. He covers all the types of nodes that manage HDFS.

MapReduce is used as a framework for accessing the data. It splits the big problem into several small problems. It puts the work out into the nodes. That’s Map, Then the partial results from all the nodes is combined back together through Reduce. MapReduce uses a master, JobTracker and slaves, multiple TaskTrackers.

Hive, a datawarehouse solution for Hadoop. Supports SQL-like queries.It has somewhat performant queries. By somewhat he says that the PDW is 10 times faster.

Sqoop is the library and framework for moving data between HDFS and a relational DBMS. It seriealizes access to hadoop. That’s the purpose of PolyBase to get parallel execution access all the Hadoop hdfs. Sqoop breaks up a query through Map process. Then Sqooop runs two queries a count, and then reworks the query into a pretty scary query including an ORDER BY statement. This causes multiple scans against the tables.

Dr. Dewitt talks through the choices for figuring out how to put together the two data sets, structured and unstructured. The approach taken by Polybase is to work directly into HDFS, ignoring where the nodes are stored. Because it’s all going through their own code, they’re also setting up to text and other data streams.

They’re parallelizing access to HDFS and supporting multiple file types. Further, putting “structure” on “unstructured data”

By the way, I’m trying to capture some of this information, but I have to pay attention. This is great stuff.

How the DMS,the stuff used by Microsoft to manage the jump between HDFS and SQL Server is just flat out complicated. But the goal was to address the issues above and it does it.

He’s showing the direction that they’re heading in. You can create nodes and objects within the nodes through sql-like syntax. Same thing with the queries. They’ll be using the PDW optimizer. Phase 2 modifies the methods used.

I’m frankly having a little trouble keeping up.

It’s pretty clear that the PDW in combination with the HDFS allows for throwing lots and lots of machines at the problem. If I was in the situation of needing to collect & process seriously huge data, I’d be checking this out. The concepts are to use MapReduce directly, but without requiring the user to do that work, but instead using TSQL. It’s seriously slick.

By the way, this is also making yesterday’s keynote more exciting. That did get a bad rap yesterday, but I’m convinced it was a great presentation spoiled by some weak presentation skills.

All the work in Phase 1 is done on PDW. Phase 2 moves the work, optionally, to HDFS directly, but still allows for that to be through a query.

Dr. Dewitt’s explanation of how the queries are moved in and out of PDW and HDFS are almost understandable, not because he[s not explaining it well, but because I’m not understanding it well. But seeing how the structures are logically handling the information does make me more comfortable with what’s going on over there in HDFS.

I’m starting to wonder if I’m making buggy whips and this is an automobile driving by. The problem is, how on earth do you get your hands on PDW to start learning this?

 

 

Nov 08 2012

PASS Summit 2012 Day 2: Keynote

Welcome to Day 2 of the PASS Summit!

It’s been a very exciting event so far. Today I’m presenting two sessions, one on tuning queries by fixing bad parameter sniffing and one on reading execution plans. Please stop by, or watch the one on execution plans on TV as PASS is livestreaming events all day long on SQL TV (which is what I used to call Profiler).

The intro video, which can be good or goofy was really good this year. They had people from all over the world talking in their native language, making the point that the PASS organization is a global community. It really is.

Doug McDowell is giving us the finance and governance information for the PASS organization. I find this boring and vital at the same time. We need to know how this organization is managed, if we care about the organization. And since, let’s be honest, this organization has changed many of our lives for the better. I mean through the family we’ve met, the jobs we’ve gained, and just the knowledge that has been shared with us. PASS has doubled it’s expenses in two years in order to support all the stuff they do, SQL Saturday, Rally, 24 Hours of PASS, etc. It’s amazing.

We have three new board members, Wendy Pastrick, James Rowland-Jones and  Sri Sridharan. Congrats guys. You’re crazy for taking part, but thanks for everything you do.

Next up is Tom LaRock, another board member and a good friend. The PASSion awards are great. It’s the people who are doing, crazy sick work for the community. Mention goes to Amy Lewis and Jesus Gil. But the award went to Jen Stirrup. Well deserved. She is so active and so passionate. It’s amazing. It’s a well deserved win for her. Congrats Jen and thanks for all you do.

PASS Board members are gathering feedback from the community. If you have an idea, talk to a board member.

Don’t forget to attend the Women in Technology Luncheon. Men and women can attend.

Quentin Clark is now up for the Microsoft part of the keynote. We’re seeing a bunch of people talk about how great SQL SErver 2012 is. It really is great. He’s taking off on the concept of the data lifecycle. That’s a pretty interesting topic. He’s talking about how big data is getting both really, really cool and absolutely frightening. Hotels tracking guests within their building, coupons & ads based on the person standing in the supermarket, things like that. People are actually to the point where we can do things like this. It’s really cool. But wow, that is going to build out some seriously large data sets. The idea is to make gathering, interpreting, and sharing data easy, simple and very, very fast.

We’re starting off with data management. The combination between SQL SErver and Hadoop is pretty slick. It’s PolyBase, the new technology announced yesterday. But, please, presenters, don’t leave teeny tiny fonts up on screen while you talk. Zoom in. The room can’t see it. However, that information was very interesting. I like seeing how you can put these things together. Next up is discovering and refining data. We’re going straight into Excel. That’s the bad news. The good news, Access is dieing. YAY!

So the demo was poorly delivered, but very well structured. We got a good idea of how exactly we can do this with the new technology. There are lots of setup in the management area and in Excel to prep for  what they’re calling the ‘Ah ha’ moment. In other words, this is making your data more and more available, but the work to set it up is absolutely non-trivial. The structures get built out in really interesting ways, especially all the model work you’ll be doing in SSAS in order to prep this data. They’re showing how Azure marketplace hooks in. Once all of it is put together, an incredibly difficult task, you can really poke at the data with these new tools. It’s exciting stuff. It’s a shame that the presenters sucked all the life out of it.