I was also hoping to find something I could use to enforce a style guide on my code, because I know most editors do a poor job of this task.
Your first friend in writing great code is the compiler. Clang (the underlying compiler for the Mac and iOS platforms) tells you when you’ve done things wrong. In the JavaScript world we have to worry about whether we’ve provided the right number of arguments to a method, whether we have the order correct and whether we have provided the right object types.
This is easy to validate in most situations, but it gets harder once you reach a reasonably large code base, and harder again when you are changing code or the interfaces of classes.
As well as checking the syntax and general correctness of your code, the Clang compiler performs some static/semantic analysis of it.
There is even Xcode support.
Once you get past the Clang analysers you may still be hungry for more tools to keep your code clean. There are open source projects that provide linters/static analysers for Objective-C.
As far as linting is concerned I’ve found a couple of projects.
OCLint says it is:
"A static code analysis tool for improving quality and reducing defects by inspecting C, C++ and Objective-C code"
The project has been around since the 1st of January 2013 and, as of writing, has hit 0.10.2. It hasn’t had the most consistent of release schedules but does seem to be getting a lot of interest from the wider GitHub community.
It’s written primarily in C++, so it’s got the potential to be cross platform if you were in the market to run static analysis on non-Mac hardware, though I haven’t confirmed that yet.
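I haven’t run it in anger yet, but from the project’s documentation the command line usage looks roughly like the sketch below. The file name and flags here are illustrative from memory, so check the OCLint docs for the version you install.

```bash
# Lint a single file; compiler flags for that file go after the "--" separator.
oclint MyViewController.m -- -c

# With a compile_commands.json generated for the project, the bundled helper
# can lint everything in one pass.
oclint-json-compilation-database
```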
Infer is a tool built by Facebook which is described as:
"A tool to detect bugs in Android and iOS apps before they ship".
It’s relatively new (the first release being the 11th of June 2015). It’s written in OCaml and runs on a Linux box, but there are no guarantees about its performance on code bases outside Facebook.
The only real player in town for beautifying/enforcing style seems to be Uncrustify. The main aim of the project, according to its creators, is to:
"Create a highly configurable, easily modifiable source code beautifier"
I’ve heard great things about Uncrustify; the C++ community really seems to appreciate it, so I’m looking forward to finally being able to use it myself.
Swift is the future of iOS and the Mac. I’ve been experimenting a little with the language over the last while as well. Objective-C is still my main focus, but I get tempted by all the great things I hear on podcasts, etc.
As far as linters are concerned the two main players are:
Unlike the Objective-C linters I found, SwiftLint is itself written in Swift. The project was started on the 18th of May 2015. It’s based on the GitHub Swift Style Guide, so you know its opinions are based on some solid, hard-won experience.
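I haven’t folded it into a project yet, but getting started appears to be as simple as the following (assuming Homebrew; rules can then be tuned via a .swiftlint.yml file in the project root):

```bash
# Install SwiftLint and lint every Swift file under the current directory.
brew install swiftlint
swiftlint
```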
Tinker, tailor, soldier, spy… this linter, Tailor, is written in Java, which means it’s definitely cross platform. That opens up options such as running the linter in a pre-commit hook on a centralized source control system. The project had its first release on the 19th of September 2015, so it’s a little later to the game than SwiftLint, but both are basically spring chickens. I don’t know how well Tailor would integrate with a development environment though.
This is by no means an exhaustive list. There are many tools coming out every day to help with code quality (more for Swift than Objective-C, obviously). As time goes by, and as I discover better tools, I’ll likely write about this topic again.
Below are links to web pages/articles I used to create this article.
So it’s probably no surprise to discover that all my current iOS development efforts have been focused on Objective-C.
I know that in the long term, I’m almost certainly on the wrong side of history. I don’t think this would be the right choice for everyone. I’m just saying that I believe it is the right choice for me right now.
For me a lot of things stand out about Objective-C over Swift. As of today there is more production code in the wild written in Objective-C than there is in Swift; it’s had a 30 year head start after all. Sure, the tide is changing pretty quickly, but the expansive array of libraries/frameworks and tooling that exists is immense and not something to turn your nose up at. As well as anything else, the community is large and I believe there are a lot of really great people I could learn from.
If at any stage in the future I wanted to pursue iOS development in a serious way, like working with a team, there is a high likelihood that a real company’s iOS code base will be 99% Objective-C. It takes a long time for teams working on business critical applications to do something dramatic like switching programming languages. There are inherent risks to a decision like that.
A bit of research will show that there are nice ways of bridging code across the Objective-C/Swift gap, and I’ll definitely be exploring that in my own projects on this blog.
A big factor for me in learning Objective-C over Swift is that I feel, rightly or wrongly, that Swift will likely be easier for me to learn. It has more modern features and syntax that are familiar and similar to languages I’ve used in the past, whereas Objective-C is a bit of a departure. The unusual veneer hides a familiar OOP language, but I feel the learning curve for me is higher. I know that it’ll serve me in the long run to learn it and work with it now, while I wait for Swift to mature just that bit more, and for the community and tooling to come along just that bit more.
"But really the most imporant things you can do is to learn how the Apple APIs work. Once you do that, the language you write in is just syntax and highlighting."
This is a pretty short post. It’s a clear enough decision for me right now. But if you’d like to hear what someone with more experience has to say about it, watch the above video by iOS engineer Eric Miller.
It’s a doctor and they tell you it’s not good. I’m afraid it’s your worst fear. ‘We’ve done some ECGs, and they have confirmed our worst fear. It’s a ST elevated MI. I’m afraid we’ve caught it a bit late, our best people are currently in the middle of PCI. Oh, look at the time. I really must be getting back. Goodbye.’
Jargon can be useful. It’s a way of packing a complex idea into a tight space giving you the scope to connect high level ideas together. The only trouble with jargon is that everybody involved needs to be “on the same page” as it were (aware of the meaning of the phrase, the concept that is being conveyed and perhaps even be aware of what it does not mean).
In the case of the introductory passage above, MI stands for “Myocardial infarction” or more simply heart attack.
What I’m finding in my efforts to learn the iOS platform is that there is a vast body of jargon, concepts and ideas that I don’t know. Or at least, if I do know them, I don’t know them in a formal enough way to help me help someone else get the idea.
That is why I’ve decided that while I’m learning the iOS platform I intend to keep a log of the jargon that I encounter. I’m going to try and keep this Jargon Busting sheet up to date and explain items in a way that reflects how I understand these ideas and concepts. Hopefully my little project might be of some use to someone in the future.
Whether it’s my text editor of choice (Sublime Text) or a development environment (Eclipse, Visual Studio), knowing my tools well has always paid off. That is why, in my journey to understand the iOS platform, an important step is to learn about the tool that will assist me the most in that goal: Xcode.
I’m coming from the web, and I see a lot of people saying that IDEs are pointless/not the way to do things. I’ve said it myself. In previous positions I would always prefer a clean text editor with clean editing concepts over a bloated mess of an IDE any day. I even debated the point of view that having an IDE made doing the wrong thing too easy (Eclipse Java auto-complete, anyone?). I do sometimes feel that way, but I’ve seen badly written Java done using text editors as well as badly written Java using IDEs.
I’ve had more experienced programmers explain to me what they believe the true power behind an IDE really is. An IDE exists as a way of treating code not as symbols in a UTF-8 text file, but as living, breathing data that you can manipulate and interact with. An IDE is an extension of the compiler/interpreter that understands your language. The compiler/interpreter spits out ASTs (abstract syntax trees) of the various parts of your program, and the IDE can consume them and use them to tell you deeply powerful things.
Xcode understands the relationship your code has to the rest of the underlying framework. It can show you the right information at the right time to help you make the best and most informed decision.
Arriving into Xcode is overwhelming at first, as it can be with any reasonably complex development environment, but after a little while I started to get a sense of what was what and where was where. There are a few main sections from what I can see:
On the left hand side of the Xcode interface are the navigator tree interfaces. These are a collection of tabbed views that give me a high level insight into my application. The first and most important one is the Project Tree Navigator.
It does what you’d expect. It reads the structure of my code on disk and displays a tree to represent it. It provides me with a right-click context menu for file and folder operations.
I noticed that on the bottom of the navigator view you get a filter component which filters the tree to show you just the files you want, which is a very useful feature.
There are a few more navigators available in that view. I probably won’t find them useful at first, but here is a list and a brief explanation:
Just a side note on Xcode and projects: even though it appears that I’m moving things around into folders in Xcode, it doesn’t actually do that on disk. It only does that grouping inside the Xcode project. My actual files will remain scattered around the main folder like I never organized anything.
I did happen upon a SO question which clears up how to approach this issue, as I’m sure that I’m going to want to interact with the code outside Xcode (through the Git command line, etc.) as well as inside it. It does seem a bit mad that it wouldn’t do the grouping on the file system as a way of representing these groupings, but I’m sure there was an original rationale behind it that made sense to the development team.
Of all the pieces of Xcode I’ll need to understand, the editor area is the most important. It is where the ideas stream out of my head and into reality. The interface is split between its two main aims: writing code and creating UIs.
As a text editor Xcode is pretty okay. I’m still getting used to it. I was quite frustrated at first, because I had gotten so used to the incredibly advanced features of tools like Sublime Text that I found it quite slow to create anything, but I’m sure I’ll get the hang of it soon and I’ll share those insights on this blog at a later date.
I liked the fact that the key bindings are all completely configurable, which is handy, and it seems that you can import and export an XML document with those bindings. I’ll probably start to tweak these to something more suitable to my particular muscle memory over time. The other large element of this part of the editor seems to be the breadcrumb.
The breadcrumb is useful in a number of ways. Firstly it lets me get an outline of the specific file I am working with. It exposes object hierarchies if I am using a file which inherits from another object, or perhaps implements an interface/protocol of some kind. It also shows logically connected files, like interface and implementation files if you are talking about Objective-C, or a Storyboard view and a Swift file.
There are of course the usual features you’d expect here too, like code folding, line numbering, comment toggling, etc.
Interface Builder is an alternative editor view that does much more than designing interfaces through dragging and dropping components. Here are some of its responsibilities.
There is clearly a lot going on here, and I’ll probably be digging deeper into this at a later date.
Like every application and every home, there is a place where the little bits and pieces must live; the utility area is that place for Xcode.
This is where various property dialogs are displayed for the file which is in focus (typically); items such as UI settings, styling, sizing, etc. All of the various contextual property elements find their way into this area.
The library pane is where I can do a lot of different little things. There are various sub-sections such as:
The next section of the editor that I said I’d look at is the toolbar. What I find interesting about the toolbar is that it takes a few design nods from iTunes. You have large controls to the left and right that control an overall operation, while in the center you have a currently active operation/progress bar. It’s a nice touch and makes it easy enough to understand the intents of the toolbar at a glance (if you’ve previously used iTunes).
Its primary role would be context switching: running your application, stopping it, and changing the current editor type.
The main run controls allow me to kick off my app in simulator/emulator mode. It even allows me to choose different target devices to run against (iPhone, iPad, etc.). Being able to switch these quickly seems quite handy if I were targeting multiple devices.
The view buttons allow me to move between different editing contexts.
The final elements in the toolbar are the pane toggle buttons. These are a quick and convenient way to hide unnecessary parts of the UI. In the version of Xcode I’m using (7.1) the buttons give a visual impression of the pane section that they hide.
Finally there is the debug output area. I’ve done some debugging and seen the stack trace there, I’ve played with inspecting variables/objects and done some print debugging. It’s pretty much what I’d expect.
Finally there are the advanced features that it’ll be a while before I really get my teeth into.
There is a whole build system built directly into Xcode. I must properly investigate this before writing about it. I don’t like build technology that I can’t automate. If I can’t produce a set of build steps that I can then run on some sort of continuous integration server (at the very least as a nightly task, perhaps under a different user account) then I’ll look at some other method to build my applications.
Xcode is a vast, vast tool, just like any IDE. As I get more experience with it I’ll be able to write posts that go into depth about different elements of its functionality.
The following are the various links I referenced to write the article above. Any links I used throughout the article can be found here.
To help me get to grips with learning a new platform I like to compare and contrast it to one I already know. That way I end up having a kind of base to build off of, and it helps me internalize some of the details a bit quicker. So the question I asked myself was: how does iOS compare to the stack I’ve been using these past several years? The browser.
So before I started I wrote down everything I know about the browser and its architecture as a starting point for myself.
I have to admit, when I started sketching out what I knew I noticed a few general gaps in my own knowledge. Many specific details about how events/interactions occur were actually a little bit of a mystery to me. I happened upon a blog post by Tali Garsiel and Paul Irish on HTML5Rocks which breaks down the browser and its internal structures really well. A lot of the detail is too fine grained and unnecessarily dense for my purposes here, but it’s actually quite valuable to know. For instance, if you never understood how the layout/computed style system works (dirty bits) and how changes to the DOM can ripple out pretty dramatically, then you might have questioned the point of that whole React Virtual DOM business. You might also question your own web app keeping its application state so tightly integrated with (or even stored in) the DOM in the form of data attributes/classes, etc.
To save you a trip over to Tali and Paul’s article here is a quick break down:
So that is basically your client platform infrastructure. Obviously, a web application typically has a server-side component, which could include Apache, MySQL, PHP, Java and, even now, JavaScript. I won’t go into that right now for the sake of clarity.
The type of work I do as a web developer is to:
So as such the question I’d have for the iOS platform is:
So now that I have a general sense of the architecture I typically use, and questions to help direct my exploration, I wanted to see which elements in iOS map to these browser elements.
There are a few layers here, and none translate perfectly to the browser architecture, but you get the idea. As you go down through each layer, you are getting to lower and lower levels of abstraction until you hit the kernel (which you have almost no way to interact with). There is a lot to unpack here, so let’s get started.
Cocoa Touch is a high level set of APIs in which developers seem to spend most of their time. This is where a lot of the high level APIs that help developers get things done are located. There are a few key/famous APIs at this level that are really important, UIKit being an example of one. There are also mechanisms exposed at this level to help developers do things like multi-tasking, laying out their application, designing the application to be responsive on multiple screen sizes, handling complex user interactions through gesture recognition, capturing application state and interacting with other applications.
In a sense the Cocoa Touch layer represents a lot of different elements within the browser rendering engine and within the web application frameworks that you might use. It does not translate directly to any element within the basic browser architecture, which isn’t surprising.
The Media layer is where a lot of the more complex graphics, rendering and audio related APIs reside. If you’ve even had a passing interest in Apple announcements you might have heard of one API within this layer called “Metal”. Metal is supposed to be a fairly low level graphics API suitable for use by games engine developers and people who make graphically intensive applications.
The only experience I’ve had with doing low level drawing on the web was with Canvas and help from a few different JavaScript libraries. The kind of things I did there would probably happen at this level.
Other pieces in the media layer include ones centered on audio & video, to allow you to interact with the camera/video recorder and microphones of an iOS device. As far as the browser is concerned a lot of the APIs that expose the same type of functionality come from the “UI Backend” layer. You might have a high level binding in JavaScript to allow you to access the web cam (WebRTC for instance).
Core Services is as low as most applications get. This is where you find one of the most fundamental iOS APIs, the “Foundation” API. There is a related piece of the architecture called Core Foundation, which is a C based API and collection of data types that developers rarely need to go near. Foundation, on the other hand, is an Objective-C (and Swift) based API.
Other services at this layer are:
The Core OS is the deep dark bowels of iOS. Specifically, this is where you would go to get fine-grained control over data from the Bluetooth radio, various security related services, the Network Extension framework (for working with VPNs) and a variety of little system APIs. There is also a very interesting set of APIs for digital signal processing (DSP) and other complex mathematical operations. If you wanted to get really heavy into Instagram filters this would be the place to go :P.
So after a good bit of digging into the Apple Developer docs I still felt a bit mystified as to the questions I set out with, so here is a breakdown of what I’ve discovered.
The native languages that I’m going to be using when developing anything in iOS seem to be:
There is also a JavaScript VM available to use as part of the overall iOS architecture. But I’m given to understand that it can be quite complex to use and to interact with the rest of the system… and anyway, you’re here to learn something new, not get things done :P.
Rendering is largely taken care of by the OS through a set of drawing systems called Core Graphics (also known as Quartz). There are many layers built on top of Core Graphics which abstract away the complex details found there. Ultimately, if all I want is a button I may never need to go too deep into the rendering engine/APIs. In the browser application world we are more concerned with rendering because it can have a real impact on application performance. Yes, this is a concern for iOS developers, but not in the same way as it is in the browser.
The primary network API is called NSURLSession; this is exposed in the Foundation framework (Core Services). If you intend on contacting third party web services, or say your own server backend, this is where you might want to go. However, if you were making an application that primarily keeps its state locally on a device, and you were concerned with backing that data up somewhere, then you could look to Core Data and the iCloud infrastructure, which are found in Core Services, instead of building out a whole crazy backend for yourself.
Events and user interaction are baked into UIKit and other frameworks at the Cocoa Touch level. As I mentioned previously there is a gesture recognition system found there, so I expect this to be quite straightforward to use. As for the more nuanced questions of how I bind to certain events, I might be better off talking about those in another blog post.
In a sense there sort of is. If by DOM you mean a method to structurally describe your application/views, you are probably talking about the UI system called Storyboards. A Storyboard is a WYSIWYG UI building system which allows you to describe the different elements of your application on individual pages, and even how they connect together. So in some sense this is like writing web pages and creating hypertext references between them.
Again, there does in fact seem to be a way to achieve this. By CSS I’m probably referring to a means of having fine-grained control over the styling/theme of my application. There is an API infrastructure called UIAppearance that seems to allow me to have a centralized location for my application’s styling configuration.
There are even libraries that have emerged that build on this infrastructure and make it more abstract by allowing me to define my rules in JSON, and even something remarkably like CSS.
One of the other concerns of CSS is positioning. The technology that exists to handle that is probably Auto Layout. Auto Layout is a layout constraint system built into Storyboards which seems to allow me to describe, in general terms, how I want various UI elements to be positioned relative to the window size and other components. In effect this is how you achieve responsive applications that work on many different screen sizes.
I’m still at the beginning of my journey with iOS. I’ve bought a Mac Mini (which I will talk about in the future), I’m starting to get familiar with the infrastructure and architecture of the system, and I’m hoping over the coming weeks and months to share all the ups and downs of my exploration of iOS.
The following links are the various references I used to write the article above. Any links I used throughout the article can be found here.
One of the key questions you might ask yourself before embarking on an audit of your code is: what is the point? Well, in my view things like copy/paste programming are a code smell that indicates deeper issues. Why are programmers copying code rather than reusing it, for instance? Do they lack the skill or knowledge to do proper code sharing? Do you need to invest in training? Perhaps you are missing some sort of design layer and this is the only way programmers have found to cope? Perhaps your team is too rushed to do proper design and you might be running headlong into big problems in the medium to long term?
Ultimately, for me that old expression “information is power” really captures it. A business wouldn’t go long periods without auditing its finances, or measuring the performance of its sales staff. Even the idea of evaluating employees is commonplace, with quarterly/yearly reviews. Why don’t we scrutinize our code as keenly as businesses do their employees? The more data we have to hand the better decisions we can make, or at the very least we can see what decisions we are clearly not making.
First you need the data. If you are using source control (please tell me you are using source control…) then this process should be quite straightforward. I’m working on a project at the moment, a small WordPress site, and I happen to have the WordPress source on my machine. So why don’t we have a look at that code base?
The version I’m using is hosted on GitHub so I’ll be using Git to handle getting the right code. I’ve done this analysis with SVN as the VCS in the past and it’s quite straightforward to get code out of that too. Either way you’ll want to loop over a range of some sort. In my script I’ve hard-coded the tag versions of the code base to check out, but if you were checking out revision numbers, or were thinking about incrementing a date, a great bash command to use is seq. It’s a great little tool for producing number output based on a few parameters; using it with a for-in loop you have an easy looped range to work with.
In my script I use a sub-shell to perform the checkout/update. There isn’t really any reason to not have jumped into the working directory once and stay there for the duration of the work, but I wanted to keep my files outside of the source code area I was analysing because I didn’t want anything to happen to those files when jumping between versions/tags.
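A condensed sketch of that loop is below. The paths, variable names and tag list are illustrative rather than the exact values from my script.

```bash
#!/bin/bash
WORKING_DIR="$HOME/code/wordpress"    # the repository being analysed
OUTPUT_DIR="$HOME/analysis/wordpress" # results live outside the source tree

for TAG in 3.0 3.5 4.0 4.5; do
    # Sub-shell: jump into the repo, switch to the version under analysis,
    # then fall back out so the analysis output never sits inside the checkout.
    (
        cd "$WORKING_DIR" || exit 1
        git checkout "$TAG"
    )
    # ...run the analysis tools against $WORKING_DIR and write to $OUTPUT_DIR...
done

# If you were stepping through revision numbers rather than tags,
# seq generates the range for you, e.g.:
#   for REV in $(seq 1000 50 2000); do ...; done
```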
With this WordPress analysis, and other ones I’ve done, I’ve needed the date corresponding to the tag/revision I’m analysing. To do that my script uses the following style of bash command, relying on the assumption that we have just checked out the version of the code base we are about to look at. It gets the top commit from Git and greps out the date and commit SHA.
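Something along these lines (a reconstruction of the approach rather than the exact line from the script): the default git log output starts with a “commit <sha>” line and a “Date:” line, so one grep pulls out both.

```bash
# Date and SHA of the commit we have just checked out.
git log -1 | grep -E '^(commit|Date)'
```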
In SVN you can do the same with svn info, which presents information about your local working copy. If you grep for the string “Last Changed Date:” then you should have just the line with your working copy’s last revision date on it. I’ve additionally included the sed command that will remove the words “Last Changed Date:” to leave you with the pure date information.
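Roughly, again as an approximation of the original line:

```bash
# Last changed date of the working copy, with the label stripped off.
svn info | grep "Last Changed Date:" | sed 's/Last Changed Date: //'
```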
CPD’s mission statement is explained on its website.
"Duplicate code can be hard to find, especially in a large project. But PMD’s Copy/Paste Detector (CPD) can find it for you!".
And it is exceptionally good at doing just that. The great thing about the CPD tool is that it tokenizes the language you are scanning and looks for duplication at that level, so any kind of difference in code formatting, comments, blank lines and so on does not throw it off. Renaming variables or method names, or subtly changing an algorithm, would throw it off though, so those lines will not show up in its output.
Once you have a snapshot in time of your code base you’ll be able to run the CPD tool across the code. Sometimes you might need to exclude certain directories for various reasons, such as not analysing 3rd party libraries or other parts of the code base that are not relevant. In the script I’ve included an “EXCLUDE_PATTERN” concept that allows you to exclude a directory in this way. It’s limited to excluding a single directory right now, but it’s easily extended.
There is a large array of languages that CPD supports. I usually like to try out the scanner a few times and inspect the kind of duplication it finds. In some JavaScript projects I’ve analysed I’ve felt 75 tokens was about right, but for WordPress/PHP it seems you’re best off ramping that up considerably. I’ve put the bar all the way up to 200 tokens for this analysis.
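For reference, the CPD step ends up looking something like this. The run.sh wrapper ships in the PMD distribution’s bin directory; the paths and output file name here are placeholders.

```bash
# Copy/paste detection over the checked-out snapshot, at the 200 token threshold.
$PMD_HOME/bin/run.sh cpd \
    --minimum-tokens 200 \
    --language php \
    --files "$WORKING_DIR" > "$OUTPUT_DIR/cpd-$TAG.txt"
```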
The kinds of graphs you can get out of this analysis are interesting. Here we can see that the raw quantity of duplicated code seems quite large and is growing fairly rapidly in the WordPress code base. When you look at the raw data, however, you find that a lot of the duplication that does show up is really just a few lines here and there; the kind of thing that is difficult to avoid or even pointless to address. The key thing is to check out a sample of what is reported by CPD and see for yourself whether the figure is really as bad as it seems.
Graphing the duplication versus the total amount of code is a better way to gauge things. To get the figures for the total amount of code in the system I use a tool called cloc. The type of thing you can find with these figures is whether duplication and the code base are growing at the same rate relative to one another (which is what you’d expect/normally find), or whether the rate of duplication is clearly getting out of hand. Finally, with this tool you can get the data that allows you to calculate the percentage of duplication versus the overall code base.
Cloc is a great tool for giving you insight into the amount of different languages in a code base. What I like about it is that when it parses the code it can calculate the number of blank lines, comments and then the total amount of code. Instead of a plain “you have 100 lines of code” you can find out you have 60 lines of code along with a bunch of comments and blank lines. This is useful information.
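The cloc step is a one-liner; CSV output makes it painless to pull into Excel afterwards (the file names are again placeholders).

```bash
# Per-language counts of blank lines, comments and code for this snapshot.
cloc --csv --report-file="$OUTPUT_DIR/cloc-$TAG.csv" "$WORKING_DIR"
```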
Here is a straightforward graph showing the growth of different languages over time. Obviously PHP was going to win out over everything else, and perhaps it’s not surprising to see that there is more CSS than there is JavaScript in the system. The really interesting thing for me is that you can see SASS is only really getting a foothold in the project, and there don’t seem to be any big strides to move the CSS over to SASS at the moment.
One of the sillier things you can do with this data is to reason about the future of the project. For instance, here you can see the number of languages that make up the source code and a graph projecting the potential for new ones over time. If these numbers are anything to go by, WordPress will consist of 8 different languages by about 2026 :P.
I’ve had this script kicking around for a little while, and recently I added something to it called top ten files. The basic idea is that I can get an output of the top ten largest files (based on raw number of lines). The length of files can be a code smell like anything else, and I figured it was an interesting figure to graph either way. In the case of WordPress you can see that the size of the top ten files increases with each version, but they stay roughly the same length, except for one outlier in version 3.0.
The full script, warts and all, runs to about 167 lines. There are plenty of issues with it, but it was something I put together in a few minutes, so I’m not super concerned by its quality. There are a few main components to the script, which the condensed sketch below walks through.
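This sketch only shows the shape of it, pulling together the pieces covered above; the variable names, tag list and file-size step are illustrative stand-ins rather than the original lines.

```bash
#!/bin/bash
WORKING_DIR="$HOME/code/wordpress"
OUTPUT_DIR="$HOME/analysis/wordpress"
MIN_TOKENS=200

mkdir -p "$OUTPUT_DIR"

for TAG in 3.0 3.5 4.0 4.5; do
    # 1. Check out the version under analysis.
    (cd "$WORKING_DIR" && git checkout "$TAG")

    # 2. Record the date/SHA for the snapshot.
    (cd "$WORKING_DIR" && git log -1 | grep -E '^(commit|Date)') \
        > "$OUTPUT_DIR/meta-$TAG.txt"

    # 3. Copy/paste detection.
    $PMD_HOME/bin/run.sh cpd --minimum-tokens "$MIN_TOKENS" --language php \
        --files "$WORKING_DIR" > "$OUTPUT_DIR/cpd-$TAG.txt"

    # 4. Line counts per language.
    cloc --csv --report-file="$OUTPUT_DIR/cloc-$TAG.csv" "$WORKING_DIR"

    # 5. Ten largest files by raw line count (wc's grand total appears first).
    find "$WORKING_DIR" -name '*.php' -exec wc -l {} + | sort -rn | head -n 11 \
        > "$OUTPUT_DIR/top-ten-$TAG.txt"
done
```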
Inside of Excel I used a few basic formulas after I got all of my data into it.
First, since I had my columns of data about duplicated code, I wanted the number of lines of code that could theoretically be deleted or at least redesigned properly. When duplicate lines are presented in CPD’s output, it displays the chunk of code and all the places it shows up. This includes the location of the single representation that you potentially want to keep. No problem, as long as you account for that in your formula.
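In other words, for each duplicated chunk CPD reports (a worked example with made-up numbers):

removable lines = lines in the chunk × (occurrences − 1)

so a 20-line chunk that shows up in 4 places contributes 20 × 3 = 60 lines that could theoretically go.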
Once you have a column of these figures you can sum up all those totals and have an overall total amount of lines (per version/revision) that might need immediate attention.
When you do this analysis a few times you start to loathe scrolling. Some of the data sets I end up with are 15K+ lines long (when you concatenate them all together). That’s a lot of scrolling, so when I output my data I make sure to measure the size of the data so I have a series of offsets for each file. Then, using Excel’s INDIRECT function, I can do a SUM on a range of data without having to scroll up and down on it.
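As a rough illustration (the column letter and the cells holding the offsets are made up for this example, not the layout of my actual spreadsheet), a formula like the following sums column D between a start row held in H2 and an end row held in H3:

=SUM(INDIRECT("D" & H2 & ":D" & H3))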
With this formula I am able to SUM big chunks of an Excel file without having to scroll. I just need to calculate the start and end position of each set of data, which is easily done using the offset data I collected during analysis of the source code.
If you have the total amount of code in your system and the total duplication it’s natural to want the percentage duplication as well. With some basic math we get a value that represents 1% of the total overall code in the code base and then we divide that figure into the number of detected duplicate lines of code. Now you have the percent duplicated lines of code in your system.
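A quick worked example with made-up numbers: if a version has 400,000 total lines of code then 1% is 4,000 lines, and 12,000 duplicated lines divided by 4,000 gives 3% duplication.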
In past analyses I have sometimes ended up with a number of pieces of data starting at different points in time. At first you’d think this should be easy to graph: just grab the series of data and Excel will figure it out. But that can result in weird, incorrect graphs where everything starts at the same time. The way to handle this situation is to arrange your data properly. I didn’t figure this out on my own; I landed on a blog post by an author called Jon Peltier who gives a good step by step guide.
The basic thrust of the article is that the graph component is very smart when it comes to dates, but you have to organise your data in a certain way for it to figure things out. So if you have two or more stacks of data starting at different points in time, arrange them like so.
Your dates will all be in column (A) and your corresponding data in the other (three in my example) columns. When you go to graph the data you have something pretty decent. The only trouble then is gaps.
To get over this, right click on the graph and you should get the option “Select Data” (in Excel 2013 anyway), and on this new modal you’ll have a button labelled “Hidden and Empty Cells” which allows you to set the behaviour for gaps that appear in the data set. Choosing “Connect data points with lines” should help resolve the broken graphed data.
After doing this analysis I was curious about other approaches I could have taken. I did a bit of Googling and found some interesting tools that could have helped me do this even quicker.
CSVKit is a great tool for working with CSV files at the command line. The work I did in Excel for calculating the number of duplicate lines could have been done using it. I might have just ended up with a smaller set of figures and less work in the Excel document overall.
I recently tried StatSVN against a code base I’ve been working on, but found that it was very slow and eventually blew up in the process. Because of the centralized nature of SVN, network traffic and load on the server can affect your ability to get the log data StatSVN needs. The last time I used this tool I had tried to index and analyse everything in the repository (which was very, very large), which would explain why it would have fallen over. If you are using it on a very large code base, it’s probably best to get it to work on a subsection of your code base rather than the whole thing like I did.
Sometimes you are in a position where it doesn’t really make sense to invest in a big tool/infrastructure or a large amount of time to analyse your source code; a side project, for instance. Much of the time, gathering a little data can get you 90% of the value a big system like SonarQube could. But sometimes you do need those systems, and when you do, you should absolutely set them up.
The advantage of sitting down and dealing with numbers in this kind of primitive way is it gives you a sense of your code base, it makes you curious and want to find more involved and interesting figures.
If you’d like to see some of that data I used to create my graph above, feel free to download it and inspect it for yourself.
After a good bit of thought I figured it might be helpful to break the question down further. Documentation is something that is shared between all the stakeholders of a project. Those could be programmers, “Product Owners”, requirements engineers, architects, project managers, CTOs, CIOs and software division heads. They could also be other departments like QA, marketing, localization, services/support and release/delivery.
Given the breadth of people who are affected by these assets, it would seem important to decide on the right tools and process for handling this information. What’s important then is establishing what your priorities are.
My personal experience with this is keeping high and low level design documentation in a Git repository. The solution at the time was a Rakefile (Rake is a Ruby version of the make build system) that was designed to pick up the project docs (written in Markdown), recreate the directory structure in an output folder and render all the files into HTML within this output folder. Things got messy when there were images and other assets that needed to come along for the ride, and linking between documents was a bit of a nightmare. We used Jenkins to build software, so on each build we’d create our docs, which we could then get at through a URL, but we had massive problems with that later when our Jenkins setup changed slightly (all our URLs got messed up).
We went through a variety of issues with the docs, such as getting access to them for editing, because you’d have to get access to the repository. You’d sometimes end up in situations where you’d have to merge them (which wasn’t that bad), but mostly people weren’t happy with Markdown, partly because we were using the Kramdown rendering engine and none of the side-by-side rendered Markdown editors used that renderer, so the document they were working on didn’t render the same on our servers, which also had custom CSS.
If I had to assess the way we did documents on that project, I would say it sort of worked but needed a lot of attention to get all the issues ironed out (which didn’t happen while I was there).
Measuring up that set-up with the goals above:
I started to think about the problem and reasoned that what might help would be using something other than straight Ruby and Kramdown to produce the rendered output, and something other than Jenkins for serving the documentation. The following is a list of projects that seemed promising.
You can see that I was leaning towards wikis and even a full-on Git web interface. I think I was beginning to understand all of the different goals and needs of a good documentation system, and that is the reason I was heading that way.
As I studied the options more and more I realised why it is difficult to use your own code RCS for your docs. Keeping your docs beside your code base made it difficult for other team members to create or contribute to them. The tools for writing the documents were also quite alien to them. People are used to using things like Word or even MediaWiki for creating documents. They don’t really know Markdown, Git, SVN and so on, and definitely don’t want to use Notepad (let’s face it, we can tell them to use Notepad++ or Sublime Text all we want; it’s just not important to them).
The success stories of docs beside code are usually open source projects. One of the main reasons I believe this is the case is ease of use. Open source projects are usually about the code. Documentation is something that helps to explain the design and provides examples. If there is an API then documentation explains how to use it, and some of the documents will be partly (or entirely) generated from the code itself. So in that case the primary concern is collaboration between the different programmers working on the code base.
The trouble is, in an ordinary workplace the main aim of documents is collaboration between the technical, less technical and non-technical staff.
Microsoft Office rules the commercial world. Many people are very familiar with it and know how to use Microsoft Word, which is probably why you will often find most documentation being written using that tool. Given that, you’ll often find Word + SVN/SharePoint as a fairly common pattern in the commercial software world. There are probably plenty of places that just have a bunch of Word documents on a network drive (this is just plain bad because of the lack of traceability).
These kinds of approaches are fine, and Word has come a long way over the years. You can even add annotations and in-line comments to documents (not in real time as far as I know). The trouble is that Git doesn’t version binary files very well. I know that SVN does have ways of working with Word documents, which is why you often see the two together, but we were using Git so Word didn’t make sense.
In the end, I don’t think our experiment with keeping docs next to code worked very well. Keeping, accessing and using the documents that were kept in source was too difficult. Having a better system on top of those docs, like the ones I enumerated above, would probably have gone a long way towards solving that problem, but I’m not sure. My advice would be to be careful about that decision; it could turn out to be more trouble than it’s worth.
As the user bstpierre noted in the Stack Overflow question, “What Part of Your Project Should be in Source Code Control?”:
"Project documentation is cumbersome to maintain in a source control system. Project docs are always ahead of the code itself, and it's not uncommon to be working on documentation for the next version while working on code for the current version. Especially if all your project docs are binary docs that you can't diff or merge."
Below are a series of articles and tools you might find useful when deciding to take on this challenge.
Say you are tasked with developing a feature. It’s pretty big and could, by your estimates, take anywhere from 5 to 6 weeks. At this point you make a calculated decision (more on this later) to take a short-cut and get the feature out in 2 weeks. Those 3 or so weeks would be considered technical debt.
To put it in simple terms: technical debt is any decision (inadvertent or deliberate) that results in a lower quality code base or, more importantly, makes it harder to maintain the code base and build future features. When you take a short cut, everyone who has to interact with that short cut takes a hit on productivity, in either understanding or usability; that is the interest payment, if you will.
For a more thorough explanation it would probably be worth hearing what the man who conceived the metaphor thinks, the following video is pretty short at 4:44 so have a watch/listen.
The main take away from the video for me is the need to manage and understand debt. The trouble I’ve seen in my career to date is with the inadvertent type of debt that projects usually end up with.
When Ward conceived the debt metaphor it was a way of explaining to his managers, and to the business people he worked with, why one would refactor a code base or module that, for all intents and purposes, worked, was fine and was probably already in use in the field. To Ward it was about putting back into the code the understanding and learning you gained from the process of building the product.
It’s clear from that explanation that Ward is a conscientious and professional programmer. Given that, it would seem to me that he probably has design and risk management at the forefront of his mind. For a lot of developers out there it might feel like that’s not the case where they work. Martin explores the metaphor a bit more thoroughly for the rest of us.
The main element of Martin’s musings is the concept of the Debt Quadrant.
I think this keenly describes what is felt in most projects. Martin’s career and writings are based mainly on the principle that design is good and helps you to move faster, and that is what he explores here.
What is interesting about Martin’s quadrant is where certain concepts fall. Does the idea of “move fast and break things” lie in deliberate & reckless or deliberate & prudent? What if you solved a design problem with a lot of inheritance where a visitor pattern would have been more appropriate? Were you being reckless by not being fully versed in all the design tools you needed? Is there a base level of design you should have had before being given the helm of that project?
The main distinction is probably what you do after you ship. In my view a “prudent” decision becomes a “reckless” one when you don’t face, or even admit, the consequences. Let’s say you make that decision to knock three weeks off the schedule. Where have you tracked that decision? Have you informed your team mates or colleagues about it? If you have a bug/task database, have you logged that decision to be addressed at a later date?
Thankfully in my life I have not had the burden of a lot of financial debt. I live in Ireland; we have “free” education, which comes with some costs, though nothing like North America and elsewhere. As such I’m a touch averse to the idea of debt. I save up to buy things like cars and televisions where many people use loans and financing options. I have values by which I live my life. I’m a bit different to most people.
Given this, I am trying to understand the need for and point of debt. Debt is a tool, and like any tool, when used correctly it can allow you to do things you could never achieve without it. But like any tool it can very easily turn into a deadly weapon. One of the main causes of business failure is cash flow; guess what can impact cash flow? That’s right: loan repayments.
Steve McConnell is a very smart guy and once said:
"I think there is some non-zero amount of technical debt that is okay from a business point of view for the project to take on."
This is a tough pill to swallow for most programmers, and Steve even had a potential explanation for this perspective:
"A lot technical staff have been burnt by implicit debt often enough that they have gravitated towards to the position that the only good amount of debt is zero"
And once again I think this comes down to one’s perspective on their work. Firstly, are you explicit about the decisions you are making? Are you explicit about the short cuts you are taking? Finally, have you got enough confidence and, more importantly, professionalism to fight for the need to fix these short cuts at a later date? Obviously only if it makes sense to fix those short cuts, but there again is the point: deliberateness.
Some of the more generally agreed upon examples of technical debt are:
There is a bit of debate about this point. Some professionals out there, such as Bob Martin, would be inclined to think that a mess is not debt. What Uncle Bob means by “a mess” in this context is unprofessional practises like poorly formatted code, badly named variables and badly written comments. To Bob these are down to laziness; they are not decisions taken with the aim of achieving anything in particular.
"The more technical debt you take on, the tighter your disciplines need to be. You should do more testing, and more pairing and more refactoring. Technical debt is not a license to make a mess. Technical debt creates the need for even greater cleanliness."
And on reflection I suppose I’d have to agree with him. I listen to a great podcast by a very experienced entrepreneur, Sean McCabe, who is part of the design industry. The whole point and purpose of his podcast is to bring more professionalism to his industry. He’s a hand letterer, and given that premise you’d think there wouldn’t be that much to learn. You would be wrong to think that. In particular there is one thing that Sean says a lot about designers and artists. He talks about the situation where someone ends up with awkward clients and laments it. What Sean says about this situation is, “There’s no such thing as clients from hell because only designers from hell take on those type of clients”.
I think Uncle Bob’s perspective on debt and mess is coming from the same place. So to paraphrase Sean:
There are no code bases from hell just programmers from hell who would choose to work on those code bases
To be completely frank, I’m not sure. I’ve been reading a lot about this topic in preparation for writing this post and it would seem there are a variety of good options, all with their pluses and minuses.
There is a great article by David Laribee called Code Cleanup - Using Agile Techniques to Pay Back Technical Debt. In this article he talks about using a systems thinking process called the Theory of Constraints (ToC), which was developed by Eliyahu Goldratt. The idea behind it seems to be identifying bottlenecks and doing some degree of root cause analysis. It certainly seems like an idea I will be looking into in the future.
A few presentations I watched talked about using Scrum and Agile development methodologies to help manage and track debt. In these development systems you are talking about what you are doing at a very fine grained interval (daily stand-ups), perhaps even pair programming. In Scrum you might make a story or task which you would estimate when taking on debt. After a time you re-estimate your backlog items and get to find out if your debt is going up or down, and if you are monitoring velocity you can perhaps correlate those decisions against that value too.
Other means of measuring debt would include using tools like Sonar to track metrics like cyclomatic complexity. Though it is generally considered a good idea to reduce values such as cyclomatic complexity, it is dangerous to have people focus on numbers. Metrics are a powerful tool, but in the wrong hands, or just used incorrectly, they can do more damage than good. Just look at what has happened in justice systems around the world: many countries are coming around to the dangers of metrics in their police forces, their health care systems and their schools. Should the business and engineering world pay attention as well?
I’m not an expert by any means. I have an Internet connection and an interest. From what I can tell, the types of things that would be worth doing might be:
If the above seems potentially overwhelming, and perhaps you are worried about the cost it will have on your productivity, it might be worth considering this perspective from David Simner.
"Instead of looking at the entire application and looking at metrics over it, you basically just look at what you're going to have to change. Because some bits of the application are just going to sit there and not have to change. More interesting is the bits you are going to have to change and how those bits affect technical debt."
There are a lot of intertwining issues when it comes to using the debt metaphor. Its first and foremost role is to help you as a programmer find a way of talking to business staff. The language we use as engineers is alien to them; we need a common vernacular. Debt is a way of talking about technical decisions that a lot of people can understand, much more common and within reach than talking about “refactoring” and “clean code”. One of the biggest problems for me in my career is communication and developing my “soft skills”. Ultimately programming is about people. We are making tools for people, with people. Finding novel ways of thinking and talking about problems is part of what separates professionals from mere programmers.
Below are a series of articles and videos I used to research this topic. You might find it interesting to dive into them and get a deeper/better sense of this topic yourself.
If you’re reading this and know about continuous integration, then you are either convinced and using it, or you are on the fence. I want to help push you off that proverbial fence, or put you on it if you are on the “wrong side”. Before I continue, let me congratulate you on making the leap to automating your build system. Let’s not even talk about those people that still do it manually! If you have Task Scheduler or cron doing your build work then you probably have a command line build system. You’re on the right track!
My first question then is does the following look familiar?
Odds are you have to set up those machines manually? Odds are you have to configure the tasks manually? Odds are you are probably only doing a nightly build at best? Odds are those machines are sitting idle for long periods of time? Odds are you are not able to monitor the execution of the processes on those machines dynamically as they run?
Let me ask you a few questions about your setup.
If you are using cron, and you have a large product you’re building, I can probably guess the answers to these. If you’ve made your build scripts graceful enough they’ll send you an email. In fact if you have a few machines running you get a lot of emails. You’ll probably send them to a mailing list that lots of people are signed up to. Most of those people will have an email filter set up to stash the messages away or even delete them. Turns out if you spam your developers with build emails they will ignore them.
On a failure you might frantically log into all your build servers to read the logs, and then you might manually kick off your builds.
To change the interval you might need to log into a heap of machines and change crontabs or jump into Task Scheduler. Odds are, even if you know you need to have more frequent builds you’ll never bother because it’s a hassle.
If your builds take a long time, you wouldn’t even dream of setting up more than one per build server… if you have a lot of branches you probably have a server per branch. If you don’t have that many machines you’ll probably have some builds happen a few times a week, and others on alternating days. And that results in a long lead time between development changes and testing, acceptance and releasing.
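All of that pain tends to grow out of a handful of scheduled entries that look something like this (an illustrative crontab, not anyone’s real setup):

```bash
# One nightly build, firing blind at 2am, with the output dumped
# to a log file nobody reads until something breaks.
0 2 * * * /home/build/scripts/nightly-build.sh >> /var/log/nightly-build.log 2>&1
```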
Here are a set of goals you should shoot for:
You cannot easily achieve all of these goals with Cron or Task Scheduler. You shouldn’t try, because those are not the right tools to use to accomplish this epic set of requirements. So the question is…
What I described in the previous section was a continuous integration (CI) server; something like Hudson, Jenkins or CruiseControl. To quote Jenkins, a continuous integration server is an “application that monitors executions of repeated jobs, such as building a software project”. That quote really does not fully convey the power of using a CI server. All of the goals I described previously can be accomplished by using one of these systems and by having granular build script steps.
Let’s take the first point, “making the most of the server”. Because these CI servers understand source control, as you make changes to the code base they can pull that code down and do things with it. CIs can react; they are dynamic. Surely that’s all I need to say to sell this to you, but let me continue.
What about building your software with a single command? Well, you should have that sorted with your build scripts, the makes, rakes and grunts of the world. All a CI server will do is kick those off and monitor them. But here is the interesting part: CIs monitor the output of your system. They grab the STDOUT of the process they are running and can reason about it. They provide a web front-end for your build logs. No more do you have to jump from one machine to another to find out what’s been going on with your builds. The more data you spit out of your build process (tests that produce an output, code coverage that produces an output, static analysers or linters that produce an output), the more is captured and processed by these tools.
You really don’t know how life changing that is until you experience it. I know I sound like I’m over-egging it, but it really does change everything. You have a dashboard for everything that happens with your software; these systems integrate with intranet software, and they have APIs to grab data and stick it on TVs/dashboards if you wish! CI servers integrate with your bug tracking software, so you can find out exactly when a piece of checked-in code is built and ready to be QA’d, or whatever your process is.
The CI software handles things like checking out code, emailing, managing processes…etc. You now no longer need to put that stuff in your make, rake or grunt build systems. You’ve reduced the amount of code you have to deal with there. Less code means less complexity, which means fewer bugs/problems. It’s a win-win.
CI software systems integrate with the vast majority of bug tracking software, and have APIs and plugin infrastructures that allow you to write whatever you want. More than likely you only need to search for a plugin and you have an incredibly complex feature straight away, no work. You focus on the things you actually care about.
While talking about these CI servers at my current workplace I brought up one of the most important benefits of this technology. Imagine, if you will, that you have multiple build servers. They all have their own independent setup and need to build different branches/versions of your code. How does CI software help you manage all the data that will come out of each? Do I have to go to multiple dashboards to see what’s going on?
Tools like Jenkins provide a sophisticated master/slave architecture. In this system you have a single Jenkins instance at the top which manages all build-related work and captures and displays all the data for the work carried out on the slave servers, where other instances of Jenkins broadcast their willingness and ability to build software. The master then coordinates all of the slaves, gets them to do their building and then captures all of their output.
You get all of your data for every build in one place, and you get to see the build progress of these different environments. You have visibility into your multiple-build-server set-up. It’s incredible. What’s better is that, since they are CI instances, you can set them up to queue builds. You can get the maximum benefit out of every machine, having it run the maximum number of builds possible in any 24 hour period without the possibility of builds corrupting each other.
I have listed some compelling reasons why I think a continuous integration server is a much better option than Cron or Task Scheduler for your software products. Now it’s up to you whether you want to use one. These kinds of build engineering tools and practices are becoming standard in our industry and I think it’s time to take notice. Someone smarter than I once said:
the build server is critical-- it's the heart monitor of your software project.
I think this is more true now than it was in 2006. So go forth, get rid of those scheduled jobs and gain some new-found peace of mind while you are at it!
Below is a list of links that I used to research this topic. I’ve placed them here for your convenience, and so you can double check the information I’ve given you if you would like.
There are loads of CI servers out there, and you don’t need me to tell you which one to use. That’s up to you. I’ve listed a few options throughout this article; here is a neat list of those for your reference.
The following are articles I used as reference points in this post. I’d highly recommend watching some of the Etsy videos; they are very inspirational, because once you go past just using CI servers you find out what it takes to ship your product multiple times a day.
Here are a few miscellaneous links, articles and pieces of software. The most useful of which is probably Shopify’s Dashing framework.
The build system I updated recently was a Grunt system for a web front-end. Previously it had been a bit of an odd structure:
Other problems with the build system included:
First let me explain the old structure. Each part of the build system was a whole self-contained Grunt build system. In fact there was a fair bit of copy/pasta going on here, so it violated the DRY principle to an extreme extent. Given that they were individual build systems, you’d have to do an npm install in several different directories, some of the plugins/dependencies would be downloaded in multiple places, and the package.jsons had gone out of sync, meaning some parts of the build system depended on newer versions of certain plugins. Clearly a bit of a disaster.
It was time to clean things up.
My task was to merge all of the different grunt files and package.jsons into one place. I started with the package.jsons, which seemed like an easy enough job. I reasoned that there would be no adverse effects to getting the build system to converge on the latest version of each of the plugins. I was correct, and was able to get a workable package.json fairly quickly.
I knew, looking at the different grunt files, that merging those into a single file right away would be a bad idea. So I researched a few plugins which would help me split the contents up a bit. I found a couple of great articles on managing Grunt build systems, and the two plugins that stood out from that research were:
Load-grunt-config is a great plugin. It is a nice little piece of work created by Anatoliy Syernyi which handles the loading of plugin configurations and tasks for you. This allowed me to create a whole new file structure that looks something like this:
Let me explain the structure. Now there was a single entry point for the build system: one single Gruntfile and one single package.json. Only one npm install was required to set everything up. All the different plugins the build uses belong there, and the individual tasks are each defined in a file of their own under tasks. This looked like overkill at first, but eventually you end up with very specific handler routines and configuration for some tasks, which it makes sense to encapsulate in a single location. I also included a special utils area where shared functionality (helpers.js) and derived project paths (globalSettings.json) would live.
The major and important element of this restructure was the devSettings.json.sample file. This is a special configuration file that would sit on each developer’s machine and, when renamed to devSettings.json, would allow them to have their own local file paths and overrides for the various build settings. The file devSettings.json was made to be ignored by the source control system, meaning there was no chance of it accidentally getting checked back in.
I got this to work by using the postProcess handler method that load-grunt-config exposes, which passes you an object instance of the assembled configuration content. Here you can use a plugin like deep-extend to merge those settings with your own personal overrides, which is what I did.
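Here is a minimal sketch of how that looked, assuming my recollection of load-grunt-config’s options (configPath, data, postProcess) and of deep-extend’s recursive merge is accurate; the file names follow the structure described above:

```js
// Gruntfile.js -- a hedged sketch of the single entry point described above.
module.exports = function (grunt) {
  var path = require('path');
  var deepExtend = require('deep-extend');

  require('load-grunt-config')(grunt, {
    // one file per task lives under tasks/
    configPath: path.join(process.cwd(), 'tasks'),
    // shared, derived project paths
    data: grunt.file.readJSON('utils/globalSettings.json'),
    // overlay each developer's local, un-versioned overrides if present
    postProcess: function (config) {
      if (grunt.file.exists('devSettings.json')) {
        deepExtend(config, grunt.file.readJSON('devSettings.json'));
      }
    }
  });
};
```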
One of the upsides of using load-grunt-config was that it allowed me to use a plugin called jit-grunt. This plugin claims to enable a build system where:
“Load time of Grunt does not slow down even if there are many plugins.”
This sounds great and I used it right out of the box. I don’t think the number of plugins currently in place causes that much of an issue, but I know I want developers to use the build system as part of their daily workflow, and run speed was going to be an issue, so I decided to be proactive. Anyway, it was only 3 lines of code to activate it.
One of the downsides of load-grunt-config is that it depends on a somewhat de facto naming pattern linking plugin task names to their npm package names. This can fall down quite easily; I encountered it with two plugins in this build system:
In the end I had to sully the main grunt file with two grunt.loadNpmTasks calls, which annoyed me greatly.
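If I revisit it, jit-grunt’s own mapping support might be a cleaner way around this. A hedged sketch, with made-up plugin names:

```js
// jit-grunt accepts a custom mapping for tasks whose package names don't
// follow the de facto grunt-<task> naming pattern (names here are invented).
require('jit-grunt')(grunt, {
  oddtask: 'grunt-some-oddly-named-plugin',
  othertask: 'grunt-another-plugin'
});
// When jit-grunt is wired up through load-grunt-config, I believe the same
// mapping can be supplied via its jitGrunt/staticMappings option instead.
```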
One catch with using jit-grunt is that, because it does not load tasks into their namespaces until they are explicitly called, running grunt --help and grunt --version --verbose no longer worked properly. As a result I am currently unable to do any kind of bash autocompletion of the tasks for my newly cleaned up build system :(. There might be a way to get it back; I’ve put in a GitHub ticket against grunt-cli to see what those guys think of my issue.
The previous build system was lacking on the logging front. If something went wrong it would typically get swallowed, which is a pain, because being able to understand what went wrong and how long it might take to fix is important. Throughout this restructure and rewrite process I took the trouble of using grunt.log, grunt.fail.warn and grunt.fail.fatal where appropriate.
I also made sure to include an extra command line flag, --debug, so our email reports wouldn’t be drowned in debug details.
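In practice the tasks ended up looking something like this hedged sketch; the task name, file paths and messages are illustrative only:

```js
grunt.registerTask('prepare-assets', function () {
  // chatty output is opt-in via the --debug flag
  if (grunt.option('debug')) {
    grunt.log.writeln('Resolved asset paths: ...');
  }
  // hard failures stop the build and show up clearly in the report
  if (!grunt.file.exists('dist')) {
    grunt.fail.fatal('dist/ is missing; run the compile task first.');
  }
  grunt.log.ok('Assets prepared.');
});
```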
Using this set-up I was able to create a build system that was:
I moved jobs recently. Previously I had been using web technologies to create an application visualization development environment, a bit of a mouthful. The application was a form of website builder, which is the simplest description I’ve been able to come up with. Using web technologies meant JavaScript, and the place where I was working felt it might be a bit of a risky technology to use. The reasons were, firstly, that JavaScript, being an interpreted language, was seen as inherently unsafe, in the sense that you’d have to wait till runtime for things to blow up. The other primary driver (look at me going all business-speak) was the very large gap between JavaScript and Java (the main development language in the rest of the company), meaning you might not be able to as easily move programmers from other teams onto this new one. Plus tooling, development environments, documentation…etc.
A decision was made, and I ended up with two-plus years’ worth of TypeScript experience.
TLDR: It took me a long time to warm up to it, but TypeScript turned out to be a great tool; jump to the conclusion to find out more.
TypeScript lets you write JavaScript the way you really want to. TypeScript is a typed superset of JavaScript that compiles to plain JavaScript. Any browser. Any host. Any OS. Open Source.
TypeScript gives you ECMAScript 6 features today. It’s a transpiler that takes ECMAScript 6 syntax, plus various additional TypeScript features, and turns them into valid ECMAScript 5 or ECMAScript 3 code. The good people over at Microsoft have been working very hard on achieving this and have done a fantastic job so far. On top of that you get something that is not present in ECMAScript 6: type safety. Type safety is of course a very contentious issue. I’d be surprised if the average programmer hasn’t been witness to, or been part of, the great debate of static versus dynamic typing. I never really had strong feelings either way; I just had a general unease with the clunkiness of Java’s type system versus the elegance and succinctness of languages such as Python and Ruby. To be honest I think most of the debate around static versus dynamic is probably Java’s fault, a view I’ve seen kicking around for a while.
The genius of TypeScript is the fact that the type system is optional. You can happily use TypeScript for its other features, such as its class syntax, the module system and other kinds of inferred type checking, without ever breaking out explicit “number”, “boolean”, “string”, “Object”, “Event”, “any”…etc annotations.
Some of the following “good parts” I must admit are a bit tenuous. There are plenty of features in TypeScript that you could have gotten from the likes of CoffeeScript (initially at least; TypeScript’s feature list keeps growing though). For instance, if all you wanted was JavaScript with a slightly nicer syntax and something to handle chunky code around closures, you would be better off with CoffeeScript.
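To give a flavour of it, here is a small illustrative sketch (the names are made up, it isn’t taken from any real project) of the kind of class-based code I mean:

```ts
// Classes, interfaces, typed public/private members and inheritance.
interface Serializable {
    serialize(): string;
}

class Person implements Serializable {
    constructor(private name: string, public age: number) { }

    serialize(): string {
        return JSON.stringify({ name: this.name, age: this.age });
    }
}

class Employee extends Person {
    constructor(name: string, age: number, private employeeId: number) {
        super(name, age);
    }

    serialize(): string {
        return super.serialize() + " #" + this.employeeId;
    }
}

var worker = new Employee("Sam", 30, 42);
console.log(worker.serialize());
```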
I’m not going to go into a full feature list, but as you can see, the above style of syntax is the kind of thing you’d expect from an OOP language. One of the main developers of the language is Anders Hejlsberg, the lead architect of the C# language and a key figure behind the .NET framework. So it’s coming from an excellent pedigree.
As you can see from the code snippet, there is a class syntax which is cleaner and more recognizable to people coming from traditional OOP languages like C# and Java. Under the hood, the code that is generated is essentially the usual prototypal inheritance system you’d expect.
Naturally for an OOP language it has interfaces, public/private methods and class attributes.
Finally, for me one of the great features of TypeScript is that its syntax, for the most part, does what you’d expect. You don’t need to worry about the context of “this” so much. Using the “fat arrow” ECMAScript 6 syntax (which originally came from CoffeeScript) you can get around a lot of messy closure issues.
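A tiny illustrative sketch of what I mean (the class is made up): inside the arrow function, “this” still refers to the instance, with no “var self = this” dance required.

```ts
class Counter {
    private count: number = 0;

    start(): void {
        setInterval(() => {
            this.count++;              // "this" is still the Counter instance
            console.log(this.count);
        }, 1000);
    }
}
```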
Imagine a world where you didn’t need to check every parameter passed into a method for a “nil” state. Great, isn’t it? Well, with a compilation step you express your desire for a method to never be called without some parameter, and then the compiler makes it so.
Now that isn’t the whole story, but it pretty much captures it. TypeScript (and generally anything that isn’t plain old JavaScript) gives you extra tools of expression and puts rules in place to make sure your intentions are respected. Surely that can’t be so bad?
Having an outline of a library’s API was a godsend. No longer did you need to know a library inside out before being able to use it effectively. In fact I would say that is one of JavaScript’s major issues: there is only so much context and code you can hold in your mind at one time (that is the case for me anyway), and the more things you have to juggle the slower you will go.
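For anyone who hasn’t seen one, a declaration file is roughly this shape; the library and functions below are invented for illustration, not a real DefinitelyTyped entry:

```ts
// widgets.d.ts -- describes the surface of a plain JavaScript library so the
// compiler (and your editor) can check your calls against it.
declare module "widgets" {
    export interface WidgetOptions {
        title: string;
        width?: number;   // optional, so callers can omit it
    }

    export interface Widget {
        render(target: HTMLElement): void;
        destroy(): void;
    }

    export function createWidget(options: WidgetOptions): Widget;
}
```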
Now that isn’t the whole story, but let me save that for the Bad Parts below.
As I mentioned above, TypeScript gives you a few more syntax tools to allow you to express your intentions. As such you have a better idea about the intent of a method beyond just its name; you get types to go along with the input parameter names. You can see at a glance whether there is a return value and what type it is going to be. The pesky open-ended “options” object now has a proper description of the keys, values and expected types of all of its possible contents. All very useful things. And of course the tools (which have really only been showing up in the last year or so) allow you to generate decent dependency graphs…etc, so you can plan refactors and see hotspots pretty quickly.
Since there is much richer source code to draw from, documentation generation should become a lot easier. In the case of JSDoc, you have to keep those comment sections in sync with the code and, since there is nothing to properly enforce that relationship, things go out of sync quickly and the comments become a lie. Ultimately I didn’t find any decent documentation generation tools for TypeScript; I tried getting a TSDoc-style system up and running, but with little success.
Ah yes, and after all that positivity comes the pain.
The primary reason we went with TypeScript was “tooling”, which is hilarious when you consider we took it up when it was v0.8. The only tooling that existed was Visual Studio, which at the time we couldn’t get to work for us. At that stage Visual Studio and TypeScript needed to be taken up at the same time, and we had begun writing TypeScript code before then. We didn’t have a huge code base, but it was big enough for someone to say “screw it” when it didn’t instantly work. When I was leaving, the team was looking at WebStorm, which looks like a fantastic tool and would be a major boon to anyone looking at development environments for any kind of web development.
The reason VS didn’t work for us was because of the “reference” paths. For the compiler to be able to stitch things together properly, each file had to include a reference to the paths of the other files it needed (later we found the “reference file” method to be easier and more maintainable). For some reason I think that screwed with VS, and it probably didn’t help that we had separate modules going and we weren’t using VS to build our code.
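For context, the per-file reference paths look like this (the paths are invented for illustration); the later “reference file” approach just gathers them all into one file instead:

```ts
// Each file declares which other files the compiler needs in order to stitch
// the code base together.
/// <reference path="../models/person.ts" />
/// <reference path="../utils/helpers.ts" />

// ...the rest of the file can now use anything declared in those files.
```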
Obviously though, that was not the language’s fault.
Source maps, though they sound like a nice idea, are a nightmare. The technology was made in hell and deserves to burn there, forever.
Hyperbole out of the way, the experience in Chrome, which was the main browser I was debugging in, was terrible. Not initially though; initially it was great. Even having the Sublime Text style “Ctrl+P” open-a-file-with-fuzzy-search was brilliant. The problem was when you placed breakpoints and subsequently edited your code. This is where things went to hell. I never quite figured out the set of steps to reproduce it (to be honest I turned off source maps pretty quickly), but when you had breakpoints and were refreshing the page after doing an edit, you could find yourself debugging code out of sync with the new edits. Sometimes I found I was looking at a piece of code that wasn’t the code I thought I was debugging and wondering what in the hell was going on, why some things were nil when they should have had a value. All manner of strange things would happen with source maps and me (other developers never seemed to run into issues), so I had to give up on them.
Obviously though, that was not the language’s fault.
Handing over a substantial amount of your code (the generated and managed inheritance system, for instance) to TypeScript has some trade-offs. For one, it depended on the library makers being sane individuals and not doing weird JavaScripty things (read: normal JavaScripty things) when enhancing some core library you depended on. For instance, I found myself integrating a few third party Backbone plugins with Backbone and the code we had written for TypeScript. At the time I could not get TypeScript to agree with what was happening. I’m pretty sure the code would have run if I tried it in the browser, but the compiler freaked out, so I just left it be.
Integrating with third party libraries has a tendency to be a pain in any language at times. I found it to be more so with TypeScript. Now if the library was written in TypeScript then that’s a different story, but 99% of libraries for the web were written in JavaScript so…
You know that thing I said about “you need to know a library inside out before being able to use it” and how TypeScript/DefinitelyTyped solved that? Yeah, only kinda. Right off the bat, not all the libraries are there. Lots of them are, I grant you that, but not all of the ones you might need (for instance there is lots of Angular and very little Backbone). If you need a library that is not part of that repo, then tough luck, go write your own definitions. Which, if you have to do that under time constraints, can be kind of tough.
There are a few issues in a real production application that you have to account for when using a library. You want to make sure not to be too quick to update libraries; sometimes things change pretty radically (it’s usually okay if they use semantic versioning). That radically different stuff might mean you can’t upgrade right now. That’s an issue.
Sometimes DefinitelyTyped definitions can be written badly. It’s just a fact of life that a piece of code can be written badly; this is a community project after all. These people aren’t doing this for money, it’s done in their spare time. With that in mind, it makes sense to keep up to date with a d.ts, because I’ve seen bugs arise from a d.ts having an “any” where it should have a more solid type, and engineers wasting a bunch of time wondering why in the hell the compiler didn’t catch something.
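A contrived example of the kind of thing I mean; both declarations below are invented:

```ts
// A lazy "any" in a definition file silently hides a misuse that a proper
// signature would have caught.
declare function parseConfig(raw: any): any;                       // lazy d.ts entry
declare function parseConfigStrict(raw: string): { port: number };

var a = parseConfig(42);          // wrong argument type, but no complaint
// parseConfigStrict(42);         // would be rejected: number is not assignable to string
```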
Sometimes you can’t move up a compiler version, which can be an issue if someone in the community rewrites huge swathes of a d.ts to use some new compiler feature (which is the correct thing to do, in my view).
So you have a few vectors… the third party library version can increment, the d.ts version can increment at its own pace, and the compiler version can increment too. Frankly it was a nightmare to manage and keep an eye on. What would be needed is a package management system for those DefinitelyTyped files, which does exist now, but it is still in its early days. Frankly it would have been better for the TypeScript team to weigh in on the design and production of those kinds of support systems, because they were sorely lacking during my time with the language.
Having a build system was a problem, for reasons you’d think of and reasons you wouldn’t.
From a pure JavaScripty development style it means there is an extra step between you and the result of what you’ve written, though I think that is an okay thing. Putting the brakes on can help the design and logic process. It makes you a better engineer to tease things out instead of just throwing code at a wall and seeing what sticks. However, after a while the compile step becomes pretty heavy (20-30 seconds).
Now frankly that could have been our fault. We may not have been taking full advantage of the module system, or were using it incorrectly. When I was there we had three modules with about 30kloc. Not massive when you consider TouchDevelop looks to be in or around 160kloc. I’m not sure what we were doing wrong, but I found development to be slower even than what I had experienced with Java.
The other issue was that we had to recompile our whole code base when any (TypeScript) file changed. I reckon if the compiler allowed you to compile individual files, yet somehow was still able to figure out how to stitch them together without having to consume and read the whole code base again, that would produce an enormous build performance improvement. But that is just speculation. I’m sure the builds could have been acceptable if we had the right type of module/build setup.
My single biggest issue with TypeScript is that it produces artifacts even when the compiler blows up with errors about your crappy (mine in this case) source code. The worst thing was that, since we had a cascading build system (we used rake and task dependencies), it would find these artifacts on a second build and then just assume everything was fine. That I see as a genuine issue with the compiler. But I’m sure it’s probably a feature, not a bug.
In the end, despite the warts, despite the complexity, I enjoyed working with TypeScript. TypeScript, or transpiled languages in general, make a lot of sense for large scale JavaScript application development. JavaScript is great with a small team, it’s great for making things at the 10k lines of code mark, but once you start hitting the seriously big teams and LOC counts you see in enterprise code bases you need the extra expression. You need the types. You need the structure.
The biggest disappointment was the Chrome debugging. I ended up having to debug the generated JavaScript all the time if I wanted a consistent and reliable experience. The tooling in general made things very hard. IDEs weren’t reliable for a long time, and plugins for editors like Sublime Text only went so far. Additional tooling like documentation generation was fragile and didn’t work without breaking your back (I broke my back and it still didn’t work). The tooling situation has changed and improved massively in the meantime, but I’m talking about my experience, so.
I found there was a great deal of verbosity in the code I was writing. I don’t blame the language for that; more and more features appeared (even long before I left my last job) that were going to make the code we had written far simpler, it’s just that we didn’t have time to look at upgrading the compiler right then.
Ultimately the root cause of all of my pain with TypeScript was that we adopted it too early. It was too early for good tools, for any tools. It was too early for the community; it was too early to find out what the best set of “best practices” would be. We spent a lot of time wrangling with the technology instead of writing the tool we were building. But that is the nature of being on the bleeding edge, and having gone through that pain the project was in a better place and on a better trajectory as a result.
Ideally you’d never need to go there. The software you use and install would take care of all the details itself, but that is an ideal world, and not one we live in. For me, it was Java. In the pre-Oracle days of Java, whenever you installed the JDK on a new machine you’d end up in the WEVD. It was always a pain in the butt, but you’d look past it because you’d be in and out in moments (once you knew what you needed to do). As a matter of fact, I’d like to take a moment to thank Oracle for creating the Java Control Panel and for making sure the environment variables are updated when you install the JDK on your machine.
Obviously this is more of an issue when doing Java development without an IDE such as Eclipse or NetBeans. In college we weren’t encouraged to use IDEs, so I didn’t, and I had to deal with these issues a lot.
The main characteristics/flaws of the WEVD are:
I have been doing a bit of research on the environment variables dialog, because I have become somewhat obsessed with this little corner of the Windows world. From what I can tell, the dialog itself goes back to Windows NT, possibly closer to the late end of the 90s. It wasn’t until NT was folded into the consumer version of Windows that it showed up in the everyday user’s world (not that anyone really knew what they were doing with it back then; well, I didn’t at least).
Right before that, Windows had the autoexec.bat file. This file, and the usage of it, was much closer to the Unix philosophy than what it would become. You had a plain text file you could edit to get your particular program or scripts to show up in your system path. You could edit this file with whatever you wanted, e.g. Notepad.
I presume the reason for the dialog was Windows’ shift to the registry model for storing OS and application level information. As such, the dialog is an interface to the registry database. And what a woeful interface they ended up making. But it did the job. And it continues to do the same job… probably something like 17 years later. The WEVD has not changed in any significant way in roughly 17 years. 17 years.
Below I present to you every version of the WEVD in the last 15 years:
All of this is a bit unfair of me, I suppose. A basic principle of Microsoft has always been “good enough”. In truth the WEVD is “good enough”. And there are plenty of alternatives that can do the job much better. I’ve been using a tool called RapidEE for quite a while now and it’s everything you’d hope for in an environment variable editor. I guess it’s just frustrating when you see it, and it makes you wonder how it’s possible, from a software development point of view, for a part of a software system to remain developmentally dormant for so long.
Have you ever been in the following situation: it has been 8 months since you worked on a piece of code, you have moved on to bigger and better things, different features, a different project, and, horror of horrors, you get an email saying someone has found a bug in your code. It was a reasonably complicated feature you worked on, it took a lot for you to get it to the state it was in, and perhaps it wasn’t as well unit tested as it should have been, or, perhaps worse yet, it’s unit tested out the back end and every test is passing?
I’ve been in this kind of situation. I haven’t been working professionally as a software developer for that long, yet I have had it happen to me on at least 5 or 6 different occasions. The first few times it was a bit of a disaster. Not because the issue was big, but because it was a blow to my confidence. Let’s move on from that though. As I got my hands on larger and larger features it became a much bigger deal. All that valuable information was stored in the far reaches of my brain and nowhere else. When the last commit message on a piece of code has your name on it, you’d better get ready.
This is why I take notes when I work. I take very meticulous notes, in fact. Would it surprise you to know that I can tell you what I was doing, in a reasonable amount of detail, for every single day of my professional software development career? I can tell you about defects I’ve worked on and how I felt about developing certain software features. The rationale behind certain decisions I made about certain features. Why I started something but didn’t go back to it. Exactly when I learnt about a really useful feature of a framework I was working with. I can give you references to all manner of Stack Overflow questions and answers I’ve used while developing a feature or working on a defect.
I believe in writing every day. I believe in this principle not to become a better writer and communicator (though that is a side effect), but to cover my back. The software and documentation processes I follow are not detailed. Well… they are detailed from certain perspectives, but they are not detailed from a personal perspective. Take anyone you work with, and take a feature they have worked on. If there isn’t a sentence in a document describing why they chose to do (x) instead of (y), it’s likely they don’t remember and never will. I know that was the case for me when it came to college projects and the very early days of my career.
It’s also useful from an administrative perspective. Where I work I’m expected to fill out a monthly time sheet. I know when I was in or out sick, and I know when I was working on features for one project versus another. I know whose time I took up and when. I know how many meetings I go to and what was discussed and decided there. This is highly valuable information for my employer. They can be sure I am billing the right projects for the right amount of time every month.
From time to time I’ve heard people talking about the value of “learning in the open”. Sometimes that is not possible. I could not blog about the intimate details of the technologies I deal with daily. It’s just not possible, because it’s all proprietary technology. It is the IP of the company I work for, not mine. As such, the kind of details I am keeping, though they could never be used to engineer or reverse engineer any of their tools, are still considered sensitive, and I take a personal and professional responsibility in never revealing those details publicly.
I started doing meticulous note taking, in the way in which I do it today, back in 2008. While at college I had a lecturer who insisted we complete a weekly “Learning Journal” for his subject, “Industrial Automation”. To be frank, it was a bit of a pain at the time. The questions were broad and required a great deal of effort to come up with adequate answers. The following were the different question headings we’d have to complete week to week:
But I’ll tell you right now, I can tell you more about his subject than any of the rest of the subjects from that time combined… and when it came to writing a final report, it was a breeze! The level of detail required from us, and the level of introspection around what we were learning, would carve that knowledge deep into the synapses of my brain. And ultimately it was a fantastic resource.
I did a taught masters in the same college and had the same lecturer for a project. Again, the learning journal was a requirement of my output. And once again it was an immense help when it came to understanding what I was working on (deep introspection) and a life saver when writing the final dissertation.
I have three tools in my arsenal when it comes to note taking:
I think digital notes are very important. Being able to search my brain, as it were, is a highly valuable ability. However, I do not have or bring a laptop or tablet with me into meetings or when formulating ideas. I think that pen and paper are faster and much more valuable when it comes to getting ideas out of my head.
The note taking format that I use is a variation on the Cornell method.
It is basically identical to the Cornell system except I put the points/questions column on the right hand side instead of the left.
An important element of my note taking method, and the one I recommend you adopt even if you don’t do any of the rest of what I describe in this article, is to date and time stamp every page of writing you do. It’s a frustrating experience (for me at least) to go over older notes and not know when I wrote them. I get a great sense of enjoyment seeing my progress as an engineer and developer from reviewing my notes, and not having those dates hampers my ability to do that effectively.
Once I finish working on an idea, or come out of a meeting, I proceed to digitize my notes. I don’t go overboard. I usually just take the main points and tasks/summary and place them into my note taking application. RedNotebook is a desktop journalling tool. It allows me to write a daily log of everything I work on.
In the image you can see the main parts of the application: on the left are the date widget and the tag cloud, in the middle is the main note taking space (formatted in markdown), and on the right you can place tags (very useful for searching).
RedNotebook also supports pre-written blocks of text called templates (they are like code snippets) which you can throw into your current work day whenever you would like. This is a useful feature when it comes to putting notes about meetings into my log (I go to a lot of meetings). I usually tag each day with a high level view of what I’ve worked on.
At the end of all of this I now have detailed hand written notes, with the most important information in RedNotebook… which also acts as a form of indexing system for my hand written notes. So when someone comes knocking at my door asking about some feature I’ve worked on, I fire up RedNotebook and start searching; if the relevant details aren’t there, I know what date to flick to in my binder full of notes.
I’ve gone to the trouble of doing a bit of graphing for this post. Above I’ve provided a graph of days versus number of words written. As you can see there is a lot of variation, and my average number of words written reflects the amount of information that I keep. Granted, some of it is boilerplate (personal meeting minutes, where I detail who attended and the like).
RedNotebook provides some basic statistical information, namely the following table of data, which is accurate for my journal as of 11 December 2014.
Statistic Type | Data |
---|---|
Words | 106177 |
Distinct Words | 9442 |
Edited Days | 687 |
Letters | 663692 |
Days between first and last Entry | 978 |
Average number of Words | 154.55 |
Percentage of edited Days | 70.25% |
Ultimately what I am saying is: writing is good. Keeping notes is a rewarding and, more importantly, professional thing to do. And when your official process is not extremely granular and you want to avoid “project lore”, use a process of your own to keep track of that information. If you can, make that public (within your company; intranet blogs?), and if not, then maybe consider a personal system like mine.
The fundamental premise of “Coder to Developer” is to provide a starting point and practical guide to becoming a well rounded software developer. To transcend the basic title of “Coder” and become someone who has a full understanding and command of the entire software development life cycle. Mike Gunderloy, following in the footsteps of authors such as Andy Hunt and Dave Thomas of The Pragmatic Programmer fame, tries to give an insight (opinionated but practical) into what has gone into making him the developer that he is today.
Before I go on to tell you what I thought of this book and what you might get out of it, let me just tell you a thing or two about Mike Gunderloy. At the time of writing this book, Mike was a .NET developer working as a software consultant for “Larkware”, his company, which is still around. At the time Mike had many years of experience as a professional programmer, so he definitely knows, and knew, his stuff. In more recent times (2006) he has moved on to being a Ruby on Rails developer.
He keeps a personal blog whose tagline describes it as “Notes on Rails and other development”. I’ve popped by there a few times and found some very interesting links. If you pop over to his professional website you can get his email address to contact him, and you can also find him on Twitter as @MikeG1.
The book tackles practical topics that are rarely emphasised in technical books, but are of extreme value and importance in real world production applications. Dry and boring ones such as logging, requirements, schedules and intellectual property. Of course you may be into that kind of thing, but I know that I don’t find them the sexiest of topics.
The book is broken down into 15 chapters which go into varying levels of depth. At times Mike talks in very specific terms about technologies and concepts from Windows development and the .NET framework. He also provides us with an example application (a download tracker) which he returns to from time to time while moving through the book and its development life cycle.
Above I’ve listed the chapters in Mike’s book. I want to talk a bit about some of the chapters I found quite interesting while reading. There are a lot of valuable insights to be had throughout, but here are a few of my highlights.
For all of my professional career to date, I’ve worked in large organizations with defined processes around bug management and tracking. In fact, much of what is discussed in this book from a process point of view I’ve never given any real deep thought to because of that fact. In one sense this chapter didn’t tell me anything I didn’t know, but in another it was incredibly revealing. Of course, different organizations have different processes too, so the one Mike detailed did differ from what I’ve been used to. Hence, it made me sit up and pay attention.
One of the areas I don’t have much exposure to as a programmer where I work is “Risk Management”. The way in which Mike talks about risk, how it factors into his workflow and the way in which he managed it (through statistical probabilities), resonated strongly with me. In the same way as risk management caught my attention, so too did Mike’s take on “Zero Bug Goals/Mentalities”. I suppose at the time the team I was on had just committed to a zero bug goal from that point on; the quote that hit me hard was this:
Microsoft on Windows 2000: "Rather than focus on a zero-bug goal, they focused on having zero unfixed bugs that required fixing"
Finally, the main reason I really enjoyed this chapter was that Mike goes into detail about his own particular independent software development set-up. For instance, when advocating for robust testing of your application, Mike describes his own test set-up (a lot of PCs in his home office). I guess I just love hearing about that kind of thing, a developer going out there and making it on their own!
The thing that absolutely rocked this chapter for me was Mike’s advice for lone developers, and how they might go about testing their software without a dedicated QA team.
Honestly, I would never have thought that anyone could say so much about logging. What Mike manages here is astonishing. Now, some people out there will find nothing new here, and this is a chapter where Mike goes into specifics about Microsoft Windows development tools such as the Microsoft Enterprise Instrumentation Framework (EIF). But you see, there were some concepts in here that I had never encountered. Some of the frameworks that exist for .NET development sound quite incredible, and of course this is an old book, so I can only imagine what kind of cool technology exists now.
I’ve read a bit about logging as a concept before, and Mike just hammered home some of the ideas I’ve been reading about.
I don’t know how it was possible to write so many interesting things about documentation, but Mike did it. I suppose on one level it might be down to where I work, and having those kinds of concerns taken care of for me by other engineers. Mike explores first the importance of documentation, both from the point of view of the customer using your products and of the engineers working with your API (if you are on a small team, or producing an API for someone to use). What interested me the most was the idea of information architecture: how do you make good documentation (for the end user), and how can you avoid making something which is based on “Mystery Meat Navigation”? Who would have thought that information architecture was something that would give me kicks? I know I didn’t.
In another chapter, “Generating Code”, Mike explores the idea that you might be able to automatically generate a certain amount of your documentation as well. Which, though not a new concept, is always a good reminder to engineers that it’s a possibility.
One of the criticisms I’ve seen levelled at this book on-line is its heavy use of specific technologies while discussing various topics. On one level I would agree with these criticisms; it would have been nice to know I was getting myself into a Microsoft Windows/.NET take on becoming a better developer. Outside of that, though, I don’t think the criticism is completely valid. Using platform/technology specific examples exposes you to possibilities. I’m not a .NET developer, but being exposed to the types of ideas that can be found in that stack made me want to know whether the same was available for the systems I use and believe in. I believe you should always be exposing yourself to new ideas and concepts. It is true that a book might not be the best place to be exposed to those kinds of ideas, but I didn’t think it took away that much from the experience.
Would I recommend “Coder To Developer”? It’s a well written book attempting to tackle very complex topics, which at times get sidelined, but it reveals very interesting nuggets of information all the same. I enjoyed the book and feel I was able to take away a lot from reading it.
To purchase Coder To Developer head on over to Amazon and grab yourself a copy.
References:
I drive an ‘Opel Corsa 1997’. I should say I did drive one, as I recently got an upgrade to a ‘VW Golf’. The Corsa was great, it got the job done, and it was great getting the freedom a car affords you. That being said, it was a Spartan driving experience. No ABS, no power steering, a cassette player instead of a CD…etc. The bare essentials and nothing but. Which is what makes driving a powerful modern car like a Golf a delightful experience. It handles beautifully and has real power in its engine. It’s responsive and definitely feels safer.
There was a moment recently, while I was sailing up a steep incline on my way somewhere, when I didn’t quite hit the ‘bite point’ and got that awful drop in power, panicking to get to a lower gear and to keep moving for fear of stalling. That was the moment when it hit me. All cars have the same kind of interface to control them. How much they do for you can vary wildly. Driving a car well isn’t about the controls; don’t get me wrong, they are incredibly important. Driving a car well is about the nuances of its operation, about constantly assessing how different it is to your ‘a priori’ knowledge of how a car operates.
When it comes to programming it’s about a few things: knowing your basics and knowing them well, constantly “restating your assumptions” and seeing how they are affected by what you are doing, and being engaged.
When I heard of this idea at first, I was fascinated. However, on reflection I realised that I did, in fact, learn in a similar way when I think of my school days and copying work down from the blackboard. It probably served a similar purpose. However, “copywork” has a few distinct differences all the same.
Depending on which author or authority on the subject you were to talk to at the time, you’d get a different perspective. Copywork is as much about concentration, spelling, grammar and memory as it is about study. Some authors, for instance, would read a paragraph or page and try to reproduce it from memory. Over and over again, until they got it right. The blog post even mentioned how Jack London employed this technique as an aid to finding his voice and improving his style after initially being rejected by publishers.
What strikes me the most about this idea/practice is that it is nothing new to me. I’ve spent countless hours playing along to music I love in the same way. I’m also aware that painters and sculptors spend a lot of their time recreating the work of the greats, all in search of a better understanding of them and of themselves. To find their voice, style and niche.
And how, I now wonder, could I use this technique with programming? How is programming different to any other creative discipline I’ve mentioned? Would one benefit from transcribing the full source code of a program most consider to be brilliant? What would those code bases even be? And how would you even approach it?
Well, I have some answers to these questions, as I’ve been trying this technique out. I’m going to update the blog at some later date with the exact details, but suffice it to say, it’s hard work.
As I bring it up, that is kind of the reason I walked away from it, for the moment at least. I noticed that I wasn’t solving a problem, I was re-solving it, and probably in a not very useful way either.
I’ll get back to it someday I reckon though.
Anyway, I’ve taken some time off, and one of the things I’ve been looking into is JavaFX. I’ve been following an excellent guide by “Marco Jakob” over at this site on creating a simple address book application using the technology.
What I like about JavaFX, from what I’ve seen of it so far, is the kind of separation it puts between your code base and the look, feel and flow of the application. It reminds me of .NET and WPF, which I think are pretty good technologies.
Using Google Code Pro, it’s clear there is a lot of dead code. From just reading the comments and translating them using Google Translate, a lot of them are extremely redundant. There is a lot of pointing out the obvious, commenting for the sake of commenting, and not enough about why certain decisions were made. Comments are about the why, not the what.
I haven’t drawn up or used any software to derive the design of the overall application; from what I can glean from the code base, there is none. Just lots of singletons, massively nested code sections and God objects.
Lots of the work is done on the GUI thread, which isn’t that bad in the current application design; on a standard desktop it’s very quick, too quick to be an issue. Realistically, though, I would like there to be a lot fewer of these kinds of blocking operations.
Other issues with the code base are:
The type of plan I would propose at this point is putting a good suite of tests in place, a job which I have started by using Code Pro to generate a fair whack of them yesterday. Because I intend on making a final token release of this software for people who may or may not still be using it, I’ll just be making some subtle refactors to the code base and cleaning up some ugly behaviour on the front end. A longer term plan would be to refactor the code into a more client/server architecture and perhaps move towards a more web-based approach. At the very minimum I would like to experiment with JavaFX for the frontend. Sorry Mac peeps, this app’s going to end up looking very different.