Learning From The Past

One Beyond
July 27th 2014
Posted In: Business, Software Engineering

The history of software development is peppered with fads and fashions: things which seem like inevitable progress at the time but are later discarded as flawed dead ends. Even temporary infatuations leave their marks: legacy software, books, web content, “best practices” and so on. Among all this debris it can often be hard to identify the things which have stood the test of time, even when modern ideas go over the same ground.

As an example, let’s consider a current hot topic close to our hearts here at One Beyond: Microservices.

Fred George’s microservices (read the slides) boil down to two essential rules and two guidelines:

A system must support multiple live versions of every component
Only one thing must change at a time (add or remove one component, reconfigure one service, etc.)
The smaller and more independent the components are, the better
The smaller changes are, and the sooner they are deployed, the better

In traditional application design this is so unusual as to be heretical. Architects keep designing bigger, tightly coupled solutions which need huge amounts of ceremony and require heroic work to stop the whole edifice from catching fire if anything goes wrong (cf. “towering inferno”).

What may surprise, though, is that we all use this kind of microservice system every day: the World Wide Web.

If we view each web page as a “microservice”, the web matches Fred’s concept very closely, particularly the kind of “static” web pages which are stored separately on a server and delivered to web browsers on demand. Such pages present some information to a user and refer to other pages using hyperlinks; sometimes they also gather some information from a user and process it with some in-page JavaScript and/or the assistance of other remote services.

The web has always supported multiple versions of web pages on a server. All it takes is a change to a link URL to include a new page in a “web site”. Deploying a page is as simple as saving a file, and the limitations of what people are willing to read tend to make web pages small and understandable.
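As a rough illustration of that idea, here is a minimal Python sketch. The page names and the `links` table are invented for the example and simply stand in for files on a server and hyperlinks in a site; this is a model of the principle, not of any real web server.

```python
# Both versions of a page stay deployed side by side (hypothetical file names).
pages = {
    "/pricing-v1.html": "<p>Old pricing</p>",
    "/pricing-v2.html": "<p>New pricing</p>",
}

# The "site" is just the set of links that currently point at pages.
links = {"pricing": "/pricing-v1.html"}


def fetch(link_name):
    """Follow a named link and return the page it currently points to."""
    return pages[links[link_name]]


def switch(link_name, url):
    """Re-point one link at another deployed page; nothing else changes."""
    if url not in pages:
        raise KeyError(f"no such page: {url}")
    links[link_name] = url
```

Calling `switch("pricing", "/pricing-v2.html")` changes what visitors see, while the old page remains available to anything that still links to it directly.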

So why is it that web “applications” are often so clumsy and fragile to develop, test, deploy and understand? Why have these same web apps become a classic example of the problems Fred is trying to address?

In some ways the answer lies in the history of web development. The very first web servers could do nothing but serve pages from files. There was no JavaScript, no “server-side” software, only links to other pages. For dissemination and browsing of pre-written information this was fine, but people soon wanted more. For web sites with lots of information it became very labour-intensive to manually mark up everything in HTML, and practically unworkable for any information which changed faster than the time and skills available to write the pages. This became even more of a bottleneck when web pages started gaining style as well as raw information. Changing a logo or header image across many web pages became a fiddly slog, so the people creating these web pages looked for ways to make things easier.

The earliest dynamic pages were built to address these issues, and relied on two basic technologies: “server-side includes” (SSI) and the “Common Gateway Interface” (CGI). SSI addressed the issue of common headers on multiple pages by supporting what have now become known as “partials” (page fragments re-used in multiple places). CGI was more far-reaching in that it allowed, for the first time, web pages to be generated as they were requested, by running a script. A “CGI script” has a very simple interface: the web server sets some environment variables representing the HTTP request headers, passes the request body as the input to the script, and returns the output of the script to the client.
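To make that interface concrete, here is a minimal sketch of a CGI-style script in Python. The `name` query parameter and the `build_response` helper are invented for illustration; `REQUEST_METHOD`, `QUERY_STRING` and `CONTENT_LENGTH` are standard CGI environment variables.

```python
#!/usr/bin/env python3
"""Minimal CGI-style script: environment in, stdin in, stdout out."""
import os
import sys
from html import escape
from urllib.parse import parse_qs


def build_response(environ, body):
    """Build a complete CGI response (headers, blank line, HTML) as a string."""
    # The server passes request metadata through environment variables.
    method = environ.get("REQUEST_METHOD", "GET")
    query = parse_qs(environ.get("QUERY_STRING", ""))
    # "name" is a hypothetical query parameter used only for illustration.
    name = query.get("name", ["world"])[0]
    html = (
        "<html><body>"
        f"<p>Hello, {escape(name)}!</p>"
        f"<p>Method: {escape(method)}, body length: {len(body)}</p>"
        "</body></html>"
    )
    # A CGI response is header lines, a blank line, then the document.
    return "Content-Type: text/html\r\n\r\n" + html


def main():
    # The request body, if any, arrives on standard input.
    length = int(os.environ.get("CONTENT_LENGTH") or 0)
    body = sys.stdin.buffer.read(length) if length else b""
    sys.stdout.write(build_response(os.environ, body))


if __name__ == "__main__":
    main()
```

Nothing in the script knows about the web server that runs it, which is why the same job could just as easily be done by a shell script, a Perl program or a compiled binary.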

CGI was the engine which powered the early days of the dynamic web, and there are still many web sites which rely on this venerable technology. Just like the original static web pages, CGI scripts fit pretty well in the context of Fred’s microservices. A good CGI script does one job (building a single web page) and can be substituted simply by changing a link URL. An application consisting of several static web pages, perhaps with a bit of SSI for common sections, and some CGI scripts to do the hard work has all the characteristics of a microservices deployment.

As an aside it is important to talk about skills. One of the key emergent characteristics of a microservices architecture is that services can be developed using whatever technologies and skills are available and suitable at the time. As long as a service can handle its job, it is unimportant how it is implemented. In turn, if an implementation decision is later seen as inappropriate, the service can be re-implemented without impact on the greater system. This is hugely important to the practical building and maintenance of such systems. When extra development is required, extra people can be brought on to the team and be useful immediately, with whatever skills they already possess. Development (and re-development) can proceed on many services at once, without requiring complex documentation, training, release processes, or meetings.

Up to this point in the history of the web, this technology independence and freedom still held. But the clouds were looming.

The main problem with CGI as a web technology was held to be one of performance. As web pages became more complex, requiring more information from more diverse sources, typical CGI implementations began to feel the strain. CGI-based web application software might need to make several requests to one or more remote databases for every page, as well as running whatever code was required to build the HTML and text on the page. Database access was a particular problem; the independent, stateless nature of CGI scripts meant that a new database connection had to be opened and closed for every page. This began to be the major limit on the number of pages which could be served.

To address this problem, servers were built which, instead of starting a whole new process to run a CGI script for every page access, started a single long-running process which could hold things such as database connections and popular data in memory for much faster access. Early examples include Apache modules and Java servlets. At a stroke this massively improved the performance of the dynamic web but at considerable, and often overlooked, expense in development.
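The difference between the two styles can be sketched in a few lines of Python. This is an illustration only: an in-memory SQLite database stands in for a remote database, and the `connect` helper, the `stats` counter and the `LongRunningServer` class are names invented for the example.

```python
import sqlite3

# Counts how many database connections have been opened (illustration only).
stats = {"opened": 0}


def connect():
    """Open a new connection; in-memory SQLite stands in for a remote database."""
    stats["opened"] += 1
    return sqlite3.connect(":memory:")


def cgi_style_request(n):
    """CGI style: every request opens and closes its own connection."""
    conn = connect()
    try:
        return conn.execute("SELECT ? * 2", (n,)).fetchone()[0]
    finally:
        conn.close()


class LongRunningServer:
    """Long-running style: one connection is opened at start-up and reused."""

    def __init__(self):
        self.conn = connect()

    def handle(self, n):
        return self.conn.execute("SELECT ? * 2", (n,)).fetchone()[0]
```

Five calls to `cgi_style_request` open five connections, while five calls to `handle` on one `LongRunningServer` share a single connection. The price is that every handler now lives inside, and depends on, the server process.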

No longer could a script be implemented and re-implemented at will. Instead it had to be compatible with the containing server, which in turn probably meant a very much reduced choice of languages, tools, and frameworks.
No longer could a CGI script be substituted or upgraded whenever required. Instead it required changes to the configuration of the server, which in turn almost always implied a server restart.
No longer was each script responsible for a single job. Code began to be shared, and changes to one component could have unexpected knock-on effects on many others.

The development of web components was now locked into a larger application, with rules about when the server could be restarted to deploy changes, and with specific skills needed. This in turn both decreased the ease and speed of development, and increased the difficulty of finding and training more developers.

The response of the software industry to this problem has been diverse, but mostly concentrated on attempting to hide or abstract “common” or “difficult” aspects of a system into frameworks and libraries. This has the short term benefit that suitable applications might need a little less code, but at the long-term cost of even more stuff to add to the job spec, more to go wrong and be misunderstood, and the increasingly worrying possibility of discovering that a chosen framework, library, language, or server is no longer cost-effective for the needs of your particular project. Frameworks in particular can act like magnets, pulling at application code and distorting the natural separation of responsibilities until every change involves the framework.

This situation has become so normal now that it is hardly ever challenged. Job advertisements for web development specify a baroque assortment of skills and experience, sometimes down to specific versions of specific languages or frameworks; and project managers the world over complain about both the quality of staff and the pace of development. Deployment of web applications is routinely late, requires huge amounts of testing, and still frustrates and burns out development teams.

Starting a web development project has now become a matter of placing large bets on unsubstantiated guesses on the suitability and productivity of a collection of third-party software. Even people with experience of particular technologies are rarely in a position to know for certain how things will turn out, as no two business needs are the same.

If we want to improve this situation we need to learn from the past, and discard some commonly held assumptions about software development.

A framework based on a solution to someone else’s problems, however clever and comprehensive it may seem, is never as useful as you expect.
To speed up development you need systems split into genuinely independent chunks, even at the cost of some duplication, so that multiple people can work without impeding each other.
Finding and hiring productive software developers is much easier if the project is less prescriptive about technologies, so leave tool choices and “standards” as late as you can and avoid “lock-in” wherever possible.
And finally for now, remember that you probably do not need the complex, unwieldy, and expensive solution which might be suggested by “best practices”. Look for simplicity and don’t be afraid of solving your own problems in your own way.

If you keep an eye on these suggestions, you may find that building your own can work out much cheaper than buying in over the lifetime of a typical software system.
