The Granularity Of A Microservice

One Beyond
July 20th 2014
Posted In: Software Engineering, Understanding new technology

The most frequent question I am asked about microservices is how small should they be. This is the wrong question. It’s not the lines of code that matter, but whether the service does one thing and does it well. A service that does one thing well can be developed quickly and is unlikely to change. If and when its purpose or technology becomes stale it can be easily removed or replaced. This is one of the key benefits of a micro-service architecture.

However just as low test coverage and broken tests may warn you of an unloved codebase, a high number of lines of code can indicate that a service has more than one job. So despite being the wrong question, it’s still useful to have an idea of the answer. Fred George, the father of micro-services suggests between 50 and 100 lines of code. I met Fred in 2012, and after going through his incredible two week bootcamp (which involved writing a relatively sophisticated application in just 22 lines of ruby), I was keen to try out micro-services in a real-world setting. Enter Campaign Manager, an application for deciding which ads to show on a 500 page/second web site.

Campaign Manager was a disjointed application mostly written in PL/SQL, with additional logic spread across multiple JSP pages and configuration beans. Its purpose was to control which ad slots were enabled by channel, sub-channel and page type. The replacement system had to do all this, but in addition consider region, device and orientation. It also had to incorporate a scheduling component and automatically activate / deactivate prominent slots when the commercial team’s ad platform (DFP) had appropriate inventory. Finally some slots had to be mutually exclusive in some regions, so for example a Billboard and an MPU Top could not both be active on the same page in the UK.

My approach was to develop a set of micro-services, whose design would be guided by the tenets of Unix Philosophy, with special attention given to doing one thing well. All but one service was written in JavaScript and deployed to nodejs, the remaining service was written in Groovy because it integrated with DFP via a java library. The services and their respective line counts were as follows:-

Service	Purpose	Lines
cm-slot-config	Served the ad slot configuration for each page request.	96
cm-slot-admin	Provided the slot management API.	154
cm-web-ui	User interface for managing slot configuration and admin operations (mostly client-side js).	1168
cm-dfp-extract	Synchronised Campaign Manager’s slot configuration with the line items in DFP.	419
cm-inventory-check	Monitored the real-time analytics feed and temporarily deactivated prominent slots if DFP was throttling adverts.	134
cm-status	Provided a REST endpoint for displaying the status of each service.	121
cm-heartbeat	Published a heartbeat message onto the ESB so we could quickly diagnose connectivity problems.	21
cm-compact-indexes	Regularly compacted some Redis indexes for keys deleted using ttl expiry.	35
cm-redis-logger	Persisted Campaign manager events published to the ESB and provided a RESTful API for retrieving them.	78
cm-riemann-logger	Forwarded Campaign manager events published to the ESB on to Riemann.	71
cm-smoke-test	Continuously created and verified a fake set of configuration (only visible from Bouvet Island).	83

Of the services in the list there are two which breach Fred’s guideline of 50-100 line by a considerable amount, cm-web-ui (1118 lines) and cm-dfp-extract (419 lines). cm-web-ui broke the scales because we deliberately consolidated all of the web-ui functionality into a single service. We did this because sharing state and managing static assets hosted by different services is problematic. In the case of cm-dfp-extract, the DFP Java API is ugly, and the code interacting with it incurred some splash damage.

Just as interesting are the services which fall below Fred’s lower limit of 50 lines (cm-heartbeat and cm-compact-indexes). Do these services justify their own existence? Retrospectively I think not. They fail a second micro-service architecture guideline, they have no conclusions that are worth publishing. cm-compact-indexes should have been moved into cm-slot-admin and cm-heartbeat was superseded by cm-smoke-test and so should have been deleted.

Managing a small battery of micro-services is not without its difficulties, so it’s of no surprise that the second most frequent question I get asked is how to do this. In the spirit of doing one thing well, I’ll leave my answer for another blog post. Instead I’d like to round things off with a game of Devil’s advocate and challenge whether Fred’s 50-100 lines is too small. How would things have panned out if I’d have followed a more conventional architecture?.

Firstly I would still have split the Campaign Manager into distinct frontend and backend components. The frontend component needed to be horizontally scalable and it would have been a security risk to host the backend component on a publicly accessible server.

Service	Purpose	Lines
cm-slot-frontend	Served the ad slot configuration for each page request.	96
cm-slot-backend	Everything else	2234

With only two components, build and deployment would undoubtedly have been easier. Configuration would have been simpler too, since the application would interact via function calls instead of an ESB or remote API. However it would have been harder for a team of developers to work independently and releases would have been more risky – with fine grained services we never had to worry about accidentally releasing unfinished code or adopting strategies such as branching. With no fear of releasing to production, we delivered features as soon as they were ready, often multiple times per day.

I also think the resulting codebase would have been worse. A total of seven developers worked part-time on Campaign Manager over a period of eight months, but we never had more than four developers at any one time. The first release went live after three months and I rotated to a different project after four. I returned three months later to find that several new features had been implemented. Not everybody designs or writes software in the same way, and no one way is objectively best, however some of the changes were overly complicated and one feature in particular, a resource hungry analytics tool, was completely unnecessary. Because of the micro-service architecture we were able to systematically refactor service by service and I was able to remove the entire analytics service with one command.

In contrast single codebase applications have no physical boundaries. It’s far easier for poor code to spread ivy-like across it, and just like ivy, poor code can be surprisingly hard to remove. In an ideal world, organisations would only hire the best employees, who never made mistakes, who never needed to learn new tools on the job. In the real world micro-services can at least provide a degree of damage limitation and a controlled path back to sanity when things do go astray.

Cookie	Duration	Description
__hssrc	session	This cookie is set by Hubspot whenever it changes the session cookie. The __hssrc cookie set to 1 indicates that the user has restarted the browser, and if the cookie does not exist, it is assumed to be a new session.
AWSALBCORS	7 days	This cookie is managed by Amazon Web Services and is used for load balancing.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
__hssc	30 minutes	HubSpot sets this cookie to keep track of sessions and to determine if HubSpot should increment the session number and timestamps in the __hstc cookie.
aqcamp	1 month	This cookie is used to customize the application behavior to user preferences.
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	2 years	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
messagesUtk	1 year 24 days	HubSpot sets this cookie to recognize visitors who chat via the chatflows tool.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.

Cookie	Duration	Description
_gaexp	1 month 10 days 11 hours	Google Analytics installs this cookie to determine a user's inclusion in an experiment and the expiry of experiments a user has been included in.
_uetsid	1 day	Bing Ads sets this cookie to engage with a user that has previously visited the website.
_uetvid	1 year 24 days	Bing Ads sets this cookie to engage with a user that has previously visited the website.
ADRUM_BTa	past	This cookie is used to optimize the visitor experience on the website by detecting errors on the website and share the information to support staff.
AWSALB	7 days	AWSALB is an application load balancer cookie set by Amazon Web Services to map the session to the target.

Cookie	Duration	Description
__hstc	1 year 24 days	This is the main cookie set by Hubspot, for tracking visitors. It contains the domain, initial timestamp (first visit), last timestamp (last visit), current timestamp (this visit), and session number (increments for each subsequent session).
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_CNWWV4VG3L	2 years	This cookie is installed by Google Analytics.
_gat_UA-3669062-1	1 minute	A variation of the _gat cookie set by Google Analytics and Google Tag Manager to allow website owners to track visitor behaviour and measure site performance. The pattern element in the name contains the unique identity number of the account or website it relates to.
_gcl_au	3 months	Provided by Google Tag Manager to experiment advertisement efficiency of websites using their services.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_hjAbsoluteSessionInProgress	30 minutes	Hotjar sets this cookie to detect the first pageview session of a user. This is a True/False flag set by the cookie.
_hjFirstSeen	30 minutes	Hotjar sets this cookie to identify a new user’s first session. It stores a true/false value, indicating whether it was the first time Hotjar saw this user.
_hjIncludedInPageviewSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's pageview limit.
_hjIncludedInSessionSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's daily session limit.
_hjTLDTest	session	To determine the most generic cookie path that has to be used instead of the page hostname, Hotjar sets the _hjTLDTest cookie to store different URL substring alternatives until it fails.
ajs_anonymous_id	20 years	This cookie is set by Segment to count the number of people who visit a certain site by tracking if they have visited before.
ajs_user_id	never	This cookie is set by Segment to help track visitor usage, events, target marketing, and also measure application performance and stability.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
hubspotutk	1 year 24 days	HubSpot sets this cookie to keep track of the visitors to the website. This cookie is passed to HubSpot on form submission and used when deduplicating contacts.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
_opt_expid	past	Set by Google Analytics, this cookie is created when running a redirect experiment. It stores the experiment ID, the variant ID and the referrer to the page that is being redirected.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
MUID	1 year 24 days	Bing sets this cookie to recognize unique web browsers visiting Microsoft sites. This cookie is used for advertising, site analytics, and other operations.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

The Granularity Of A Microservice

Refactoring: When and Why Should You Do It

Learning From The Past

Cookie	Duration	Description
_ce.cch	session	No description
_ce.gtld	session	No description
_ce.s	1 year	No description
_dc_gtm_UA-3669062-1	1 minute	No description
_gaexp_rc	past	No description available.
_hjSession_1933882	30 minutes	No description
_hjSessionUser_1933882	1 year	No description
_obid	1 year	No description
adzab_all	1 month	No description
adzuna_epoch	1 year	No description
adzuna_in_your_inbox	1 month	No description
adzuna_session_ads	1 hour 30 minutes	No description
alr	30 minutes	No description
AnalyticsSyncHistory	1 month	No description
aqcamplast	session	No description available.
asst	30 minutes	No description available.
cass	2 hours	No description available.
cebs	session	No description
cebsp	session	No description
cookietest	session	No description
dcid2	2 years	No description
gdId	10 years	No description
gdsid	6 hours	No description
GSESSIONID	2 hours	No description available.
li_gc	2 years	No description
SameSite	past	No description available.
session	session	No description available.
trs	1 year	No description available.

The Granularity Of A Microservice

Are you looking for a bespoke software solution for your business?

Refactoring: When and Why Should You Do It

Learning From The Past