Continuous Integration using Docker

Sat 19 March 2016
Original article: "Continuous Integration Platform Using Docker Containers: Jenkins, SonarQube, Nexus, GitLab". Continuous Integration is a software development practice where members of a team integrate their work frequently; usually each person integrates at least daily, leading to multiple integrations per day. Each integration is verified by an automated build (including ...

Practices of Continuous Integration

Sat 19 March 2016
Original article: "Continuous Integration Platform Using Docker Containers: Jenkins, SonarQube, Nexus, GitLab". Maintain a Single Source Repository: Everything you need to build your source code and run your software should be kept in a version control system (VCS). Automate the Build: A central impartial instance that takes care of compiling ...

Docker aliases/functions

Mon 14 March 2016
The content of the .bashrc file:
# ----------------------------------------------------------------------
# Docker aliases/functions
#
# list the d-* helper functions defined in this file
d-help() { declare -F | grep d- | cut -d " " -f 3; }
# remove exited containers
d-rm-ec() {
    EXITED_CONTAINERS=$(docker ps -a | grep Exited | cut -d " " -f 1)
    # quote the variable so the test also works when several IDs are returned
    if [ -z "$EXITED_CONTAINERS" ]; then
        echo "No exited containers"
    else
        # left unquoted on purpose: each container ID must be a separate argument
        docker rm $EXITED_CONTAINERS
    fi
}
# remove none images
d-rm-ni() { NONE_IMAGES=$(docker ...

Object Storages

Thu 10 March 2016
An object is defined as data (typically a file) along with all its metadata, all bundled up as an object. This object is given an ID that is typically calculated from the content of that object (both file and metadata) itself. An object is always retrieved by an application by ...

Apache HBase Introduction

Thu 10 March 2016
Original article: https://learnhbase.wordpress.com/ Use Apache HBase when you need random, realtime read/write access to your Big Data. This project’s goal is the hosting of very large tables — billions of rows X millions of columns — atop clusters of commodity hardware. Apache HBase is an open-source, distributed ...

Oozie Datasets

Thu 10 March 2016
Datasets. Lecture notes, Apache Oozie Essentials. A Dataset is a collection of data, which is identified by some logical name. For example, the press release can be defined as follows: {nameNode}/learn_oozie/ch04/input/pressrelease/${YEAR ...

AVRO-1124: RESTful service for holding schemas (re-post)

Mon 16 November 2015
Original post: "AVRO-1124: RESTful service for holding schemas". Motivation: It is nice to be able to pass around data in serialized form but still know the exact schema that was used to serialize it. The overhead of storing the schema with each record is too high unless the individual records ...

Java samples

Sun 08 November 2015
brettryan/io-recurse-tests: Create some benchmarking tests for variations of recursively listing all I/O files.

Java libs

Sat 07 November 2015
Command line interface: args4j is a small Java class library that makes it easy to parse command line options/arguments in your CUI application.

Twelve Principles of Agile Software

Sat 07 November 2015
Principles behind the Agile Manifesto. We follow these principles: Our highest priority is to satisfy the customer through early and continuous delivery of valuable software. Welcome changing requirements, even late in development. Agile processes harness change for the customer's competitive advantage. Deliver working software frequently, from a couple of ...