Continuous Integration using Docker
Sat 19 March 2016Original article: Continuous Integration Platform Using Docker Containers: Jenkins, SonarQube, Nexus, GitLab Continuous Integration is a software development practice where members of a team integrate their work frequently, usually each person integrates at least daily - leading to multiple integrations per day. Each integration is verified by an automated build (including ...
Practices of Continuous Integration
Sat 19 March 2016Original article: Continuous Integration Platform Using Docker Containers: Jenkins, SonarQube, Nexus, GitLab Maintain a Single Source Repository Everything you need to build your source code and run your software should be kept in a version control system (VCS). Automate the Build A central impartial instance that takes care of compiling ...
Docker aliases/functions
Mon 14 March 2016The content of .bashrc file: # ---------------------------------------------------------------------- # Docker aliases/functions # d-help() { declare -F | grep d- | cut -d " " -f 3; } # remove exited containers d-rm-ec() { EXITED_CONTAINERS=$(docker ps -a | grep Exited | cut -d " " -f 1;); if [ -z $EXITED_CONTAINERS ]; then echo "No exited containers"; else docker rm $EXITED_CONTAINERS; fi } # remove none images d-rm-ni() { NONE_IMAGES=$(docker ...
Object Storages
Thu 10 March 2016An object is defined as data (typically a file) along with all its metadata, all bundled up as an object. This object is given an ID that is typically calculated from the content of that object (both file and metadata) itself. An object is always retrieved by an application by ...
Apache HBase Intoduction
Thu 10 March 2016Original article: https://learnhbase.wordpress.com/ Use Apache HBase when you need random, realtime read/write access to your Big Data. This project’s goal is the hosting of very large tables — billions of rows X millions of columns — atop clusters of commodity hardware. Apache HBase is an open-source, distributed ...
Oozie Datasets
Thu 10 March 2016Datasets Lecture notes, Apache Oozie Essentials A Dataset is a collection of data, which is identified by some logical name. For example, the press release can be defined as follows:
AVRO-1124: RESTful service for holding schemas (re-post)
Mon 16 November 2015Original post: AVRO-1124 RESTful service for holding schemas Motivation It is nice to be able to pass around data in serialized form but still know the exact schema that was used to serialize it. The overhead of storing the schema with each record is too high unless the individual records ...
Java samples
Sun 08 November 2015brettryan/io-recurse-tests Create some benchmarking tests for variations of recursively listing all I/O files.
Java libs
Sat 07 November 2015Command line interface args4j is a small Java class library that makes it easy to parse command line options/arguments in your CUI application.
Twelve Principles of Agile Software
Sat 07 November 2015Principles behind the Agile Manifesto We follow these principles: Our highest priority is to satisfy the customer through early and continuous delivery of valuable software. Welcome changing requirements, even late in development. Agile processes harness change for the customer's competitive advantage. Deliver working software frequently, from a couple of ...