Building Zeppelin in windows 8 Pre - Requirements java 1.7 maven 3.2.x or 3.3.x nodejs cywin Here is my version in windows8 (64 bit) Incubator-zeppelin is build success. Few issues you can face with windows ERROR 01 [ERROR] Failed to execute goal com.github.eirslett:frontend-maven-plugin:0.0.23:bower (bower install) on project zeppelin-web: Failed to run task: 'bower --allow-root install' failed. (error code 1) -> [Help 1] you can find ...

Zeppelin Note for load data and Analyzing Previous post is introduction for zeppelin notebook. Here we will more more detail view where it will used for researcher. Using shell interpreter we can download / retrieve data sets / files from remote server or internet. Then using Scala in Spark to make class from that data and then used SQL to play with the data. You can analysis ...

Zeppelin NoteBook Here is my previous post to build zeppelin from source. This post will take you a tour on “notebook feature of zeppelin”. NoteBook contain with note. Note will have paragraphs. 1. Start you zeppelin by entering /incubator-zeppelin $ ./bin/zeppelin-daemon.sh start 2. Go to localhost:8080 and click on ‘NoteBook’ in top menu. Then click on ‘Create new note’. Now you will ...

Data Binding in Angular Data binding is the process that establishes a connection between the application UI (User Interface) and model/business logic. In JavaScript world we used 'Backbone.js', 'KnockoutJS', 'BindingJS' and 'AngularJS'. This post will go through over the data binding in Angular. Traditional Data Binding System Most web frameworks focus on one-way data binding and classical template systems are only one direction. they ...

AngularJS, is an open-source web application framework maintained by Google and a community of individual developers. It address many of the challenges encountered in developing single-page applications. Angular [1] is built around the belief that declarative programming should be used for building user interfaces and connecting software components, while imperative programming is better suited to defining an application's business logic. ...

React.JS and Virtual DOM React is a open source UI library developed at Facebook to facilitate the creation of interactive and reusable UI components. It is not only does it perform on the client side, but it can also be rendered server side, and they can work together inter-operably. React has pluggable back-ends so it can be used to target the DOM, HTML, canvas, ...

Density-based clustering algorithm (DBSAN) and Implementation Density-based spatial clustering of applications with noise (DBSCAN)[1] is a density-based clustering algorithm. It gives a set of points in some space, it groups together points that are closely packed together (points with many nearby neighbors), marking as outliers points that lie alone in low-density regions. In 2014, the algorithm was awarded the test of time award at the leading ...

scikit-learn to generate isotropic Gaussian blobs Scikit-learn is an open source machine learning library for the Python programming language. It features various classification, regression and clustering algorithms ,support vector machines, logistic regression, naive Bayes, random forests, gradient boosting, k-means DBSCAN, Decision Trees, Gaussian Process for ML, Manifold learning, Gaussian Mixture Models, Model Selection, Nearest Neighbors, Semi Supervised Classification, Feature Selection etc. I was working on them ...

CouchDB 2.0 (Developer Preview) with HTTP APIs Introduction The Apache CouchDB project had announced a Developer Preview release of its CouchDB 2.0. The Developer Preview 2.0 brings all-new clustering technology to the Open Source NoSQL database, enabling a range of big data capabilities that include being able to store, replicate, sync, and process large amounts of data distributed across individual servers, data centers, and geographical regions in ...

CouchDB-fauxton introduction Here we will be using CouchDB (developer-preview 2.0). You can build couchDB with preview guide line in here. https://couchdb.apache.org/developer-preview/2.0/ After built is success, you can start couchDB from 'dev/run' Above command starts a three node cluster on the ports 15984, 25984 and 35984. Front ports are 15986, 25986 and 35986. Using front port to check the nodes by http://localhost:15986/nodes Then ...

Building Apache Zeppelin Apache Zeppelin (incubator) is a collaborative data analytics and visualization tool for Apache Spark, Apache Flink. It is web-based tool for the data scientists to collaborate over large-scale data exploration. Zeppelin is independent of the execution framework. Zeppelin integrated full support of Apache Spark so I will try sample with spark in it's self. Zeppelin interpreter allows any language/data-processing-backend to ...

CouchDB with Fauxton in windows 8 This post mainly about installing and running ‘Fauxton’ in windows environment. Fauxton is the new Web UI for CouchDB. For this post I will be using windows 8 (64bit). Prerequisite for Fauxton 1. nodejs (Download from here) 2. npm (now npm comes with node) 3. CouchDB (Installation from binaries or sources. I will have post on installing couch DB from ...

Installing Flask in Windows8 Flask is a lightweight web application framework written in Python. Flask depends on two external libraries, Werkzeug and Jinja2. Werkzeug is a toolkit for Web Server Gateway Interface (WSGI) Jinja2 renders templates Flask is called as microframework because it keeps the core simple. It has no database abstraction layer, supports extensions (object-relational mappers, form validation, open authentication technologies). It have ...

Basic Functionality of Series or DataFrame in Pandas Throughout this post I will take you over the fundamental mechanics of interacting with the data contained in a Series or DataFrame in pandas(python). Reindexing Reindexing is a critical method on pandas objects. 'Reindexing' means to create a new object with the data conformed to a new index. Here is my object that I will be using for this post ...

Pandas for Data Manipulation and Analysis Pandas is a software library written for the Python programming language for data manipulation and analysis. In many organizations, it is common to research, prototype, and test new ideas usinga more domain-specific computing language like MATLAB or R then later port those ideas to be part of a larger production system written in, say, Java, C#, or C++. Whatpeople ...

Python For Beginners Python is an interpreted dynamically typed Language with very straightforward syntax. Python comes in two basic versions one in 2.x and 3.x and Python 2 and 3 are quite different. This post is bais for Python 2.x In Python 2, the "print" is keyword. In Python 3 "print" is a function, and must be invoked with parentheses. There are no ...

Python with CSV CSV (Comma Separated Values) format is the most common format in computer world (export and import). In python 'csv module' implements classes to read and write tabular data in CSV format without knowing the precise details of the CSV format used by Excel. Here it is reading csv file and filtering data in it. Create new CSV file and moved ...

Maven 3.3.x for Mint 1.Open the terminal and download the 'apache-maven-3.3.1-bin.zip'wget http://mirrors.sonic.net/apache/maven/maven-3/3.3.1/binaries/apache-maven-3.3.1-bin.zip 2. Unpack the binary distributionunzip apache-maven-3.3.1-bin.zip 3. Move the apache maven directory to /usr/localsudo cp -R apache-maven-3.3.1 /usr/local/ 4. Adding PATH and MAVEN_HOMEgedit .bashrc OR vi .bashrc Then add two of them as belowexport PATH="/usr/local/apache-maven-3.3.1/bin:/opt/java/jdk1.8.0_40/bin:$PATH"export MAVEN_HOME="/usr/local/apa

Predictive modeling is the process by which a model is created or chosen to try to best predict the probability of an outcome. Most often it wants to predict is in the future or unknown event. The model is chosen on the basis of detection theory. Models can use one or more classifiers. Models There are three predictive models Parametric ...

Gaussian function Gaussian function is a function of the below form a, b, c and d are arbitrary real constants. The graph of a Gaussian is a characteristic symmetric "bell curve" shape that quickly falls off. The parameters a is the height of the curve's peak b is the position of the center of the peak c (the standard deviation, sometimes called ...

Previous Page