{"id":2667,"date":"2015-12-21T06:27:49","date_gmt":"2015-12-20T23:27:49","guid":{"rendered":"http:\/\/dasaptaerwin.net\/wp\/?p=2667"},"modified":"2015-12-21T06:27:49","modified_gmt":"2015-12-20T23:27:49","slug":"mengklasifikasi-mata-air-dengan-r","status":"publish","type":"post","link":"http:\/\/dasaptaerwin.net\/wp\/2015\/12\/mengklasifikasi-mata-air-dengan-r.html","title":{"rendered":"Mengklasifikasi mata air dengan R"},"content":{"rendered":"<p>Project description<\/p>\n<div class=\"container-fluid main-container\">\n<pre><code># Title: PCA of Cisanti Area\r\n# Data: PKM Project in Cisanti Area, Bandung\r\n# Area: Northern Bandung\r\n# Team leader: Arif Susanto\r\n# Code and analysis: Dasapta Erwin Irawan\r\n# Data acq: Aditya Pratama, ..., ... (to be added)\r\n# Software: R\r\n# Package used: pcamethods, cluster, readxl\r\n# Keyword: multivariate statistics, cluster analysis, principal component analysis<\/code><\/pre>\n<p>Dalam blog post ini saya mencoba menceritakan secara singkat teknik mengklasifikasi mata air berdasarkan data kualitas airnya. Software R akan digunakan dalam analisis ini, dengan teknik:<\/p>\n<ol style=\"list-style-type: decimal;\">\n<li>Principal component analysis (PCA)<\/li>\n<li>Cluster analysis (CA)<\/li>\n<\/ol>\n<p>Data set: data set ini berasal dari riset PKM tahun 2015 yang diketuai oleh <a href=\"https:\/\/www.researchgate.net\/profile\/Arif_Susanto3\">Arif Susanto<\/a> dari KK Geologi ITB. Data set kita berukuran 7 x 33 (7 baris dan 33 kolom).<\/p>\n<p>Package yang diperlukan:<\/p>\n<p>Sebenarnya fungsi standar telah ada dalam R, yaitu:<\/p>\n<ol style=\"list-style-type: decimal;\">\n<li>PCA: <code>princomp()<\/code> atau <code>prcomp()<\/code>, gunanya untuk mengekstrak variabel (component) berpengaruh dalam suatu data set dengan jumlah variabel yang sangat banyak. Fungsi ini akan mengelompokkan variabel menjadi lebih ringkas, misal: bila semua kita punya 33 variabel, maka nantinya akan dapat menjadi dua atau tiga kelompok variabel yang disebut PC (principal component)<\/li>\n<li>Cluster: <code>kmeans()<\/code> dan <code>hclust()<\/code>, gunanya untuk menguji kemiripan sampel berdasarkan perhitungan <a href=\"https:\/\/en.wikipedia.org\/wiki\/Euclidean_distance\">Euclidean distance<\/a> dan mengelompokkannya dalam sebuah <a href=\"https:\/\/en.wikipedia.org\/wiki\/Dendrogram\">dendogram<\/a>.<\/li>\n<\/ol>\n<p>Namun demikian dalam kesempatan ini saya akan menggunakan package:<\/p>\n<ol style=\"list-style-type: decimal;\">\n<li><code>pcamethods<\/code> yang ditulis oleh Wolfram Stacklies, Henning Redestig, dan Kevin Wright. <a href=\"http:\/\/www.bioconductor.org\/packages\/\/2.10\/bioc\/html\/pcaMethods.html\">link<\/a><\/li>\n<li><code>cluster<\/code> yang ditulis oleh Friedrich Leisch dan Bettina Gruen <a href=\"http:\/\/cran.cnr.berkeley.edu\/web\/views\/Cluster.html\">link<\/a><\/li>\n<\/ol>\n<p>Tahapannya akan saya jelaskan lebih rinci besok ya per blok <a href=\"https:\/\/goo.gl\/Vw8FwS\">kode<\/a>. Data set juga akan segera tersedia setelah publikasi diterbitkan. Sekarang saya tampilkan saja tiga grafik sebagai hasil utamanya.<\/p>\n<p>Terimakasih sudah berkunjung.<\/p>\n<p>follow <span class=\"citation\">@dasaptaerwin<\/span> (www.twitter.com\/dasaptaerwin)<\/p>\n<\/div>\n<p><script>\/\/ <![CDATA[\n\/\/ add bootstrap table styles to pandoc tables\n$(document).ready(function () {\n  $('tr.header').parent('thead').parent('table').addClass('table table-condensed');\n});\n\/\/ ]]><\/script><\/p>\n<p><!-- dynamically load mathjax for compatibility with self-contained --><br \/>\n<script>\/\/ <![CDATA[\n  (function () {\n    var script = document.createElement(\"script\");\n    script.type = \"text\/javascript\";\n    script.src  = \"https:\/\/cdn.mathjax.org\/mathjax\/latest\/MathJax.js?config=TeX-AMS-MML_HTMLorMML\";\n    document.getElementsByTagName(\"head\")[0].appendChild(script);\n  })();\n\/\/ ]]><\/script><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Project description # Title: PCA of Cisanti Area # Data: PKM Project in Cisanti Area, Bandung # Area: Northern Bandung # Team leader: Arif Susanto&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3,46,31],"tags":[],"class_list":["post-2667","post","type-post","status-publish","format-standard","hentry","category-data-analysis-writing","category-multivariate-analysis","category-research-and-teaching"],"_links":{"self":[{"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/posts\/2667","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/comments?post=2667"}],"version-history":[{"count":1,"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/posts\/2667\/revisions"}],"predecessor-version":[{"id":2671,"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/posts\/2667\/revisions\/2671"}],"wp:attachment":[{"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/media?parent=2667"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/categories?post=2667"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/dasaptaerwin.net\/wp\/wp-json\/wp\/v2\/tags?post=2667"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}