http://databricks.com/spark-training-resources
Spark Workshops https://www.sics.se/~amir/ ,搜索 sparkhttp://databricks.gitbooks.io/databricks-spark-reference-applications/content/https://github.com/apache/spark/tree/master/examples/src/main/pythonlitaotao 的一系列 spark 文章 (python+spark)http://litaotao.github.io/spark-dataframe-introductionlitaotao 也整理了一系列spark的学习资源http://litaotao.github.io/spark-resouces-blogs-paperSpark修炼之道(进阶篇)--Spark入门到精通http://blog.csdn.net/lovehuangjiaju/article/details/48580863spark 编程向导--书籍http://endymecy.gitbooks.io/spark-programming-guide-zh-cn/content/deploying/running-spark-on-yarn.htmlhttp://taoistwar.gitbooks.io/spark-operationand-maintenance-management/content/以WordCount为例, 讲解Spark内核作业调度机制http://www.cnblogs.com/yoyaprogrammer/p/dive_into_wordcount_1.htmlSpark(四) -- Spark工作机制 http://blog.csdn.net/qq1010885678/article/details/45728173====================部署专题====================CDH5 集群中 Spark 集群模式的安装过程配置过程http://blog.javachen.com/2014/07/01/spark-install-and-usage.htmlSpinning up an Apache Spark Cluster: Step-by-Step http://blog.insightdatalabs.com/spark-cluster-step-by-step/ Hadoop+Spark+Hbase部署整合篇 http://blog.csdn.net/qq1010885678/article/details/46673079Spark On Yarn & Spark as a Service & Spark On Tachyon http://blog.csdn.net/qq1010885678/article/details/46242143Windows平台下安装Hadoophttp://www.cnblogs.com/kinglau/p/3270160.htmlIntroduction to Spark for .NET Developershttps://msdn.microsoft.com/en-us/magazine/mt595756.aspx Spark 部署https://docs.qingcloud.com/guide/spark.htmlSpark集群安装和使用 - JavaChen Blog http://blog.javachen.com/2014/07/01/spark-install-and-usage.html====================调优====================一个实际PySpark项目性能调优http://flykobe.com/index.php/2015/06/01/pyspark-spark-tuning/美团点评的 Spark性能优化指南http://tech.meituan.com/spark-tuning-pro.html====================pyspark 专题====================Spark Python API函数学习:http://www.iteblog.com/archives/1395pyspark文章, https://districtdatalabs.silvrback.com/getting-started-with-spark-in-python, 中文翻译: http://blog.jobbole.com/86232/示例代码库 https://github.com/DistrictDataLabs/spark-workshop/ ,非常棒!Using Jupyter on Apache Spark: Step-by-Step with a Terabyte of Reddit Datahttp://blog.insightdatalabs.com/jupyter-on-apache-spark-step-by-step/How To Write Spark Applications in Pythonhttp://blog.appliedinformaticsinc.com/how-to-write-spark-applications-in-python/spark streaming+kafka, 另外还有spark-submit如何传入依赖的python package和jar包. http://rustyrazorblade.com/2015/05/spark-streaming-with-python-and-kafka/http://www.csdn.net/article/2014-01-28/2818282-Spark-Streaming-big-data【Spark1.3官方翻译】 Spark Submit提交应用程序http://blog.csdn.net/mycafe_/article/details/44923265 Spark 入门(Python、Scala 版)http://my.oschina.net/leejun2005/blog/411605Spark编程指南--Python版http://www.csdn.net/article/2015-04-24/2824552pyspark与spark的集成方式http://flykobe.com/index.php/2015/04/18/pyspark-and-spark/开发模式下, 如何方便解决jar的依赖, 或者直接将jar加到SPARK_CLASSPATH中, 参见compute-classpath.shhttp://zhangyi.farbox.com/post/wen-ti-jie-jue/solve-spark-issue-of-all-masters-are-unresponsivehttp://blog.csdn.net/qq1010885678/article/details/46052055====================SQL 专题====================平易近人、兼容并蓄--Spark SQL 1.3.0概览http://www.csdn.net/article/2015-04-03/2824407Spark ETL Techniques (包括python/scala的优劣对比)http://www.slideshare.net/DonDrake/presentations基于spark1.3.1的spark-sql实战-01 http://blog.csdn.net/stark_summer/article/details/45825177Spark SQL 1.3测试http://www.cnblogs.com/kxdblog/p/4488991.htmlSpark SQL 之 Data Sourceshttp://www.cnblogs.com/BYRans/p/5005342.html有几个spark 和RDMS交互的文章http://www.sparkexpert.com/category/etl/Spark-1.3.1与Hive整合实现查询分析http://shiyanjun.cn/archives/1113.html瞌睡中的葡萄虎的cnblogs, 包含很多Spark SQL文章http://www.cnblogs.com/luogankun/Spark RDD写入RMDB(Mysql)方法二http://www.iteblog.com/archives/1290Spark读取MySQL的方法http://www.iteblog.com/archives/1275Spark SQL整合PostgreSQLhttp://www.iteblog.com/archives/1369Spark-1.3.1与Hive整合实现查询分析http://shiyanjun.cn/archives/1113.htmlspark sql 访问postgresqlhttp://zhangyi.farbox.com/post/access-postgresql-based-on-spark-sql?utm_source=tuicool====================ML 专题====================一号店的段石石同学的Machine Learning With Spark的几个notebookhttp://hacker.duanshishi.com/?p=1282几篇spark机器学习的文章http://blog.selfup.cn/tag/sparkhttps://www.codementor.io/spark/tutorial/building-a-recommender-with-apache-spark-python-example-app-part1end-to-end tutorial for a recommendation engine using PySparkhttp://tech.marksblogg.com/recommendation-engine-spark-python.html====================Hadoop 相关====================Hadoop学习笔记-20.网站日志分析项目案例(一)项目介绍http://www.cnblogs.com/edisonchou/p/4449082.htmlHadoop学习笔记-2.不怕故障的海量存储:HDFS基础入门 http://www.cnblogs.com/edisonchou/p/3538524.htmlhadoop 2.0 详细配置教程http://www.cnblogs.com/scotoma/archive/2012/09/18/2689902.htmlhdfs指令:http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-common/FileSystemShell.htmlhadoop2.6.0】安装+例子运行http://www.cnblogs.com/dplearning/p/4145209.html