Big Data and Cloud

  1. aws-cli
    Universal Command Line Interface for Amazon Web Services
    Project Source: https://github.com/aws/aws-cli

  2. boto
    Boto is a Python package that provides interfaces to Amazon Web Services.
    Project Source: https://github.com/boto/boto
    Project Documentation: http://docs.pythonboto.org/en/latest/

  3. dpark
    DPark is a Python clone of Spark, a MapReduce-like computing framework that supports iterative computation.
    Project Source: https://github.com/douban/dpark
    Project Documentation (Chinese): https://github.com/jackfengji/test_pro/wiki

  4. dumbo
    Python module that allows one to easily write and run Hadoop programs.
    Project Source: https://github.com/klbostee/dumbo
    Project Homepage: http://klbostee.github.io/dumbo/

  5. Fabulous
    Deploy Django apps to Amazon EC2 with one command.
    Project Source: https://github.com/gcollazo/Fabulous
    Project Homepage: http://elweb.co/

  6. falcon
    Falcon is a high-performance Python framework for building cloud APIs and web app backends.
    Project Source: https://github.com/racker/falcon
    Project Homepage: http://falconframework.org/

  7. gae-boilerplate
    Google App Engine Boilerplate gets your project off the ground quickly using the Google App Engine platform.
    Project Source: https://github.com/coto/gae-boilerplate
    Project Homepage: http://appengine.beecoss.com/

  8. goagent
    A proxy that runs on Google App Engine (GAE).
    Project Source: https://github.com/goagent/goagent
    Project Homepage: https://code.google.com/p/goagent/

  9. Hue
    Hue is an open source web UI for working with big data on Hadoop.
    Project Source: https://github.com/cloudera/hue
    Project Homepage: http://gethue.com/

  10. libcloud
    Apache Libcloud is a Python library that hides the differences between cloud provider APIs and lets you manage cloud resources through a unified, easy-to-use API.
    Project Source: https://github.com/apache/libcloud
    Project Documentation: http://libcloud.apache.org/

  11. mrjob
    Run MapReduce jobs on Hadoop or Amazon Web Services.
    Project Source: https://github.com/Yelp/mrjob
    Project Documentation: https://pythonhosted.org/mrjob/

  12. nova
    OpenStack Nova provides a cloud computing fabric controller, supporting a wide variety of virtualization technologies, including KVM, Xen, LXC, VMware, and more.
    Project Source: https://github.com/openstack/nova
    Project Homepage: http://www.openstack.org/

  13. pyes
    pyes has offered a Pythonic way to use Elasticsearch since 2010.
    Project Source: https://github.com/aparo/pyes
    Project Documentation: https://pypi.python.org/pypi/pyes/

  14. s3cmd
    Command line tool for managing Amazon S3 and CloudFront services.
    Project Source: https://github.com/s3tools/s3cmd
    Project Homepage: http://s3tools.org/s3cmd