Featured Post

14 Top Data Pipeline Key Terms Explained

Image
 Here are some key terms commonly used in data pipelines 1. Data Sources Definition: Points where data originates (e.g., databases, APIs, files, IoT devices). Examples: Relational databases (PostgreSQL, MySQL), APIs, cloud storage (S3), streaming data (Kafka), and on-premise systems. 2. Data Ingestion Definition: The process of importing or collecting raw data from various sources into a system for processing or storage. Methods: Batch ingestion, real-time/streaming ingestion. 3. Data Transformation Definition: Modifying, cleaning, or enriching data to make it usable for analysis or storage. Examples: Data cleaning (removing duplicates, fixing missing values). Data enrichment (joining with other data sources). ETL (Extract, Transform, Load). ELT (Extract, Load, Transform). 4. Data Storage Definition: Locations where data is stored after ingestion and transformation. Types: Data Lakes: Store raw, unstructured, or semi-structured data (e.g., S3, Azure Data Lake). Data Warehous...

Case Study On Cloud Computing

The concept of cloud computing is in use for many years, but in recent years has it become a highlighted and come in picture. In the year 1990s, cloud computing was developed by major IT providers such as Sun, Microsoft, Google, and Amazon.

Cloud computing
Cloud computing
Different products came into use for different levels of users. The most popular services for end users include web-based email systems (SaaS), e.g. AOL, Gmail, Hotmail, and Yahoo! Mail, and office applications such as Google Docs, Microsoft MS Office Online, Cloud-canvas.com, and Write.fm, etc.

Developers can run their programs on the cloud (PaaS) like Google AppEngine, Windows Azure, and Force.com. Companies or organizations store or backup their large data on remote servers (IaaS), for example, Rackspace, Microsoft Azure, Animoto, Jungle Disk and Amazon's EC2 or S3 servers.

In 2011, the Primary Research Group (PRG) published a report of its recently conducted survey on library use of cloud computing (Primary Research Group, 2011). Participants included 70 libraries worldwide with the majority from the United States. The survey report reveals that 61.97 percent of libraries in the sample used free SaaS while 22.54 percent of libraries sampled used paid subscription SaaS; less than 3 percent of libraries surveyed used PaaS, and 4.23 percent used IaaS.

Most libraries using PaaS or IaaS had annual budgets over $5,000,000. Smaller libraries usually used the servers of their parent organizations, while libraries with multi-million dollar budgets tended to use their own servers.

  • Few case studies also show the exploration and adoption of other models of cloud computing in libraries. For example, California State University libraries have migrated their key library systems to vendors' cloud-based servers (i.e. a public cloud) and to campus IT's internally virtualized environment (i.e. a private cloud) (Wang, 2012). 
  • The Burritt Library at Central Connecticut State University used Amazon's S3 to back up their high-resolution digital objects (Iglesias, 2011). Murray State University library experimented with Dropbox for library services (Bagley, 2011). University of Arizona Libraries have migrated their ILS, Digital Libraries website, Interlibrary Loan system and repository software to cloud-based services (Han, 2010). 
  • The Z. Smith Reynolds Library in Winston Salem, NC, started to use Amazon's EC2 for hosting its website, discovery services, and digital library services in 2009 (Mitchell, 2010b).

The service model represent server administration and maintenance responsibilities are moved from local personnel to the hosting vendor, while the management of the application remains in the traditional way, i.e. librarians are still able to access the backend of the system for local customizations as if they were managing the system locally.

Comments

Popular posts from this blog

SQL Query: 3 Methods for Calculating Cumulative SUM

Big Data: Top Cloud Computing Interview Questions (1 of 4)

How to Fix datetime Import Error in Python Quickly