Pro Python System Administration 2nd Edition
Download
Introduction
The role of the system administrator has grown dramatically over the years. The number of systems supported
by a single engineer has also increased. As such, it is impractical to handcraft each installation, and there is a
need to automate as many tasks as possible. The structure of systems varies from organization to organization,
therefore system administrators must be able to create their own management tools. Historically, the most popular
programming languages for these tasks were UNIX shell and Perl. They served their purposes well, and I doubt they
will ever cease to exist. However, the complexity of current systems requires new tools, and the Python programming
language is one of them.
Python is an object-oriented programming language suitable for developing large-scale applications. Its syntax
and structure make it very easy to read—so much so that the language is sometimes referred to as “executable
pseudocode.” The Python interpreter allows for interactive execution, so in some situations an administrator can use
it instead of a standard UNIX shell. Although Python is primarily an object-oriented language, it is easily adopted
for procedural and functional styles of programming. Given all that, Python makes a perfect fit as a new language for
implementing system administration applications. There are a large number of Linux system utilities already written
in Python, such as the Yum package manager and Anaconda, the Linux installation program.
The Structure of this Book
This book contains 14 chapters, and each chapter solves a distinctive problem. Some examples span multiple
chapters, but even then, each chapter deals with a specific aspect of the particular problem.
In addition to the chapters, several other organizational layers characterize this book. First, I grouped the
chapters by the problem type. Chapters 1 to 4 deal with network management issues; Chapters 5 to 7 talk about the
Apache web server and web application management; Chapters 8 to 11 are dedicated to monitoring and statistical
calculations; and Chapters 12 and 13 focus on database management issues.
Second, I maintain a common pattern in all chapters. I start with the problem statement and then move on to
gather requirements and proceed through the design phase before moving into the implementation section.
Third, each chapter focuses on one or more technologies and the Python libraries that provide the language
interface for the particular technology. Examples of such technologies could be the SOAP protocol, application
plug-in architecture, or cloud computing concepts.
More specifically, here’s a breakdown of the chapters:
Chapter 1: Reading and Collecting Performance Data Using SNMP
Most network-attached devices expose the internal counters via the Simple Network Management Protocol (SNMP). This chapter explains basic SNMP principles and the data structure. We then look at the Python libraries that provide
the interface to SNMP–enabled devices. We also investigate the round robin database, which is the de facto standard
for storing the statistical data. Finally, we look at the Jinja2 template framework, which allows us to generate simple
web pages.
Chapter 2: Managing Devices Using the SOAP API
Complicated tasks, such as managing the device configuration, cannot be easily done by using SNMP because
the protocol is too simplistic. Therefore, advanced devices, such as the Citrix Netscaler load balancers, provide the
SOAP API interface to the device management system. In this chapter, we investigate the SOAP API structure and the
libraries that enable the SOAP–based communication from the Python programming language. We also look at the
basic logging functionality using the built-in libraries. This second edition of the book includes examples of how to
use the new REST API to manage the load balancer devices.
Chapter 3: Creating a Web Application for IP Address Accountancy
In this chapter, we build a web application that maintains the list of the assigned IP addresses and the address
ranges. We learn how to create web applications using the Django framework. I show you the way the Django
application should be structured, tell how to create and configure the application settings, and explain the URL
structure. We also investigate how to deploy the Django application using the Apache web server.
Chapter 4: Integrating the IP Address Application with DHCP
this chapter expands on the previous chapter, and we implement the DHCP address range support. We also look
at some advanced Django programming techniques, such as customizing the response MIME type and serving AJAX
calls. This second edition adds new functionality to manage dynamic DHCP leases using OMAPI protocol.
Chapter 5: Maintaining a List of Virtual Hosts in an Apache Configuration File
this is another Django application that we develop in this book, but this time our focus is on the Django
administration interface. While building the Apache configuration management application, you learn how to
customize the default Django administration interface with your own views and functions.
Chapter 6: Gathering and Presenting Statistical Data from Apache Log Files
In this chapter, the goal is to build an application that parses and analyses the Apache web server log files.
Instead of taking the straightforward but in exible approach of building a monolithic application, we look at the
design principles involved in building plug-in applications. You learn how to use the object and class type discovery
functions and how to perform a dynamic module loading. This second edition of the book shows you how to perform
data visualization based on the gathered data.
Chapter 7: Performing Complex Searches and Reporting on Application Log Files
this chapter also deals with the log file parsing, but this time I show you how to parse complex, multi-line log file entries.
Chapter 8: A Web Site Availability Check Script for Nagios
Nagios
is one of the most popular open-source monitoring systems, because its modular structure allows users
to implement their own check scripts and thus customize the tool to meet their needs. In this chapter, we create two
scripts that check the functionality of a website. We investigate how to use the Beautiful Soup HTML parsing library to
extract the information from the HTML web pages.
Chapter 9: Management and Monitoring Subsystem
this chapter starts a three-chapter series in which we build a complete monitoring system. THe goal of this
chapter is not to replace mature monitoring systems such as Nagios or Zenoss but to show the basic principles of the
distributed application programming. We look at database design principles such as data normalization. We also
investigate how to implement the communication mechanisms between network services using the RPC calls.
Chapter 10: Remote Monitoring Agents
This is the second chapter in the monitoring series, where we implement the remote monitoring agent
components. In this chapter, I also describe how to decouple the application from its configuration using the
ConfigParser module.
Chapter 11: Statistics Gathering and Reporting
this is the last part of the monitoring series, where I show you how to perform basic statistical analysis on the
collected performance data. We use scientific libraries: NumPy to perform the calculations and matplotlib to create
the graphs. You learn how to find which performance readings fall into the comfort zone and how to calculate the
boundaries of that zone. We also do the basic trend detection, which provides a good insight for the capacity planning.
Chapter 12: Distributed Message Processing System
this is a new chapter for the second edition of the book. In this chapter I show you how to convert the distributed
management system to use Celery, a remote task execution framework.
Chapter 13: Automatic MySQL Database Performance Tuning
In this chapter, I show you how to obtain the MySQL database configuration variables and the internal status
indicators. We build an application that makes a suggestion on how to improve the database engine performance
based on the obtained data.
Chapter 14: Amazon EC2/S3 as a Data Warehouse Solution
this chapter shows you how to utilize the Amazon Elastic Compute Cloud (EC2) and o oad the infrequent
computation tasks to it. We build an application that automatically creates a database server where you can transfer
data for further analysis. You can use this example as a basis to build an on-demand data warehouse solution.
Home Web Development Pro Python System Administration 2nd Edition