Python – CGI Programming

Course Curriculum

Python – CGI Programming

The Common Gateway Interface, or CGI, is a set of standards that define how information is exchanged between the web server and a custom script. The CGI specs are currently maintained by the NCSA.

What is CGI?

  • The Common Gateway Interface, or CGI, is a standard for external gateway programs to interface with information servers such as HTTP servers.
  • The current version is CGI/1.1 and CGI/1.2 is under progress.

    Web Browsing

    To understand the concept of CGI, let us see what happens when we click a hyper link to browse a particular web page or URL.

  • Your browser contacts the HTTP web server and demands for the URL, i.e., filename.
  • Web Server parses the URL and looks for the filename. If it finds that file then sends it back to the browser, otherwise sends an error message indicating that you requested a wrong file.
  • Web browser takes response from web server and displays either the received file or error message.
    However, it is possible to set up the HTTP server so that whenever a file in a certain directory is requested that file is not sent back; instead it is executed as a program, and whatever that program outputs is sent back for your browser to display. This function is called the Common Gateway Interface or CGI and the programs are called CGI scripts. These CGI programs can be a Python Script, PERL Script, Shell Script, C or C++ program, etc.

    CGI Architecture Diagram

    Web Server Support and Configuration

    Before you proceed with CGI Programming, make sure that your Web Server supports CGI and it is configured to handle CGI Programs. All the CGI Programs to be executed by the HTTP server are kept in a pre-configured directory. This directory is called CGI Directory and by convention it is named as /var/www/cgi-bin. By convention, CGI files have extension as. cgi, but you can keep your files with python extension .py as well.
    By default, the Linux server is configured to run only the scripts in the cgi-bin directory in /var/www. If you want to specify any other directory to run your CGI scripts, comment the following lines in the httpd.conf file −

    AllowOverride None
    Options ExecCGI
    Order allow,deny
    Allow from all
    Options All

    Here, we assume that you have Web Server up and running successfully and you are able to run any other CGI program like Perl or Shell, etc.

    First CGI Program

    Here is a simple link, which is linked to a CGI script called hello.py. This file is kept in /var/www/cgi-bin directory and it has following content. Before running your CGI program, make sure you have change mode of file using chmod 755 hello.py UNIX command to make file executable.

    print ("Content-type:text/htmlrnrn")
    print ('')
    print (''
    print ('<title>Hello World - First CGI Program</title>')
    print ('')
    print ('')
    print ('<h2>Hello World! This is my first CGI program</h2>')
    print ('')
    print ('')

    If you click hello.py, then this produces the following output −

    Hello World! This is my first CGI program

    Hello World! This is my first CGI program

    This hello.py script is a simple Python script, which writes its output on STDOUT file, i.e., screen. There is one important and extra feature available which is first line to be printed Content-type:text/htmlrnrn. This line is sent back to the browser and it specifies the content type to be displayed on the browser screen.
    By now you must have understood basic concept of CGI and you can write many complicated CGI programs using Python. This script can interact with any other external system also to exchange information such as RDBMS.

    HTTP Header

    The line Content-type:text/htmlrnrn is part of HTTP header which is sent to the browser to understand the content. All the HTTP header will be in the following form −

    HTTP Field Name: Field Content
    For Example
    Content-type: text/htmlrnrn

    There are few other important HTTP headers, which you will use frequently in your CGI Programming.

    Sr.No. Header & Description
    1 Content-type:

    A MIME string defining the format of the file being returned. Example is Content-type:text/html

    2 Expires: Date

    The date the information becomes invalid. It is used by the browser to decide when a page needs to be refreshed. A valid date string is in the format 01 Jan 1998 12:00:00 GMT.

    3 Location: URL

    The URL that is returned instead of the URL requested. You can use this field to redirect a request to any file.

    4 Last-modified: Date

    The date of last modification of the resource.

    5 Content-length: N

    The length, in bytes, of the data being returned. The browser uses this value to report the estimated download time for a file.

    6 Set-Cookie: String

    Set the cookie passed through the string

    CGI Environment Variables

    All the CGI programs have access to the following environment variables. These variables play an important role while writing any CGI program.

    Sr.No. Variable Name & Description
    1 CONTENT_TYPE

    The data type of the content. Used when the client is sending attached content to the server. For example, file upload.

    2 CONTENT_LENGTH

    The length of the query information. It is available only for POST requests.

    3 HTTP_COOKIE

    Returns the set cookies in the form of key & value pair.

    4 HTTP_USER_AGENT

    The User-Agent request-header field contains information about the user agent originating the request. It is name of the web browser.

    5 PATH_INFO

    The path for the CGI script.

    6 QUERY_STRING

    The URL-encoded information that is sent with GET method request.

    7 REMOTE_ADDR

    The IP address of the remote host making the request. This is useful logging or for authentication.

    8 REMOTE_HOST

    The fully qualified name of the host making the request. If this information is not available, then REMOTE_ADDR can be used to get IR address.

    9 REQUEST_METHOD

    The method used to make the request. The most common methods are GET and POST.

    10 SCRIPT_FILENAME

    The full path to the CGI script.

    11 SCRIPT_NAME

    The name of the CGI script.

    12 SERVER_NAME

    The server's hostname or IP Address

    13 SERVER_SOFTWARE

    The name and version of the software the server is running.

    import os
    print("Content-type: text/htmlrnrn");
    print ("<font size="+1">Environment</font>");
    for param in os.environ.keys():
    print ("<b>%20s</b>: %s" % (param, os.environ[param]))

    GET and POST Methods

    You must have come across many situations when you need to pass some information from your browser to web server and ultimately to your CGI Program. Most frequently, browser uses two methods two pass this information to web server. These methods are GET Method and POST Method.

    Passing Information using GET method

    The GET method sends the encoded user information appended to the page request. The page and the encoded information are separated by the ? character as follows −

    http://www.test.com/cgi-bin/hello.py?key1=value1&key2=value2

    The GET method is the default method to pass information from browser to web server and it produces a long string that appears in your browser's Location:box. Never use GET method if you have password or other sensitive information to pass to the server. The GET method has size limitation: only 1024 characters can be sent in a request string. The GET method sends information using QUERY_STRING header and will be accessible in your CGI Program through QUERY_STRING environment variable.
    You can pass information by simply concatenating key and value pairs along with any URL or you can use HTML tags to pass information using GET method.

    Simple URL Example:Get Method

    Here is a simple URL, which passes two values to hello_get.py program using GET method.
    Below is hello_get.py script to handle input given by web browser. We are going to use cgi module, which makes it very easy to access passed information −

    # Import modules for CGI handling
    import cgi, cgitb
    # Create instance of FieldStorage
    form = cgi.FieldStorage()
    # Get data from fields
    first_name = form.getvalue('first_name')
    last_name  = form.getvalue('last_name')
    print ("Content-type:text/htmlrnrn")
    print ("")
    print ("")
    print ("<title>Hello - Second CGI Program</title>")
    print ("")
    print ()"")
    print ("<h2>Hello %s %s</h2>" % (first_name, last_name))
    print ("")
    print ("")

    This would generate the following result −

    Hello ZARA ALI

    Simple FORM Example:GET Method

    This example passes two values using HTML FORM and submit button. We use same CGI script hello_get.py to handle this input.

    First Name:   <br />
    Last Name:

    Here is the actual output of the above form, you enter First and Last Name and then click submit button to see the result.

    Passing Information Using POST Method

    A generally more reliable method of passing information to a CGI program is the POST method. This packages the information in exactly the same way as GET methods, but instead of sending it as a text string after a ? in the URL it sends it as a separate message. This message comes into the CGI script in the form of the standard input.
    Below is same hello_get.py script which handles GET as well as POST method.

    # Import modules for CGI handling
    import cgi, cgitb
    # Create instance of FieldStorage
    form = cgi.FieldStorage()
    # Get data from fields
    first_name = form.getvalue('first_name')
    last_name  = form.getvalue('last_name')
    print ("Content-type:text/htmlrnrn")
    print ("")
    print ("")
    print ("<title>Hello - Second CGI Program</title>")
    print ("")
    print ("")
    print ("<h2>Hello %s %s</h2>" % (first_name, last_name))
    print ("")
    print ("")

    Let us take again same example as above which passes two values using HTML FORM and submit button. We use same CGI script hello_get.py to handle this input.

    First Name: <br />
    Last Name:

    Here is the actual output of the above form. You enter First and Last Name and then click submit button to see the result.

    Passing Checkbox Data to CGI Program

    Checkboxes are used when more than one option is required to be selected.
    Here is example HTML code for a form with two checkboxes −

    Maths
    Physics

    The result of this code is the following form −
    Below is checkbox.cgi script to handle input given by web browser for checkbox button.

    # Import modules for CGI handling
    import cgi, cgitb
    # Create instance of FieldStorage
    form = cgi.FieldStorage()
    # Get data from fields
    if form.getvalue('maths'):
    math_flag = "ON"
    else:
    math_flag = "OFF"
    if form.getvalue('physics'):
    physics_flag = "ON"
    else:
    physics_flag = "OFF"
    print ("Content-type:text/htmlrnrn")
    print ("")
    print ("")
    print ("<title>Checkbox - Third CGI Program</title>")
    print ("")
    print ("")
    print ("<h2> CheckBox Maths is : %s</h2>" % math_flag)
    print ("<h2> CheckBox Physics is : %s</h2>" % physics_flag)
    print ("")
    print ("")

    Passing Radio Button Data to CGI Program

    Radio Buttons are used when only one option is required to be selected.
    Here is example HTML code for a form with two radio buttons −

    Maths
    Physics

    The result of this code is the following form −
    Below is radiobutton.py script to handle input given by web browser for radio button −

    # Import modules for CGI handling
    import cgi, cgitb
    # Create instance of FieldStorage
    form = cgi.FieldStorage()
    # Get data from fields
    if form.getvalue('subject'):
    subject = form.getvalue('subject')
    else:
    subject = "Not set"
    print("Content-type:text/htmlrnrn")
    print ("")
    print ("")
    print ("<title>Radio - Fourth CGI Program</title>")
    print ("")
    print ("")
    print ("<h2> Selected Subject is %s</h2>" % subject)
    print ("")
    print ("")

    Passing Text Area Data to CGI Program

    TEXTAREA element is used when multiline text has to be passed to the CGI Program.
    Here is example HTML code for a form with a TEXTAREA box −

    <textarea name="textcontent" cols="40" rows="4">
    Type your text here...
    </textarea>

    The result of this code is the following form −
    Below is textarea.cgi script to handle input given by web browser −

    # Import modules for CGI handling
    import cgi, cgitb
    # Create instance of FieldStorage
    form = cgi.FieldStorage()
    # Get data from fields
    if form.getvalue('textcontent'):
    text_content = form.getvalue('textcontent')
    else:
    text_content = "Not entered"
    print ("Content-type:text/htmlrnrn")
    print ("")
    print ("")
    print ("<title>Text Area - Fifth CGI Program</title>")
    print ("")
    print ("")
    print ("<h2> Entered Text Content is %s</h2>" % text_content)
    print ("")

    Passing Drop Down Box Data to CGI Program

    Drop Down Box is used when we have many options available but only one or two will be selected.
    Here is example HTML code for a form with one drop down box −

    Maths
    Physics

    The result of this code is the following form −
    Below is dropdown.py script to handle input given by web browser.

    # Import modules for CGI handling
    import cgi, cgitb
    # Create instance of FieldStorage
    form = cgi.FieldStorage()
    # Get data from fields
    if form.getvalue('dropdown'):
    subject = form.getvalue('dropdown')
    else:
    subject = "Not entered"
    print ("ontent-type:text/htmlrnrn")
    print ("")
    print ("")
    print ("<title>Dropdown Box - Sixth CGI Program</title>")
    print ("")
    print ("")
    print ("<h2> Selected Subject is %s</h2>" % subject)
    print ("")
    print ("")

    Using Cookies in CGI

    HTTP protocol is a stateless protocol. For a commercial website, it is required to maintain session information among different pages. For example, one user registration ends after completing many pages. How to maintain user's session information across all the web pages?
    In many situations, using cookies is the most efficient method of remembering and tracking preferences, purchases, commissions, and other information required for better visitor experience or site statistics.

    How It Works?

    Your server sends some data to the visitor's browser in the form of a cookie. The browser may accept the cookie. If it does, it is stored as a plain text record on the visitor's hard drive. Now, when the visitor arrives at another page on your site, the cookie is available for retrieval. Once retrieved, your server knows/remembers what was stored.
    Cookies are a plain text data record of 5 variable-length fields −

  • Expires − The date the cookie will expire. If this is blank, the cookie will expire when the visitor quits the browser.
  • Domain − The domain name of your site.
  • Path − The path to the directory or web page that sets the cookie. This may be blank if you want to retrieve the cookie from any directory or page.
  • Secure − If this field contains the word "secure", then the cookie may only be retrieved with a secure server. If this field is blank, no such restriction exists.
  • Name=Value − Cookies are set and retrieved in the form of key and value pairs.

    Setting up Cookies

    It is very easy to send cookies to browser. These cookies are sent along with HTTP Header before to Content-type field. Assuming you want to set UserID and Password as cookies. Setting the cookies is done as follows −

    print ("et-Cookie:UserID = XYZ;rn")
    print ("Set-Cookie:Password = XYZ123;rn")
    print ("Set-Cookie:Expires = Tuesday, 31-Dec-2007 23:12:40 GMT";rn")
    print ("Set-Cookie:Domain = prutor.ai;rn")
    print ("Set-Cookie:Path = /perl;n")
    print ("Content-type:text/htmlrnrn")
    ...........Rest of the HTML Content....

    From this example, you must have understood how to set cookies. We use Set-Cookie HTTP header to set cookies.
    It is optional to set cookies attributes like Expires, Domain, and Path. It is notable that cookies are set before sending magic line "Content-type:text/htmlrnrn.

    Retrieving Cookies

    It is very easy to retrieve all the set cookies. Cookies are stored in CGI environment variable HTTP_COOKIE and they will have following form −

    key1 = value1;key2 = value2;key3 = value3....

    Here is an example of how to retrieve cookies.

    # Import modules for CGI handling
    from os import environ
    import cgi, cgitb
    if environ.has_key('HTTP_COOKIE'):
    for cookie in map(strip, split(environ['HTTP_COOKIE'], ';')):
    (key, value ) = split(cookie, '=');
    if key == "UserID":
    user_id = value
    if key == "Password":
    password = value
    print ("User ID  = %s" % user_id)
    print ("Password = %s" % password)

    This produces the following result for the cookies set by above script −

    User ID = XYZ
    Password = XYZ123

    File Upload Example

    To upload a file, the HTML form must have the enctype attribute set to multipart/form-data. The input tag with the file type creates a "Browse" button.

    <p>File: </p>
    <p></p>

    The result of this code is the following form −
    File:
    Above example has been disabled intentionally to save people uploading file on our server, but you can try above code with your server.
    Here is the script save_file.py to handle file upload −

    import cgi, os
    import cgitb; cgitb.enable()
    form = cgi.FieldStorage()
    # Get filename here.
    fileitem = form['filename']
    # Test if the file was uploaded
    if fileitem.filename:
    # strip leading path from file name to avoid
    # directory traversal attacks
    fn = os.path.basename(fileitem.filename)
    open('/tmp/' + fn, 'wb').write(fileitem.file.read())
    message = 'The file "' + fn + '" was uploaded successfully'
    else:
    message = 'No file was uploaded'
    print """
    Content-Type: text/htmln
    <p>%s</p>
    """ % (message,)

    If you run the above script on Unix/Linux, then you need to take care of replacing file separator as follows, otherwise on your windows machine above open() statement should work fine.

    fn = os.path.basename(fileitem.filename.replace("", "/" ))

    How To Raise a "File Download" Dialog Box?

    Sometimes, it is desired that you want to give option where a user can click a link and it will pop up a "File Download" dialogue box to the user instead of displaying actual content. This is very easy and can be achieved through HTTP header. This HTTP header is be different from the header mentioned in previous section.
    For example, if you want make a FileName file downloadable from a given link, then its syntax is as follows −

    # HTTP Header
    print "Content-Type:application/octet-stream; name = "FileName"rn";
    print "Content-Disposition: attachment; filename = "FileName"rnn";
    # Actual File Content will go here.
    fo = open("foo.txt", "rb")
    str = fo.read();
    print str
    # Close opend file
    fo.close()

    Hope you enjoyed this tutorial. If yes, please send me your feedback at: Contact Us

Python – Exceptions Handling (Prev Lesson)
(Next Lesson) Python – MySQL Database Access