Data, Data Types and Essentials of Python Programming: Module 2: What's to Understand and Learn ...!

Abstract

There are four collection data types in the Python programming language:

  • List is a collection which is ordered and changeable. Allows duplicate members.
  • Tuple is a collection which is ordered and unchangeable. Allows duplicate members.
  • Set is a collection which is unordered, unchangeable*, and unindexed. No duplicate members.
  • Dictionary is a collection which is ordered** and changeable. No duplicate members.
Keywords
Lists, Operators, Strings and Tuples, Structuring Data, Example Program, Dictionaries

Learning Outcomes
After undergoing this article you will be able to understand the following

1. Lists: 
2. The List 
3. Data Type
4. Working with Lists, 
5. Augmented Assignment Operators, 
6. Methods
7. Example Program: 
Magic 8 Ball with a List, 
8. List-like Types
9. Strings and Tuples, 
10. References in Python
11. Dictionaries in Python
12. Structuring Data
13. The Dictionary Data Type 14. Pretty Printing, 
15. Using Data Structures to Model Real-World Things
16.. Conclusions
17. FAQs
References

Module 2: An Introduction

This module is designed to introduce you to the essential elements of Python. We will begin by studying the basic types of objects that are built-in to Python, which will enable us to work with numbers, text, and containers that can store a collection of objects. Lists, tuples, and strings all store sequences of objects (characters, in the case of strings), as such Python provides a common interface for working with these types; your ability to manipulate sequences of data will be a cornerstone for nearly all STEM work that you do in Python.

Having introduced Python’s basic types of objects and the means for working with sequences, we will formally discuss the process of assigning variables to these objects. It is not uncommon to assign multiple variables to the same object; Python’s treatment of multiple “references” to a single object will be resolved here, and in doing so we will distinguish mutable objects from immutable objects.

Armed with our growing toolkit of Python objects, and our newfound understanding of how to reference them with variables, we proceed to learn how to control the flow of logic within our code. “if”, “else”, and “else-if” statements are defined so that we can have branches of code be executed only if user-specified conditions are met (e.g. if a student’s grade is below 65, execute code to email that student a warning). “while-loops” and “for-loops” permit us to execute blocks of code repeatedly (e.g. for each student in this list, execute the code to compute that student’s average score). These so-called control-flow tools will greatly improve our ability to write useful code.

Coming off our discussion of for-loops, we take the opportunity to discuss some niceties of the Python language that arise when working with objects that are iterable (e.g. can be iterated over in a for-loop). Niceties is actually an understatement; these tricks of the trade will greatly bolster our ability to write clean, concise, and efficient code. We will pay particular attention to generator comprehension statements, which will allow us to process long sequences of data without having to hold all of the data in memory. You will be glad to have these tricks and tools in your repertoire.

Returning to a more traditional paradigm of programming languages, we will learn how to define our own functions. This will allow us to encapsulate code for reuse and invoke the code on-demand by “calling” the function that contains that code. Functions enable us to write code that is modular and to construct powerful algorithms by relying on these functions.

Finally, we will return to our initial endeavor of learning about the various types of objects that are built-in to Python, adding dictionaries, sets, and other types of collections to our inventory. Now that we are much more familiar with the language as a whole, we can concern ourselves with more nuanced, but extremely important matters. This involves discussing the efficiency of the algorithms used under the hood by its different data structures. For example, it will become clear that checking if an object is contained in a set is much more efficient than checking for membership in a list. Using the right tool for a given task is of manifest importance.

Although far from comprehensive, this module will acquaint you with the many of the essential elements of Python, along with the niceties that make the language easy to use. The objective here is to equip the reader with the tools needed to write clear and efficient code, that is particularly effective for data science applications. It is paramount that writing code in Python does not feel like stacking a bunch black boxes together; for this reason, a considerable amount of detail is included here. I hope that a reasonable balance has been struck such that this text is still easy to read and that its key “takeaways” are readily distilled.


Number Types

Python has three basic types of numbers: integers, “floating-point” numbers, and complex numbers. Familiar mathematical symbols can be used to perform arithmetic on all of these numbers (comparison operators like “greater than” are not defined for complex numbers):

Operation

Description

x + y

Sum of two numbers

x - y

Difference of two numbers

x * y

Product of two numbers

x / y

Quotient of two numbers

x // y

Quotient of two numbers, returned as an integer

x % y

x “modulo”: y: The remainder of x / y for positive xy

x ** y

x raised to the power y

-x

A negated number

abs(x)

The absolute value of a number

x == y

Check if two numbers have the same value

x != y

Check if two numbers have different values

x > y

Check if x is greater than y

x >= y

Check if x is greater than or equal to y

x < y

Check if x is less than y

x <= y

Check if x is less than or equal to y

These operations obey the familiar order of operations from your mathematics class, with parentheses available for association:

# multiplication takes precedence over addition
>>> 1 + 2 * 3
7

# grouping operations with parentheses
>>> (1 + 2) * 3
9

# finding the remainder of division between two positive numbers
>>> 11 % 5
1

# checking an inequality
>>> (2 ** 3) < (2 ** 4)
True

It should be noted that in many other programming languages, including the out-dated Python 2, dividing two integers would always return an integer - even if mathematically the result should be a fraction. In Python 3, the quotient of two integers will always return a floating-point number (i.e. a number that includes a decimal point):

# In many other languages, 3 / 2 returns the integer 1.
# In Python 3, division always returns a floating-point number:
>>> 3 / 2
1.5

>>> 4 / 2
2.0

The // operator is known as the “floor-divide” operator: it performs division between two numbers and returns the result as an integer by discarding any decimal-places for that number (thus returning the “floor” of that number). This can be used to perform the integer-division traditionally used in other programming languages:

# floor-division
>>> 1 // 3  # 0.3333.. -> 0
0
>>> 3 // 2  # 1.5 -> 1
1

Reading Comprehension: Understanding the modulo operator

The modulo operator, %, is not commonly seen in mathematics textbooks. It is, however, a very useful operation to have at our disposal. x % y (said as x “mod” y in programmer’s jargon) returns the remainder of x / y, when x and `y are non-negative numbers. For example:

  • . 2 “goes into” 3 one time, leaving a remainder of 1. Thus 3 % 2 returns 1

  • . 3 “goes into” 9 three times, and leaves no remainder. Thus 9 % 3 returns 0

Given this description of the “mod” operator, simplify the following by hand, and then use the IPython console to check your work:

  1. 1 % 5

  2. 2 % 5

  3. 22 % 1

  4. 22 % 2

  5. 22 % 3

  6. 22 % 4

  7. 22 % 5

  8. 22 % 6

Now, given any integer, n, what are the possible values that n % 2 can return? See if you can come up with a simple rule for explaining the behavior of n % 2.

Python’s math module

The standard library’s math module provides us with many more mathematical functions, likcomplete listing of them can be found in the official Python documentation. This module must be importede logarithms and trigonometric functions. A  into your code in order to use its functions:

# using the `math` module to use
# additional mathematical functions
>>> import math
>>> math.sqrt(4.)
2.0

# base-10 log
>>> math.log10(10.)
1.0

# 4! = 4*3*2*1
>>> math.factorial(4)
24

Integers

As in traditional mathematics, an integer is any “whole” number: .

Integers belong to the built-in type int, which can be used to convert objects to integers:

>>> type(-3)
int

# `1.3` is not an integer-type object
>>> isinstance(1.3, int)
False

# converting a string to an integer
>>> int("10")
10

# converting a floating-point number to an integer
>>> int(1.3)
1

You can create as large an integer as you’d like; Python will allocate as much memory as needed (and ultimately, as is available) to store an integer’s exact value:

# you can make an integer as large as you'd like
>>> large_int = 281938481039848500192847576920

Integers have some built-in functions available to them, which are detailed in the official documentation. The utility of these will likely be quite obscure to new programmers. Just note that they are here for now.

Floating-Point Numbers

A “floating-point” number is a number with a decimal point. Referred to as a “float” for short, this can be used to represent any number, up to a limited number of digits.

These objects belong to the built-in type float, which can be used to convert objects to floats:

# examples of "floating-point" numbers
>>> 100. ** 0.5
10.0

>>> 1 / 3
0.3333333333333333

>>> 1 / 2
0.5

>>> type(-2.1)
float

# the integer 10 is not a float-type object
>>> isinstance(10, float)
False

# including a decimal makes the number a float
>>> isinstance(10., float)
True

# converting a string to a floating-point number
>>> float("10.456")
10.456

# converting an integer to a floating-point number
>>> float(-22)
-22.0

Floats have a couple of built-in functions available to them, as detailed in the official documentation.

Scientific Notation

A float can also be created using familiar scientific notation. The character e is used to represent , and the proceeding number is the exponent. Here are some examples of traditional scientific notation, and their corresponding representation in Python:

 1.38e-04

 -4.2e10

Python will automatically display a float that possesses many digits in scientific notation:

# python will display many-digit numbers using
# scientific notation
>>> 0.0000001  # seven leading-zeros
1e-07

Understanding Numerical Precision

Whereas a Python integer can be made to be as large as you’d like, a floating-point number is limited in the number of digits it can store. That is, your computer will only use a set amount of memory - 8 bytes (64 bits) on most machines - to store the value of a floating-point number.

In effect, this means that a float can only be represented with a numerical precision of approximately 16 decimal places, when that number is written in scientific notation. The computer will not be able to reliably represent a number’s digits beyond those accounted for by the allotted 8 bytes. For instance, the following Python integer is defined with 100 digits, but when this number is converted to a float, it only retains 15 decimal places in scientific notation:

# Demonstrating the finite-precision of a float.

# An integer with 100 digits - Python will use as
# much memory as needed to store an integer
>>> int("1"*100)  # creates a string with 100 1s and makes it an int
1111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111

# Converted to a float, it retains only
# 16 decimal places, when written in scientific
# notation. This is the precision permitted by
# 8 bytes of memory.
>>> float("1"*100)  # creates a string with 100 1s and makes it a float
1.111111111111111e+99

The computer cannot keep track of those last 84 decimal places because doing so would require more than 8 bytes of memory to store the entire value of that float. If you had been diligently counting stars in the sky (perhaps across many universes, this number far exceeds the estimated number of stars in our universe), you would have just lost track of over  of them simply by converting your integer count to a float!

As such, attempting to modify a floating point number in decimal places beyond its numerical precision does not have any effect:

# changing a float beyond its precision has no effect
>>> 1. + 1e-16
1.0

Even in light of this discussion on float precision, you may be shocked and dismayed to see the following outcome of float arithmetic:

# the finite-precision of floats
# result in non-obvious behavior
>>> 0.1 + 0.1 + 0.1 - 0.3 == 0.
False

# the effects of having finite numerical precision
>>> 0.1 + 0.1 + 0.1 - 0.3
5.551115123125783e-17

This is not a quirk of Python; this is a well-understood aspect of dealing with floating-point numbers with limited numerical precision. To accommodate this, don’t check if two floats are “equal”. Rather, you should check if they are “close enough in value”. Let me emphasize this:

You should never check to see if two floats are exactly equal in value. Instead, you should only check that two floats are approximately equal to one another.

The math module has a very nice function for this; math.isclose will check if the relative difference between two numbers is less than . You can change this tolerance value along with the type of tolerance-checking used by the function; see its documentation here. Because in the previous example we compare values that are close to 0, we will check if their absolute difference is sufficiently small:

# checking if two float values are "almost equal"
>>> import math

# check:
# | (0.1 + 0.1 + 0.1 - 0.3) - 0 | < 1x10^{-9}
>>> math.isclose((0.1 + 0.1 + 0.1 - 0.3), 0., abs_tol=1e-9)
True

If you do not heed this lesson, it is inevitable that you will end up with serious, hard-to-find bugs in your code. Lastly, when doing numerical work in Python (and any other programming language), you must understand that the finite numerical precision of floating-point numbers is a source of error, akin to error associated with imprecision with a measuring device, and should be accounted for in your analysis (if error analysis is warranted).

Python’s decimal module can be used to define higher (or lower) precision numbers than permitted by the standard 8-byte floats. Furthermore, all arithmetic involving decimal numbers from this module is guaranteed to be exact, meaning that 0.1 + 0.1 + 0.1 - 0.3 would be exactly 0.. There is also a built-in fractions module, which provides tools for working with exact representations of rational numbers. Although we will not be using them here, it is very important to keep in mind that these modules exist and that floating point numbers are not the only way around the number line in Python.

Complex Numbers

In mathematics, a “complex number” is a number with the form , where  and  are real-valued numbers, and  is defined to be the number that satisfies the relationship . Because no real-valued number satisfies this relationship,  is called the “imaginary number”.

Weirdo electrical engineers use the symbol  in place of , which is why Python displays the complex number  as 2+3j (this is actually because  typically denotes current; we like electrical engineers too).

Along with the a + bj syntax, built-in type complex can be used to create complex-type numbers:

# creating complex numbers
>>> 2 + 3j
(2+3j)

>>> complex(2, 3)
(2+3j)

>>> complex(0, 1)**2
(-1+0j)

>>> type(2+3j)
complex

>>> isinstance(2-4j, complex)
True

Note that j is not, by itself, reserved as a special placeholder for . Rather, j must be preceded immediately with a numerical literal (i.e. you cannot use a variable) in order for the Python interpreter to treat it as a complex number.

# `j` by itself is treated like any other character
>>> j
NameError: name 'j' is not defined

# `1j` is interpreted as the imaginary number
>>> (1j) ** 2
(-1+0j)

You can access a and b from a + bj, the real and imaginary parts of the complex number, respectively.

# Accessing the real and imaginary parts of
# a complex number.
>>> x = complex(1.2, -3.4)
>>> x.real
1.2
>>> x.imag
-3.4

The cmath (“complex math”) module provides a collection of mathematical functions defined for complex numbers. For a complete listing of these functions, refer to the official documentation.

Reading Comprehension: Working with numbers in Python

  1. In Python, performing an arithmetic operation, such as addition or multiplication, on two integers will return an integer, and performing an operation on two floats will return a float:

>>> 2 * 3
6

>>> 2.0 * 3.0
6.0

For which operation, among + - * / **, does this not hold?

  1. What type of number will be returned if you perform a mathematical operation using an integer and a floating-point number? Does this hold for all the arithmetic operations? Determine this by trial and error.

  2. Given the function , make use of the math module to compute .

  3. Using Python’s syntax for scientific notation, write an expression that verifies that one trillion divided by one billion is equal to one thousand

Augmented Assignment Statements

Python provides a nice “shortcut” for updating a variable via an arithmetic operation. For example, suppose you want to increase the value of x by 1. Currently, we would update x as follows:

# incrementing `x` by 1
>>> x = 5
>>> x = x + 1
>>> x
6

We can make use of a special assignment operation += to perform this update in an abbreviated way.

# using `+=` to increment `x` by 1
>>> x = 5
>>> x += 1  # equivalent to: `x = x + 1`
>>> x
6

+= is a type of augmented assignment statement. In general, an augmented assignment performs a mathematical operation on a variable, and then updates that variable using the result. Augmented assignment statements are available for all of the arithmetic operations. Assuming x and n are both types of numbers, the following summarizes the available arithmetic augmented assignment statements that we can perform on x, using n:

  • x += n  x = x + n

  • x -= n  x = x - n

  • x *= n  x = x * n

  • x /= n  x = x / n

  • x //= n  x = x // n

  • x **= n  x = x ** n

Improving The Readability of Numbers

Python version 3.6 introduced the ability to include underscores between the digits of a number as a visual delimiter. This character can be used to improve the readability of long numbers in your code. For example the number 662607004 can be rewritten as 662_607_004, using _ to delimit digits separated by orders of one thousand. Leading, trailing, or multiple underscores in a row are not allowed; otherwise this character can be included anywhere within a numerical literal.

# examples of using `_` as a visual delimiter in numbers
>>> 1_000_000  # this is nice!
1000000

# this is gross but is permitted
>>> 2_3_4.5_6_7
234.567

# underscores work with all variety of numerical literals
>>> 10_000j
10000j

Compatibility Warning

The permitted use of the underscore character, _, in numerical literals was introduced in Python 3.6. Thus utilizing this syntax in your code will render it incompatible with Python 3.5 and earlier.

The Boolean Type

There are two boolean-type objects: True and False; they belong to the built-in type bool. We have already seen that the isinstance function either returns True or False, as a given object either is or isn’t an instance of a specific type.

# the two boolean-objects: `True` and `False`
>>> type(True)
bool

# `False` is a boolean-type object
>>> isinstance(False, bool)
True

True and False must be specified with capital letters in Python. These should not be confused with strings; note that there are no quotation marks used here.

Logic Operators

Python provides familiar operators for performing basic boolean logic:

Logic Operation

Symbolic Operator

and

&

or

|

# demonstrating boolean-logic operators
>>> True or False
True

>>> True and False
False

>>> not False
True

Operator symbols are available in place of the reserved words and and or:

# demonstrating the symbolic logic operators
>>> False | True  # equivalent to: `False or True`
True

>>> False & True  # equivalent to: `False and True`
False

That being said, it is generally more “Pythonic” (i.e. in-vogue with Python users) to favor the use of the word-operators over the symbolic ones.

Multiple logic operators can be used in a single line and parentheses can be used to group expressions:

>>> (True or False) and True
True

Comparison statements used in basic mathematics naturally return boolean objects.

>>> 2 < 3
True

>>> 10.5 < 0
False

>>> (2 < 4) and not (4 != -1)
False

The bool type has additional utilities, which will be discussed in the “Conditional Statements” section.

Boolean Objects are Integers

The two boolean objects True and False formally belong to the int type in addition to bool, and are associated with the values 1 and 0, respectively:

>>> isinstance(True, int)
True

>>> int(True)
1

>>> isinstance(False, int)
True

>>> int(False)
0

As such, they can be used in mathematical expressions interchangeably with 1 and 0

>>> 3*True - False  # equivalent to: 3*1 + 0
3

>>> True / False  # equivalent to: 1 / 0
---------------------------------------------------------------------------
ZeroDivisionError                         Traceback (most recent call last)
<ipython-input-4-f8487d9d0863> in <module>()
----> 1 True / False

ZeroDivisionError: division by zero

The purpose of having True and False double as integers is beyond the scope of this section. It is simply useful to be aware of these facts so that this behavior is not completely alien to you as you begin to write code in Python.

Reading Comprehension: Boolean expressions

  1. Assuming x is a an integer-type, write a comparison statement that will return True if x is an even number, and False otherwise. (Hint: recall the purpose of the % operator)

  2. Assuming x and y are both real-valued numbers (i.e. not complex numbers), write a line of code that will return False if: x and y are within 0.9 of one another, and x is a positive number. (Hint: try writing the expression that will return True for this condition, and then negate it)

  3. Write an expression that returns True if x is a boolean-type object or a float-type object.

The None-Type

There is a simple type, NoneType that has exactly one object: NoneNone is used to represent “null”… nothing.

# `None` is the *only* object belonging to NoneType
>>> type(None)
NoneType

As such, instead of checking if an object belongs to NoneType, you should simply check if the object is None. Python reserves is as an operation that checks if two objects are identical. This is different than ==, which checks if two objects are associated with the same value or state:

# Check if an object "is" None, instead
# of checking if it is of NoneType
>>> x = 22

>>> x is None
False

>>> x is not None
True

>>> y = None

>>> y is None
True

None appears frequently, and is often used as a placeholder in code. Here is a simple example where None could be useful; don’t worry that this code may not make perfect sense to you yet:

# Demonstrating the use of `None` as a placeholder

# In this code, we want to get the first
# item in a list that is greater than 10, and notify
# the user if there is no such number

large_num = None

for number in [1, 2, 3, 4]:
    if number > 10:
        large_num = number
        break

if large_num is None:
    print("The list did not contain any number larger than 10")

Strings

Introducing the string type

The string type is used to store written characters. A string can be formed using:

  • single quotes: 'Hello world'

  • double quotes: "Hello world"

  • triple quotes: """Hello world""" or '''Hello world'''

# Strings contain written characters, even those
# not found in the english alphabet!
>>> "hello, 你好, Olá, 123"
'hello, 你好, Olá, 123'

By default, Python 3 uses UTF-8 unicode to represent this wide variety of characters. Don’t worry about this detail beyond making note of it, for now.

Strings belong to the built-in str type, which can be used to convert non-string objects into strings.

# the type `str`
>>> type("hello")
str

>>> isinstance("83", str)
True

# Using the type `str` to convert non-string objects
# into strings.
>>> str(10.34)
'10.34'

>>> str(True)
'True'

Once a string is formed, it cannot be changed (without creating an entirely new string). Thus a given string object cannot be “mutated” - a string is an immutable object.

As the string stores a sequence of characters, Python provides a means for accessing individual characters and subsequences of characters from a string:

>>> sentence = "The cat in the hat."
>>> sentence[0]
'T'
>>> sentence[0:3]
'The'

Strings are not the only sequence-type in Python; lists and tuples are examples of sequences as well. We will reserve a separate section to learn about the common interface that Python has for all of its types that are sequential in nature, including the “indexing” and “slicing” demonstrated here.

String essentials

We will only scratch the surface with strings, touching on some essentials. Please refer to the official Python tutorial for a more extensive, but still informal, overview of strings.

In a string, \n is treated as a single character. It denotes a new-line in a string, and will be rendered thusly when the string is printed. Similarly, \t will render as a tab-character.

# using `\n` to create a newline
>>> print("hi...\n...bye")
hi...
...bye

Using triple quotes allows you to write a block-string, meaning that you can include text on multiple lines, and it is all still treated as one string:

# using triple-quotes to write a multi-line string
>>> x = """ I am a string.
I am part of the same string.
    me... too!"""

>>> x
' I am a string.\nI am part of the same string.\n    me... too!'

Python’s strings have a large number of fantastic, built-in functions available to them. It is very important that you familiarize yourself with these functions by looking over the official documentation. To demonstrate a few of these:

# demonstrating a few of the built-in functions for strings
>>> "hello".capitalize()
'Hello'

# join a list of strings, using "..."
>>> "...".join(["item1", "item2", "item3"])
'item1...item2...item3'

# split a string wherever ", " occurs
>>> 'item1, item2, item3'.split(", ")
['item1', 'item2', 'item3']

# does this string end with ".py"?
>>> "script.py".endswith(".py")
True

# does this string start with "sc"?
>>> "script.py".startswith("sc")
True

# insert objects into a string, in its
# "formatting" fields {}
>>> "x: {}, y: {}, z: {}".format(3.2, 8.4, -1.0)
'x: 3.2, y: 8.4, z: -1.0'

# Are the characters in the string
# numerical digits?
>>> "7".isdigit()
True

Formatting strings

Python provides multiple syntaxes for formatting strings; these permit us to do things like programmatically inject the values of variables into strings, align fields using whitespace, and control the number of decimal places with which numbers are displayed in a string. This section is designed to simply expose the reader to the different varieties of string-formatting.

pyformat.info is the best resource to consult to see an exhaustive (but still intuitive) treatment of string-formatting in Python. You can also refer to the official documentation here.

In Python 3, you can leverage the format method towards this end:

# using `format` to replace placeholders with values
>>> "{name} is {age} years old".format(name="Bruce", age=80)
'Bruce is 80 years old'

# padding a string with leading-spaces so that it has at least 8 characters
>>> "{item:>8}".format(item="stew")
'   stew'

Note that you may encounter the use of the cryptic % operator to format strings to the same effect:

# using `%` to  format strings (avoid this)
>>> name = "Selina"
>>> "My name is %s" % name
'My name is Selina'

this is a relic of Python 2; it is recommend that you avoid this formatting syntax.

If you are using Python 3.6 or beyond, then you have the luxury of being able to use f-strings, which provide a supremely convenient means for formatting strings. Here is an example of an f-string in action:

# an example of an 'f-string'
>>> batman = 12
>>> catwoman = 10
>>> f"Batman has {batman} apples. Catwoman has {catwoman} apples. Together, they have {batman + catwoman} apples"
'Batman has 12 apples. Catwoman has 10 apples. Together, they have 22 apples'

See that an f-string has a special syntax; an f-string is denoted by preceding the opening quotation mark with the lowercase f character:

# this is a typical empty string
>>> ""
''

# this is an empty f-string
>>> f""
''

An f-string is special because it permits us to write Python code within a string; any expression within curly brackets, {}, will be executed as Python code, and the resulting value will be converted to a string and inserted into the f-string at that position.

>>> x = 7.9
>>> f"x is a {type(x)}-number. Its value is {x}. The statement 'x is greater than 5' is {x > 5}"
"x is a <class 'float'>-number. Its value is 7.9. The statement 'x is greater than 5' is True"

As seen in the preceding examples, this permits us to elegantly include variables in our strings and even do things like call functions within the string construction syntax.

f-string Compatibility:

The ‘f-string’ syntax was introduced in Python 3.6. It is not available in earlier versions of Python.

Official documentation for strings

It is highly recommended that you take time to read over all of the functions that are built-in to a string.

  • Built-in functions for stringsF

  • ormatting strings

Reading Comprehension: Strings

To answer some of the following questions, you will need to peruse the documentation for the built-in functions of strings. It may take a bit of experimentation to understand the documentation’s use of square-brackets to indicate optional inputs for a function.

  1. Use a function that will take the string "cat", and returns the string "   cat    " (which has a length of 11, including the letters c, a, t). Now, change the way you call the function so that it returns "----cat----" instead.

  2. Replace the first three periods of this string with a space-character: "I.am.aware.that.spaces.are.a.thing"

  1. Remove the whitespace from both ends of: "  basket    "

  1. Create a string that will print as (the second line begins with a tab-character):

Hello
    over there
  1. Convert the integer 12 to the string "12".

  2. Only kids 13 and up are allowed to see Wayne’s World. Given the variables name (a string) and age (an integer), use an f-string that will display: “NAME is old enough to watch the movie: BOOL”, where NAME is to be replaced with the kid’s name, and BOOL should be True if the kid is at least 13 years old, and False otherwise.

Lists

list is a type of Python object that allows us to store a sequence of other objects. One of its major utilities is that it provides us with means for updating the contents of a list later on.

A list object is created using square-brackets, and its contents are separated by commas: [item1, item2, ..., itemN]. Its contents need not be of the same type of object.

# a list-type object stores a sequence of other objects
>>> [3.5, None, 3.5, True, "hello"]
[3.5, None, 3.5, True, 'hello']

>>> type([1, 2, 3])
list

>>> isinstance([1, 2], list)
True

# constructing an empty list
>>> []
[]

# constructing a list with only one member
>>> ["hello"]
["hello"]

You can also include variables, equations, and other Python expressions in the list constructor; Python will simplify these expressions and construct the list with the resulting objects.

# the list constructor will simplify expressions
# and store their resulting objects
>>> x = "hello"
>>> [2 < 3, x.capitalize(), 5**2, [1, 2]]
[True, 'Hello', 25, [1, 2]]

The built-in list type can be used to convert other types of sequences (and more generally, any iterable object, which we will discuss later) into a list:

# `list` forms a list out of the contents of other sequences
>>> list("apple")
['a', 'p', 'p', 'l', 'e']

Lists are sequences

Like a string, the ordering of a list’s contents matters, meaning that a list is sequential in nature.

# A list's ordering matters
>>> [1, "a", True] == [1, True, "a"]
False

Thus a list supports the same mechanism for accessing its contents, via indexing and slicing, as does a string. Indexing and slicing will be covered in detail in the next section.

# Accessing the contents of a list with indexing and slicing
>>> x = [2, 4, 6, 8, 10]

# `x` contains five items
>>> len(x)
5

# access the 0th item in the list via "indexing"
>>> x[0]
2

# access a subsequence of the list via "slicing"
>>> x[1:3]
[4, 6]

Lists can be “mutated”

We will encounter other types of containers in Python, what makes the list stand out is that the contents of a list can be changed after the list has already been constructed. Thus a list is an example of a mutable object.

# changing a list after it has been constructed
>>> x = [2, 4, 6, 8, 10]
>>> y = [2, 4, 6, 8, 10]

# "set" the string 'apple' into position 1 of `x`
>>> x[1] = "apple"
>>> x
[2, 'apple', 6, 8, 10]

# replace a subsequence of `y`
>>> y[1:4] = [-3, -4, -5]
>>> y
[2, -3, -4, -5, 10]

The built-in list-functions “append” and “extend” allow us to add one item and multiple items to the end of a list, respectively:

>>> x = [2, 4, 6, 8, 10]

# use `append` to add a single object to the end of a list
>>> x.append("moo")
>>> x
[2, 4, 6, 8, 10, 'moo']

# use `extend` to add a sequence of items to the end of a list
>>> x.extend([True, False, None])
>>> x
[2, 4, 6, 8, 10, 'moo', True, False, None]

The “pop” and “remove” functions allow us to remove an item from a list based on its position in the list, or by specifying the item itself, respectively.

>>> x = ["a", "b", "c", "d"]

# pop the position-1 item out from a list
# `pop` will return the item that gets removed.
>>> x.pop(1)
'b'

>>> x
['a', 'c', 'd']

# remove the object "d" from the list
>>> x.remove("d")
>>> x
['a', 'c']

Official documentation for lists

It is highly recommended that you take time to read over all of the functions that are built-in to a list. These are all designed to allow us to either inspect or mutate the contents of a list.

  • Built-in functions for a list

Reading Comprehension: Lists

To answer some of the following questions, you will need to peruse the documentation for the built-in functions of lists.

  1. Create a list whose sole entry is the None object.

  2. Assign to the variable k a list that contains an integer, a boolean, and a string, in that order. Then, add two more entries to the end of the list: a float and a complex number.

  3. Alphabetize the list of names: ["Jane", "Adam", "Ryan", "Bob", "Zordon", "Jack", "Jackenzie"].

Summary

The term “object” is a catch-all in Python, meaning anything that we can assign to a variable. Objects behave differently from one another according to what “type” a given object is.

We reviewed several fundamental object types in Python:

  • intfloatcomplex: the numerical types

  • bool: the boolean type. True and False are the only boolean-type objects

  • NoneType: the “null” type; None is the only object that belongs to this type

  • str: the string type

  • list: the list type

The built-in function type permits us to check the type of any object:

>>> type(3.2)
float

>>> type(True)
bool

The built-in function isinstance should be used to check if an object is of a specific type:

>>> x = 2 + 3
>>> isinstance(x, int)
True

The only exception to this is if you want to check if an object is of the type NoneType, since this is only possible if the object is None; thus it is “cleaner” to directly check this:

>>> x = None
>>> x is None
True

Objects of different types have different built-in functions available to them:

>>> x = "I am a farmer.. moo"
>>> x.upper()
'I AM A FARMER.. MOO'

>>> y = 0.5
>>> y.as_integer_ratio()
(1, 2)

>>> z = [1, 2]
>>> z.append(3)
>>> z
[1, 2, 3]

You should leverage the official documentation, for which links were provided throughout this section, whenever you are wondering how to best do a specific task with a given type of object.

As a final reminder, we saw that that an integer in Python is able to hold a value with arbitrarily-many digits. A floating-point number, on the other hand, is restricted in the number of “significant” digits it can hold. Thus, where it is perfectly fine to check if two integers are exactly equal:

# checking if two integers are equal is great!
>>> 2 + 2 == 4
True

you should never rely on two floats being exactly equal. Instead, check if they are close in value:

# checking if two floats are equal is lame!
>>> 0.1 + 0.1 + 0.1 - 0.3 == 0.
False

>>> abs((0.1 + 0.1 + 0.1 - 0.3) - 0.) < 1e-12
True

It is very important to remember this issue of the limited numerical precision of a floating point number.

List

Lists are used to store multiple items in a single variable.

Lists are one of 4 built-in data types in Python used to store collections of data, the other 3 are TupleSet, and Dictionary, all with different qualities and usage.

Lists are created using square brackets:

Example

Get your own Python Server

Create a List:

thislist = ["apple""banana""cherry"]
print(thislist)


Comments