Python Xlsxwriter Create Excel Part 4 Conditional Formatting

Sharing is caring!

Last Updated on July 14, 2022 by Jay

The Python xlsxwriter library is capable of creating all sorts of conditional formatting for our Excel file. Let’s take a look at how to do that. One of our friends on Youtube requested this, and here you go!

This is part of the automating Excel series. If you need an introduction to the xlsxwriter library that we are using, head to part 1 of the series.

Excel Conditional Formatting

The name says it all – formatting based on conditions. It’s a neat feature in Excel that allows us to highlight (in most cases) cells according to certain conditions. And of course, we, the users get to define those conditions.

Also, what makes conditional formatting great is that the formatting is dynamic – when the values change, the format will change too.

In Excel, click on Conditional Formatting, we see a bunch of choices, let’s go down to “New Rules”. This will open up a window that basically summarizes all the options. The ways we can define the conditions are super flexible, so there’s no way to cover every single case. I’ll cover a few common scenarios here, but if you need help with any specific example, leave a comment down below and I will get back to you ????

Excel Conditional Formatting

Getting The Python Environment Ready

Let’s head into Python, generate some values, and a xlsxwriter workbook. Time for Christmas decoration!

The three formats we have created are format_r (red), format_y (yellow), and format_g (green).

import xlsxwriter

wb = xlsxwriter.Workbook(r'C:\Users\jay\Desktop\PythonInOffice\excel-file-formula\conditional_formatting.xlsx')
ws = wb.add_worksheet('formatting')

list_1 = list(range(10))
list_2 = [5,4,3,2,1,9,8,7,6,0]
list_3 = [i for i in range(-10,10)]
list_text = ['python','excel','python xlsxwriter','python automate excel','excel automate python','python']

format_r = wb.add_format({'bg_color':   '#FFC7CE',
                          'font_color': '#9C0006'})

format_y = wb.add_format({'bg_color':   '#FFEB9C',
                          'font_color': '#9C6500'})

format_g = wb.add_format({'bg_color':   '#C6EFCE',
                          'font_color': '#006100'})

Review Time

If you need a refresher on how to add formats using xlsxwriter library, head to part 3 of the series.

  1. We need a workbook object (wb) and a worksheet object (ws).
  2. Either the “A1” or (row, col) style notation can be used to reference cells and ranges.
  3. Create Excel format by using the workbook.add_format() method.
  4. Format a cell by writing data and formatting simultaneously into cell/range.

New Concept

  1. To create conditional formatting , use worksheet.conditional_format('A1', {parameters})
  2. A conditional formatting is superimposed over the existing cell format. Not all cell format properties can be modified, such as font name, size, alignment, etc.
  3. Related to #2 above, most of the time we use conditional formatting just to highlight cells. i.e. change cell color.

Xlsxwriter Conditional Formatting parameters

The parameters inside the conditional_format() method must be a dictionary containing the properties that describe the type and style for the format. Some of the main properties are:

  • type – are we formatting cells, numbers, text, ranking, average, duplicate or formula? See the picture below, the “type” refers to the “Rule Type”.
  • criteria – do we want to find the “greater/less than”, “contain” certain text, top items, etc
  • value – usually in conjuction with the criteria, “great than 7”, “between 5 and 7”, “above” average, etc
  • format – the formatting, usually it’s just changing the cell/font color

Now we got the theory out of the way, let’s see how to apply them. Because we are going through several examples, so we’ll group our Python code into functions to make it more organized.

Following the order of the above screen “Rule Type” offered by Excel, we’ll take a look at each scenario below.

Conditional Formatting All Cells Color Scale

I hope you like rainbow ???? We can specify three colors (min, mid and max) and Excel will make a pretty rainbow for us. If you prefer only dual colors, then change 'type' to be '2_color_scale', and then simply remove the 'mid_color'.

def color_scale():
     ws.write('N1','3 color scale')
     ws.write_column('N2',list_1)
     ws.conditional_format('N2:N11',
                                              {'type': '3_color_scale',
                                'min_color':'red',
                                   'mid_color':'yellow',
                                   'max_color':'green'
                                        })
Conditional Formatting 3 Color Scale

Conditional Formatting All Cells Data Bar

You can make a lot of different cool stuff with the data bar, lots of flexibility there.

def data_bar():
     ws.write('P1','data bar')
     ws.write_column('P2',list_3)
     ws.conditional_format('P2:P22',
                                              {'type': 'data_bar',
                                               'bar_color':'green',
                                               'bar_only': True, #True - no data, False - show data
                                               'bar_solid': False, # True - solid color fill, False - gradient color
                                               'bar_negative_color':'red',
                                               'bar_direction': 'left', # or 'right'
                                               'bar_axis_position':'middle', # or 'none'
                                              })
Conditional Formatting – Data Bar

Conditional Formatting Based On Numbers

Pay attention to the dictionary inside the ws.conditional_format method, especially what values are passed into those properties. The 'criteria' can be any of the following (either column will work):

‘between’
‘not between’
‘equal to’‘==’
‘not equal to’‘!=’
‘greater than’‘>’
‘less than’‘<‘
‘greater than or equal to’ ‘>=’
‘less than or equal to’ ‘<=’
Conditional Formatting Options
def based_on_number():
    ws.merge_range('B1:C1','>7 is green, <5 is red, in-between is yellow')
    ws.write_column('B2', list_1)
    ws.write_column('C2', list_2)
    
    ## greater than 7, green color
    ws.conditional_format('B2:C11',
                                              {'type': 'cell',
                                               'criteria': '>',
                                               'value': 7,
                                              'format':format_g
                                              })


    ## less than 5, red color
    ws.conditional_format('B2:C11',
                                              {'type': 'cell',
                                               'criteria': '<',
                                               'value': 5,
                                              'format':format_r
                                              })


    ## between 5 and 7, yellow color
    ws.conditional_format('B2:C11',
                                              {'type': 'cell',
                                               'criteria': 'between',
                                               'minimum':5,
                                               'maximum':7,
                                              'format':format_y
                                              })

The above code generates the following conditional formatting:

Python Generated Excel Conditional Formatting

Conditional Formatting Based On Numbers From Cell Input

This is similar to the previous example, except that we are not hardcoding the threshold values 5 and 7. We will let the formatting depends on cell values – even more dynamic!

Pay close attention to the ‘value' property in the code below – we need to use absolute references, or it won’t work. In general, for any 'value' property we need to use an absolute reference.

Once we generate this in Excel, you’ll see that the formatting will change as we modify the values inside cells B19 and C19.

def based_on_number_input():
    ws.merge_range('B18:C18','with number input')
    ws.write('B19',7) ## threshold value
    ws.write('C19',5) ## threshold value
    
    ws.write_column('B20', list_1)
    ws.write_column('C20', list_2)
    ws.conditional_format('B20:C29',
                                              {'type': 'cell',
                                               'criteria': '>',
                                               'value': '$B$19',
                                              'format':format_g
                                              })

    ws.conditional_format('B20:C29',
                                              {'type': 'cell',
                                               'criteria': '<',
                                               'value': '$C$19',
                                              'format':format_r
                                              })

Conditional Formatting Based On Text

We can check whether the cells contain certain text. The criteria property includes the following:

  • 'containing'
  • 'not containing'
  • 'begins with'
  • 'ends with'
def based_on_text():
    ws.write('E1','contain "python"')
    ws.write('F1','begins with "python"')
    ws.write_column('E2', list_text)
    ws.write_column('F2', list_text)
    ws.conditional_format('E2:E11',
                                              {'type': 'text',
                                               'criteria': 'containing',
                                               'value': 'python',
                                              'format':format_g
                                              })

    ws.conditional_format('F2:F11',
                                              {'type': 'text',
                                               'criteria': 'begins with',
                                               'value': 'python',
                                              'format':format_g
                                              })

Conditional Formatting Top/Bottom Ranked Values

We can highlight the items by 'top' or 'bottom' values, i.e. first 5 largest, or by percent, i.e. bottom 10% of the selected values. Omitting the 'criteria' means by count, while setting 'criteria' : '%' means by percentage.

def top_n():
    ws.write('H1','top 5')
    ws.write_column('H2',list_2)
    ws.conditional_format('H2:H11',
                                              {'type': 'top',
                                               'value': 5,
                                               #'criteria': '%',
                                              'format':format_g
                                              })

Conditional Formatting Above/Below Average

For this one, Excel will calculate an average for the selected range, then compare each number in the range with the average, and format accordingly.

def average():
    ws.write('J1','average')
    ws.write_column('J2',list_2)
    ws.conditional_format('J2:J11',
                                              {'type': 'average',
                                               'criteria': 'above',
                                              'format':format_g
                                              })

Conditional Formatting Unique/Duplicate Values

We can also highlight duplicated (or unique) values in a selected range.

def duplicate():
    ws.write('L1','duplicate')
    ws.write_column('L2',list_text)
    ws.conditional_format('L2:L11',
                                              {'type': 'duplicate',  #or 'unique'
                                              'format':format_g
                                              })

Conditional Formatting Based On Formula

We can also do conditional formatting based on a formula – and make our Excel even much more dynamic!!

However, formula-based formatting can be a little bit tricky to work with because some situations require absolute references while others require a non-absolute reference. My go-to strategy is: try the formula in Excel, with or without $ in your cell references. If it works in your Excel, taking the same formula over to Python will also work.

The below code compares numbers in column R and column S, then highlights (in green) the larger number between the two columns.

Note that 'type' is set to 'formula', in the 'criteria' we type the formula as if for the first item (of the selected range) only. Inside the conditional_format method, we are formatting cells R2:R11, the first element is R2, thus the formula is '=R2>S2'. If we want to apply the format to R3:R11, then the formula needs to be ‘=R3>S3’, etc.

Also, note that in this case, we are comparing two columns so we don’t use absolute references in the formula. In other cases, you might need to use absolute references to make this formula-based formatting work. See my go-to strategy above.

def based_on_formula():
    ws.merge_range('R1:S1','compare R and S, highlight the greater #')
    ws.write_column('R2', list_1)
    ws.write_column('S2', list_2)
    ws.conditional_format('R2:R11',
                                              {'type': 'formula',
                                               'criteria': '=R2>S2',
                                              'format':format_g
                                              })

    ws.conditional_format('S2:S11',
                                              {'type': 'formula',
                                               'criteria': '=S2>R2',
                                              'format':format_g
                                              })

That’s it! Leave a comment below if you need help with any specific scenario.

One comment

Leave a Reply

Your email address will not be published. Required fields are marked *