Microsoft Power Automate Desktop: Capabilities and Instructive Scenarios on How to Use It
This paper is about Microsoft Power Automate Desktop. We describe the major capabilities in the tool and then show how these types of capabilities can be used in different types of process automation scenarios. So, think of this paper as a simple learning primer on Power Automate Desktop for readers who have not yet worked with it or more generally have not yet engaged with robotic process automation. We hope you will find this paper a down-to-earth guide to when where and how to use Power Automate Desktop to start moving yourself and your company in the direction of complete digital automation. As a Microsoft Gold Partner, we are ramping up our efforts across the entire set of tools in the Microsoft Power Platform, and we will be publishing many new papers to highlight this area of our company’s expertise. We are investing heavily in the training and certification of our developers and data engineers in the full set of tools, but most especially Power Automate, of which Power Automate Desktop is one component. We have other papers in this series describing Innovatix’s initiatives in this area, including a series of case studies of some of our recent projects built using Microsoft Power Automate. One final note. Innovatix’s commitment to RPA is consistent with the view introduced by Gartner in one of its recent analyses of the state of technology for RPA. Gartner defines this view as hyper-automation, namely “the idea that anything that can be automated in an organization should be automated”. Introduction In this white paper, we discuss the major capabilities offered by Microsoft Power Automate Desktop. Microsoft Power Automate Desktop (PAD) is a workflow automation platform that helps automate rules-based mundane tasks on the desktop or the web. It offers a tremendous and growing set of functionalities including conditionals, variables, OCR automation, desktop automation, web automation, and more. We will discuss each area and shed light upon some of the functionalities offered in each area that might spark your interest in the product. PAD has more than 360 prebuilt actions, so we are not going to go through all of them. After discussing the capabilities of PAD, we will dive into some scenarios where PAD might help you in the daily tasks that you perform on your computer. We expect to be adding more such scenarios in the coming months, as we encounter new and interesting ones. Our goal is to highlight real-life workflow examples where PAD can be very helpful so readers can see how easy and quick it is to get this same case study going on their own desktop or the web. PAD Capabilities Here is a starter list of 13 capabilities in PAD and summary descriptions of each: Capability Description Where Useful UI Automation PAD lets you click a UI element in a window, select a tab in a window, select menu options, drag and drop UI elements in a window and expand or collapse a tree node in a window. PAD also offers us form filling functionalities including populating a text field in a window, pressing a button, selecting a radio button, setting a checkbox state, and setting up drop-down list values in a window. These features are helpful when we encounter automation where we need form-filling capabilities and need to automate applications on the desktop. Web Automation PAD allows us to launch browsers including Chrome, Firefox, Edge, and Internet Explorer. Create a new tab in a browser, click a link on a web page and close a web page. Extract data from a webpage and take a screenshot of a web page. These features are useful when making automation that requires navigating through a webpage and extracting data from them. After extracting the data, we can paste it into files if this is the required use case. Excel Launch new or existing excel files. It also has read and write capabilities. When we combine PAD Excel with PAD Desktop it creates a powerful result as it allows us to use various excel functionalities through clicking and selecting the radio buttons and more. Email and Outlook PAD allows us to retrieve email messages, process email messages, and send email messages. We can also launch Outlook and retrieve, send, process, and save email messages in the Outlook application. This is useful in all alert use cases as well as applications where data and information need to be communicated to others. Mouse and Keyboard We can get the mouse position, move the mouse to an image or text, send a mouse click and send keys. PAD allows us to use all the Keyboard keys including the special keys. This is helpful in automation where we need to fill in forms on web pages for example signing in into an account. Conditionals and Loops PAD offers a plethora of conditionals besides the basic else, including else if, if and switch for example if file exists, if folder exists, if process exists, and more. Then PAD offers loops that include each loop. These advanced conditionals help in use cases where basic conditionals fail. Each loop is useful in iterating through a list and performing a bunch of actions repeatedly. Wait PAD offers a bunch of wait functions which include wait for file, wait for a process, and more. Sometimes when making automation the next process starts before the previous one ends. This causes the automation to fail. In these cases, we use wait functions which adds a delay in the start of the next step and therefore gives time for the previous step to finish. Variables Variable functions are used to store data for further processing in automation. PAD has a diverse range of variable functions which include generating a random number, truncating a number, clearing a list, merging lists, and more. System PAD System functions include taking a screenshot, emptying the recycle bin, locking the workstation, logging off the user, running PowerShell scripts, print documents, and more. System functions aid in building automation