Microsoft Power Automate Desktop: Capabilities and Instructive Scenarios on How to Use It

This paper is about Microsoft Power Automate Desktop. We describe the major capabilities in the tool and then show how these types of capabilities can be used in different types of process automation scenarios. So, think of this paper as a simple learning primer on Power Automate Desktop for readers who have not yet worked with it or more generally have not yet engaged with robotic process automation. We hope you will find this paper a down-to-earth guide to when where and how to use Power Automate Desktop to start moving yourself and your company in the direction of complete digital automation. As a Microsoft Gold Partner, we are ramping up our efforts across the entire set of tools in the Microsoft Power Platform, and we will be publishing many new papers to highlight this area of our company’s expertise. We are investing heavily in the training and certification of our developers and data engineers in the full set of tools, but most especially Power Automate, of which Power Automate Desktop is one component. We have other papers in this series describing Innovatix’s initiatives in this area, including a series of case studies of some of our recent projects built using Microsoft Power Automate. One final note. Innovatix’s commitment to RPA is consistent with the view introduced by Gartner in one of its recent analyses of the state of technology for RPA. Gartner defines this view as hyper-automation, namely “the idea that anything that can be automated in an organization should be automated”. Introduction In this white paper, we discuss the major capabilities offered by Microsoft Power Automate Desktop. Microsoft Power Automate Desktop (PAD) is a workflow automation platform that helps automate rules-based mundane tasks on the desktop or the web. It offers a tremendous and growing set of functionalities including conditionals, variables, OCR automation, desktop automation, web automation, and more. We will discuss each area and shed light upon some of the functionalities offered in each area that might spark your interest in the product. PAD has more than 360 prebuilt actions, so we are not going to go through all of them. After discussing the capabilities of PAD, we will dive into some scenarios where PAD might help you in the daily tasks that you perform on your computer. We expect to be adding more such scenarios in the coming months, as we encounter new and interesting ones. Our goal is to highlight real-life workflow examples where PAD can be very helpful so readers can see how easy and quick it is to get this same case study going on their own desktop or the web.  PAD Capabilities Here is a starter list of 13 capabilities in PAD and summary descriptions of each: Capability Description Where Useful UI Automation PAD lets you click a UI element in a window, select a tab in a window, select menu options, drag and drop UI elements in a window and expand or collapse a tree node in a window. PAD also offers us form filling functionalities including populating a text field in a window, pressing a button, selecting a radio button, setting a checkbox state, and setting up drop-down list values in a window. These features are helpful when we encounter automation where we need form-filling capabilities and need to automate applications on the desktop. Web Automation PAD allows us to launch browsers including Chrome, Firefox, Edge, and Internet Explorer. Create a new tab in a browser, click a link on a web page and close a web page. Extract data from a webpage and take a screenshot of a web page. These features are useful when making automation that requires navigating through a webpage and extracting data from them. After extracting the data, we can paste it into files if this is the required use case. Excel Launch new or existing excel files. It also has read and write capabilities. When we combine PAD Excel with PAD Desktop it creates a powerful result as it allows us to use various excel functionalities through clicking and selecting the radio buttons and more. Email and Outlook PAD allows us to retrieve email messages, process email messages, and send email messages. We can also launch Outlook and retrieve, send, process, and save email messages in the Outlook application. This is useful in all alert use cases as well as applications where data and information need to be communicated to others. Mouse and Keyboard We can get the mouse position, move the mouse to an image or text, send a mouse click and send keys. PAD allows us to use all the Keyboard keys including the special keys. This is helpful in automation where we need to fill in forms on web pages for example signing in into an account. Conditionals and Loops PAD offers a plethora of conditionals besides the basic else, including else if, if and switch for example if file exists, if folder exists, if process exists, and more. Then PAD offers loops that include  each loop. These advanced conditionals help in use cases where basic conditionals fail. Each loop is useful in iterating through a list and performing a bunch of actions repeatedly. Wait PAD offers a bunch of wait functions which include wait for file, wait for a process, and more. Sometimes when making automation the next process starts before the previous one ends. This causes the automation to fail. In these cases, we use wait functions which adds a delay in the start of the next step and therefore gives time for the previous step to finish. Variables Variable functions are used to store data for further processing in automation. PAD has a diverse range of variable functions which include generating a random number, truncating a number, clearing a list, merging lists, and more.   System PAD System functions include taking a screenshot, emptying the recycle bin, locking the workstation, logging off the user, running PowerShell scripts, print documents, and more. System functions aid in building automation

Imaan Ali December 3, 2020 No Comments

Document Understanding: A Short Guide on the Major Process Steps

1.   Executive Summary Document Understanding (DU) is one of the fastest-growing areas in business process automation. The DU ecosystem includes technologies that can interpret and extract text and meaning from a wide range of document types including structured, semi-structured and unstructured — even ones that contain handwriting, tables and checkboxes. This is now possible because of the ever-improving techniques of Machine Learning (ML).  Enhancements in ML are spurring innovation in document understanding. In this article, we present the major steps in the DU process and the underlying architecture with reference specifically to UiPath’s Document Understanding framework. Multiple technologies can unlock the power of document understanding such as: In today’s business processes, most of the routine and mundane tasks employees perform consist of creating, reading, reviewing, and transcribing paperwork (documents). Employees spend a significant percentage of their work time reading these docs, extracting data, and passing on the much-needed information into other downstream applications manually. Since the data extraction from the documents and input to other apps is done by a human, the process is subject to problems of accuracy and reliability. UiPath’s Document Understanding solution allows you to intelligently process data with a high level of accuracy and reliability for any type of document such as invoice, receipt, financial statement, utility bill, and any other kind of text that has a different structure. The general flow for UiPath’s DU process is encapsulated into the 6 process steps below. To decide which steps are needed for a specific business process, you will need to address the below requirements: One primary concern of the solution is that it should not stop the entire process until a human performs a manual verification. The process should escalate the check to the respective party, while at the same time continuing to evaluate and process the rest of the documents. 2.   Classification Based Approach There are scenarios where data extraction is not essential, and the priority is only to segregate the documents based only on classification, for further processing later in another process. In such cases, the UiPath DU solution comes in very handy as it provides the capability to classify documents based on keywords. The solution offers the ability to train the classifiers intelligently when setting up the automation solution. These classifiers will also continue to learn every time a document is classified (and verified by a human) thus improving accuracy over time. The classification and verification process steps are suitable for attended automation. The attended automation provides a Classification Station, where a user can verify and correct the classification if the confidence is below a predefined value. A schematic of the process is shown in the below chart. Classification Process In most business scenarios, classification is not the only requirement. Most processes will also require the extraction of data from the documents and processing of the extracted data according to specified business requirements. However, even in this case, we cannot ignore the classification process step in the automated approach, as it is essential to identify the type of document so that the robot knows how and what fields to extract. Different methods are available to handle manual verification of classification results: Use attended and unattended collaboration in scenarios where the process should be manually triggered. If the same user who triggered the process is doing the validation, the use of a Validation Station is possible. However, based on the business logic, if certain exceptional cases need management approval, such escalations can be directed to the Action Centre directly without showing to the user at the Validation Station. When designing a Document Understanding solution, it is a good practice to break the solution into separate manageable sub-processes. As a generic solution that fits for most cases, we could introduce three sub-processes to handle the Document Understanding Framework. This high-level diagram showcases a sample architecture for the Document Understanding process. The architecture used here breaks the entire document understanding process into three main sub-processes. The three main components are Initiator process along with processing logic (Process 1), UiPath Action Centre for task assignment and management (Process 2), and finally, the Train models component (Process 3) which handles the training of the intelligent classifiers and also the passing of the extracted data to other applications. The detailed architecture of each part is as follows. Process-1: The initiator Process The Initiator process is the primary process that handles document classification, data extraction, and verification logic. The verification logic will include the rules that define how to handle verifications automatically, either through the use of the Validation Station, Action Centre, or both when human intervention is needed. Depending on the option chosen in the validation logic, the extracted data will finally be passed to either the Action Centre Processor or to Post Processing to continue to the next steps. The diagram below shows a sample architecture for the Initiator Process. Process 1: Initiator Process Process 2: Action Center The Action Centre is the process that handles task creation, waits for task completion, and finally passes the data to the Post Processing portion for the end of Document Understanding. The diagram below shows a sample architecture for the Action Center process. Process 2: Action Center Process Architecture Process 3: Post Processing The post-processing includes the steps needed for exporting the final verified data, training the models, and finally, passing the data to a different process outside the document understanding framework to continue with any system interactions, etc. The data is handed over to a separate process because such steps are not part of document understanding and those should be maintained independently to maintain integrity and reusability of each component/ process. The diagram below shows a sample architecture design for the final stage of the document understanding framework, and it also showcases how Process 2 connects with Process 3. Process 3: Post Processing Solution Architecture Conclusion Although the specifics of the business process may change from one company to another, the core architecture and process steps showcased above remain largely the same

Imaan Ali November 24, 2020 No Comments

Is Your Business In Need of Process Mining?

5 Signs that Your Business Process is too Complicated and In Need of Process Mining As more automation opportunities arise, it is becoming increasingly imperative for businesses to use process mining to examine their business processes closely and determine those most ready to yield significant benefits from automation (and not solely focus on the latest technology stack!). Getting a more complete view of reality of a company’s processes and understand precisely what is happening there in real time and at the individual unit of output, these things can enable a company to move closer and much quicker to the goal of maximizing ROI through automation. Process mining has always been a mainstay of process automation, but the surprising part – it is often overlooked or not given the priority it deserves by clients in their rush to RPA and Intelligent Automation. The underlying assumption is that processes in the company are running in the manner BPM has mapped them out, without exceptions. But in reality that is often not so, there are many alternatives and exceptional paths that business in a company gets executed, and it is the goal of process mining to find them all out and present them to the company’s process engineers for inspection and further corrective action. The good part, process mining is a leading business process management technique that can be used relatively easily and quickly by an organization without any disruption to the underlying business processes. And the even better part, it can be used by companies of all sizes, it is not exclusively a large company technology. Once implemented it allows for ease in process automation and provides much better results than any manual mining operation. The purpose of this short blog is to lay out the 5 most important factors that indicate your company really needs to look into process mining as the way to better understand your business processes. If your response to any one or two of these 5 key process characteristics is in the negative, our recommendation is your look into process mining.  In our view, it is very likely your business process will benefit big time from process mining, most companies we are aware of do!  It provides real-time, actual, quantitative guideposts of your processes – that is in itself is a big advantage as all companies are heading to complete digital transformation. Moreover, it will allow you to analyze which specific processes are most in need of automation, and if using process mining can improve the company’s processes substantially and the order with which to begin! Here are some critical signs to look for in a process that suggest your process likely will be in need of process mining, and the benefits it provides: 1. Significant Data Analysis in Your Process By combining Robotic Process Automation (RPA) with Process Mining, businesses are now able to configure massive amounts of data and highlight the most valuable data and trends in order to make it readable to employees. So, keep in mind, process mining not only helps to identify and eliminate inefficiencies but it also streamlines process data collection. If you are getting overloaded with process data streams, and you’re able to only look at a few, process mining can help a great deal in this regard. Below is a data sample from SAP Process Mining by Celonis. Not only does their process mining tool streamline the data but also enables the company to customize their own categories for data sorting. 2. Not Truly Understanding your Process Not having a full understanding of business processes, and all the ways business things actually get done, can a big problem especially in the face of imperatives such as 100% process automation and digital transformation. This lack of process knowledge can come from many different factors: in some cases, processes are just too complex to follow so employees generalize them and hope for the best. Over time, this can lead to discrepancies in how each worker understands the processes and how they actually work, since essentially no one is fully aware of the process as a whole, rather everyone only knows it from their specific view of the surface. If it is hard to follow what is actually being going on in your processes, and in particular you can’t get a hold of all the exceptional process paths that works get done on, that is a surefire sign that process mining is likely needed and will be of great help. Take a look at the Purchase-Order example below and you will see how crazy and complex processes can be. More often than not, business processes tend to be just as detailed as the one below, if not more. Without a process mining tool, businesses can never truly wrap their head around the ins and outs of their process, causing them to miss the small deviations that go on to cause greater issues later on.          3. Complex HR work      The HR department tends to contain a plethora of repetitive tasks that can frustrate humans. All the manual labor that goes into inputting information into spreadsheets and logs is not only inefficient but is not making the best use of the people in HR whose jobs pertain to communication with others. Process mining (especially the element of it called task mining) can allow for these employees to be doing the work they were meant to do while their tedious processes become essentially autonomous. 4. Unidentifiable Problems or Gaps in Your End-to-end Process If businesses do not have a clear picture of their processes end-to-end as they actually exists, they will find it difficult even to notice or pinpoint any serious issues, let alone find solutions for them. Not understanding the process end-to-end can mean that there are gaps needing to be understood and filled in. Within these gaps are where the most bottlenecks are likely to arise since there is a lack of accountability for what the process flow is. Process Mining

©2026 Innovatix Technology Partners, a Macrosoft, Inc. Company. All Rights Reserved.