Askaiser.Marionette 1.1.0

There is a newer version of this package available.
See the version list below for details.
dotnet add package Askaiser.Marionette --version 1.1.0                
NuGet\Install-Package Askaiser.Marionette -Version 1.1.0                
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="Askaiser.Marionette" Version="1.1.0" />                
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Askaiser.Marionette --version 1.1.0                
#r "nuget: Askaiser.Marionette, 1.1.0"                
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install Askaiser.Marionette as a Cake Addin
#addin nuget:?package=Askaiser.Marionette&version=1.1.0

// Install Askaiser.Marionette as a Cake Tool
#tool nuget:?package=Askaiser.Marionette&version=1.1.0                

Askaiser.Marionette

Askaiser.Marionette is a test automation framework based on image and text recognition. It includes a C# source generator that allows you to quickly interact with properties generated by images from your project or elsewhere. The framework is built on top of OpenCV and Tesseract OCR and only works on Windows for now.

nuget build coverage

Why use Askaiser.Marionette

  • Unlike other test automation frameworks, Askaiser.Marionette does not rely on hardcoded identifiers, CSS or XPath selectors. It uses image and text recognition to ensure that you interact with elements that are actually visible on the screen.
  • Maintaining identifiers, CSS and XPath selectors over time can be hard. Capturing small screenshots and finding text with an OCR is not.
  • With the built-in C# source generator, you can start writing the test code right away.
  • You can interact with the whole operating system, instead of a single application.
  • This means you can also test desktop applications!
  • It works well with BDD and SpecFlow.
  • You can use it to create a bot.

Askaiser.Marionette in action

  • 00:00 : Capture screenshots of the app you're testing,
  • 00:08 : Rename and organize your screenshots in a meaningful way,
  • 00:22 : Drop your screenshots in your project,
  • 00:30 : Use ImageLibrary to automatically generate properties from your screenshots,
  • 01:06 : Use MarionetteDriver to interact with the generated properties (or even text recognized by the OCR)!

https://user-images.githubusercontent.com/14242083/126416123-aebd0fce-825f-4ece-90e9-762503dc4cab.mp4

Getting started

dotnet add package Askaiser.Marionette

It supports .NET Standard 2.0, .NET Standard 2.1 an .NET 6, but only on Windows for now.

using (var driver = MarionetteDriver.Create(/* optional DriverOptions */))
{
    // in this exemple, we enter a username and password in a login page
    await driver.WaitFor(library.Pages.Login.Title, waitFor: TimeSpan.FromSeconds(5));

    await driver.SingleClick(library.Pages.Login.Email);
    await driver.TypeText("much@automated.foo", sleepAfter: TimeSpan.FromSeconds(0.5));
    await driver.SingleClick(library.Pages.Login.Password);
    await driver.TypeText("V3ry5ecre7!", sleepAfter: TimeSpan.FromSeconds(0.5));

    await driver.SingleClick(library.Pages.Login.Submit);
    
    // insert more magic here
}

The sample project will show you the basics of using this library.

Creating image and text elements manually

// Instead of relying on the source generator that works with image files, you can create an ImageElement manually
var bytes = await File.ReadAllBytesAsync("path/to/your/image.png");
var image = new ImageElement(name: "sidebar-close-button", content: bytes, threshold: 0.95m, grayscale: false);
  • ImageElement.Threshold is a floating number between 0 and 1. It defines the accuracy of the image search process. 0.95 is the default value.
  • ImageElement.Grayscale defines whether or not the engine will apply grayscaling preprocessing. Image search is faster with grayscaling.
Although many methods accept a simple string as an element, you can manually create a TextElement
var text = new TextElement("Save changes", options: TextOptions.BlackAndWhite | TextOptions.Negative);

Text options are flags that define the preprocessing behavior of your monitor's screenshots before executing the OCR.

  • TextOptions.None : do not use preprocessing,
  • TextOptions.Grayscale : Use grayscaling,
  • TextOptions.BlackAndWhite : Use grayscaling and binarization (this is the default value),
  • TextOptions.Negative : Use negative preprocessing, very helpful with white text on dark background.

Source generator behavior

TODO: I will explain how to define settings for each individual image (threshold and grayscaling), as well as grouping images into an array property. All of this can be done by using special keywords in each image file name.

Show me the APIs

Many parameters are optional. Most methods that look for an element (image or text) expect to find only one occurrence of this element. ElementNotFoundException and MultipleElementFoundException can be thrown.

You can use DriverOptions.FailureScreenshotPath to automatically save screenshots when these exceptions occur.

Configuration and utilities

static Create()
static Create(DriverOptions options)

GetScreenshot()
GetCurrentMonitor()
GetMonitors()
SetCurrentMonitor(int monitorIndex)
SetCurrentMonitor(MonitorDescription monitor)
SetMouseSpeed(MouseSpeed speed)
Sleep(int millisecondsDelay)
Sleep(TimeSpan delay)

Basic methods

WaitFor(IElement element, TimeSpan? waitFor, Rectangle searchRect)
WaitForAll(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
WaitForAny(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
SingleClick(int x, int y)
DoubleClick(int x, int y)
TripleClick(int x, int y)
RightClick(int x, int y)
MoveTo(int x, int y)
DragFrom(int x, int y)
DropTo(int x, int y)
TypeText(string text, TimeSpan? sleepAfter)
KeyPress(VirtualKeyCode[] keyCodes)
KeyDown(VirtualKeyCode[] keyCodes)
KeyUp(VirtualKeyCode[] keyCodes)
ScrollDown(int scrollTicks)
ScrollUp(int scrollTicks)
ScrollDownUntilVisible(IElement element, TimeSpan totalDuration, int scrollTicks, Rectangle searchRect)
ScrollUpUntilVisible(IElement element, TimeSpan totalDuration, int scrollTicks, Rectangle searchRect)

Mouse interaction with an element

MoveTo(IElement element, TimeSpan? waitFor, Rectangle searchRect)
SingleClick(IElement element, TimeSpan? waitFor, Rectangle searchRect)
DoubleClick(IElement element, TimeSpan? waitFor, Rectangle searchRect)
TripleClick(IElement element, TimeSpan? waitFor, Rectangle searchRect)
RightClick(IElement element, TimeSpan? waitFor, Rectangle searchRect)
DragFrom(IElement element, TimeSpan? waitFor, Rectangle searchRect)
DropTo(IElement element, TimeSpan? waitFor, Rectangle searchRect)

Check for element visibility

IsVisible(IElement element, TimeSpan? waitFor, Rectangle searchRect)
IsAnyVisible(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
AreAllVisible(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)

Mouse interaction with the first available element of a collection

MoveToAny(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
SingleClickAny(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
DoubleClickAny(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
TripleClickAny(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
RightClickAny(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
DragFromAny(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)
DropToAny(IEnumerable<IElement> elements, TimeSpan? waitFor, Rectangle searchRect)

Text-based actions

WaitFor(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
MoveTo(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
SingleClick(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
DoubleClick(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
TripleClick(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
RightClick(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
DragFrom(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
DropTo(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
IsVisible(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)

Mouse interaction with points

MoveTo(Point coordinates)
SingleClick(Point coordinates)
DoubleClick(Point coordinates)
TripleClick(Point coordinates)
RightClick(Point coordinates)
DragFrom(Point coordinates)
DropTo(Point coordinates)

Mouse interaction with WaitFor search result

MoveTo(SearchResult searchResult)
SingleClick(SearchResult searchResult)
DoubleClick(SearchResult searchResult)
TripleClick(SearchResult searchResult)
RightClick(SearchResult searchResult)
DragFrom(SearchResult searchResult)
DropTo(SearchResult searchResult)

Key press with single key code

KeyPress(VirtualKeyCode keyCode, TimeSpan? sleepAfter)
KeyDown(VirtualKeyCode keyCode, TimeSpan? sleepAfter)
KeyUp(VirtualKeyCode keyCode, TimeSpan? sleepAfter)

System.Drawing.Image-based actions

WaitFor(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
MoveTo(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
SingleClick(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
DoubleClick(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
TripleClick(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
RightClick(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
DragFrom(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
DropTo(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
IsVisible(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)

Finding elements locations without throwing not found exceptions or multiple element found exceptions

FindLocations(IElement element, TimeSpan? waitFor, Rectangle searchRect)
FindLocations(string text, TimeSpan? waitFor, Rectangle searchRect, TextOptions textOptions)
FindLocations(Image image, TimeSpan? waitFor, Rectangle searchRect, decimal threshold, bool grayscale)
Product Compatible and additional computed target framework versions.
.NET net5.0 was computed.  net5.0-windows was computed.  net6.0 is compatible.  net6.0-android was computed.  net6.0-ios was computed.  net6.0-maccatalyst was computed.  net6.0-macos was computed.  net6.0-tvos was computed.  net6.0-windows was computed.  net7.0 was computed.  net7.0-android was computed.  net7.0-ios was computed.  net7.0-maccatalyst was computed.  net7.0-macos was computed.  net7.0-tvos was computed.  net7.0-windows was computed.  net8.0 was computed.  net8.0-android was computed.  net8.0-browser was computed.  net8.0-ios was computed.  net8.0-maccatalyst was computed.  net8.0-macos was computed.  net8.0-tvos was computed.  net8.0-windows was computed.  net9.0 was computed.  net9.0-android was computed.  net9.0-browser was computed.  net9.0-ios was computed.  net9.0-maccatalyst was computed.  net9.0-macos was computed.  net9.0-tvos was computed.  net9.0-windows was computed. 
.NET Core netcoreapp2.0 was computed.  netcoreapp2.1 was computed.  netcoreapp2.2 was computed.  netcoreapp3.0 was computed.  netcoreapp3.1 was computed. 
.NET Standard netstandard2.0 is compatible.  netstandard2.1 is compatible. 
.NET Framework net461 was computed.  net462 was computed.  net463 was computed.  net47 was computed.  net471 was computed.  net472 was computed.  net48 was computed.  net481 was computed. 
MonoAndroid monoandroid was computed. 
MonoMac monomac was computed. 
MonoTouch monotouch was computed. 
Tizen tizen40 was computed.  tizen60 was computed. 
Xamarin.iOS xamarinios was computed. 
Xamarin.Mac xamarinmac was computed. 
Xamarin.TVOS xamarintvos was computed. 
Xamarin.WatchOS xamarinwatchos was computed. 
Compatible target framework(s)
Included target framework(s) (in package)
Learn more about Target Frameworks and .NET Standard.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version Downloads Last updated
1.5.1-preview.1 156 1/3/2023
1.5.0 510 1/2/2023
1.4.0 302 12/31/2022
1.3.0 535 7/3/2022
1.2.0 454 7/1/2022
1.1.0 443 6/27/2022
1.0.0 422 8/23/2021
0.1.4 435 8/1/2021
0.1.3 479 7/31/2021
0.1.2 359 7/24/2021
0.1.1 379 7/21/2021
0.1.0 347 7/19/2021