Certa's Tech Blog

Graph Databases: To Use or not to use?

Sachin Harpalani — Tue, 16 Apr 2024 05:33:50 GMT

Introduction

In today's interconnected world, data isn't just about rows and columns; it's about relationships. From social networks connecting friends and family to recommendation engines suggesting our next favorite movie, understanding these intricate connections is essential for unlocking the full potential of our data.

Enter graph databases: specialized databases designed to do just that.

In this blog post, we'll delve into the world of graph databases, exploring their unique features, typical use cases, and alternatives, to answer the question: To use or not to use?

Understanding Graph Databases

If graph databases are new to you, don't worry, we've got you covered. Let's dive into the basics.

Fundamentals

A graph database is a specialized, single-purpose platform used to create and manipulate data of an associative and contextual nature. The graph itself contains nodes, edges, and properties that come together to allow users to represent and store data in a way that relational databases aren't equipped to do.

The main concept of a graph database system is the relationship. Relationships are first-class citizens: everything you can do with other elements can also be done with a relationship. Data is stored in a graph as a collection of nodes and edges, where the edges represent the relationships between nodes.

Relationships allow data within the system to be linked together directly. Querying relationships in a graph database is fast because they're persisted with the data instead of being computed at query time. You can also visualize them, which makes them great for deriving insights from heavily interconnected data.

In this example, User nodes have the properties name and surname, and FOLLOWS is the relationship between the nodes, which can have properties of its own (like since).
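To make the model concrete, here is a minimal in-memory sketch of the same idea in plain JavaScript. It is not how a graph database stores data internally; the sample users, the ids, and the followedBy helper are made up for illustration.

```javascript
// Nodes and relationships both carry properties (sample data is hypothetical).
const nodes = new Map([
  ["u1", { label: "User", name: "Ada", surname: "Lovelace" }],
  ["u2", { label: "User", name: "Alan", surname: "Turing" }],
  ["u3", { label: "User", name: "Grace", surname: "Hopper" }],
]);

const relationships = [
  { type: "FOLLOWS", from: "u1", to: "u2", since: 2019 },
  { type: "FOLLOWS", from: "u1", to: "u3", since: 2021 },
  { type: "FOLLOWS", from: "u2", to: "u3", since: 2020 },
];

// Who does a user follow, optionally filtered by the relationship's own property?
function followedBy(userId, sinceAtLeast = -Infinity) {
  return relationships
    .filter((r) => r.type === "FOLLOWS" && r.from === userId && r.since >= sinceAtLeast)
    .map((r) => nodes.get(r.to).name);
}

console.log(followedBy("u1")); // [ 'Alan', 'Grace' ]
console.log(followedBy("u1", 2020)); // [ 'Grace' ]
```

Note that the filter on since works on the relationship itself, which is what "relationships can have properties" means in practice.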

Now that we are done with the basics, let's move on to the next segment.

Use Cases of Graph Databases

1. Social Networks: Graph databases are ideal for modeling social networks, where users, friendships, likes, and interactions form a complex web of relationships. By representing users as nodes and connections between them as edges, graph databases facilitate efficient querying for friend recommendations, community detection, and content personalization.

2. Recommendation Engines: Graph databases power recommendation engines by analyzing user behavior, preferences, and item similarities to generate personalized recommendations. By modeling users, items, and their interactions as nodes and edges, graph databases enable efficient recommendation algorithms that can scale to millions of users and items.

3. Fraud Detection: Graph databases play a crucial role in fraud detection by identifying patterns of suspicious behavior within networks of transactions, users, and entities. By analyzing the flow of money and connections between accounts, graph databases can uncover fraudulent activities such as money laundering, identity theft, and insider fraud.

4. Network Analysis: Graph databases are widely used in network analysis applications, including transportation networks, telecommunications networks, and biological networks. By representing network elements as nodes and connections as edges, graph databases enable advanced analytics such as route optimization, network visualization, and gene pathway analysis.

5. Features involving Hierarchical data: In addition to the standard use cases mentioned earlier, graph databases offer flexibility for implementing custom business logic involving hierarchical relationships. Whether managing organizational charts, product categorizations, or any other nested data models, graph databases excel at representing and querying hierarchical relationships efficiently. This versatility allows businesses to implement bespoke solutions tailored to their unique requirements, empowering them to derive insights and make informed decisions based on the underlying data relationships.

Commonly used graph query languages

As graph databases gain traction, the number of graph query languages has grown quickly, and navigating the options can be daunting for developers.

Let's narrow those down and take a quick look at some of the most commonly used graph query languages out there:

Cypher

Cypher is an open-source declarative query language developed by Neo4j for querying graph databases. It is part of the openCypher project, an open standard that aims to make Cypher available for use in various graph database systems, which has led to it being one of the most widely adopted query languages. Queries in Cypher are typically straightforward, beginning with the specification of a pattern to match and then permitting further refinements through filtering, aggregation, and other operations.

Gremlin

Developed as part of the Apache TinkerPop graph computing framework, Gremlin is notable for its language agnosticism, meaning it is not bound to a particular graph database system. Instead, Gremlin is compatible with various graph databases, making it a suitable choice for developers who require flexibility and want to work with multiple data sources.

GraphQL

GraphQL is a query language developed internally by Facebook in 2012 before being publicly released in 2015. It has gained significant popularity as an alternative to REST APIs. Unlike the other graph query languages discussed, GraphQL was designed for clients to query data from APIs and servers, not traditional graph databases. However, it shares similarities in its graph-structured queries.

Apart from these, there are more vendor-specific graph languages like AQL (ArangoDB), GSQL (TigerGraph), nGQL (NebulaGraph) and many more.

Alternatives to Graph Databases

Even though graph databases excel at managing interconnected data, their implementation and maintenance can be a time-consuming and expensive endeavor. However, they're not the only game in town.

Several alternative approaches exist, each with its strengths and weaknesses. Let's explore some of these alternatives and how they stack up against graph databases.

1. Recursive Common Table Expressions (CTEs) in Relational Databases: Relational databases, such as PostgreSQL and SQL Server, offer support for recursive common table expressions (CTEs). This feature allows for recursive queries that can traverse hierarchical or graph-like structures within relational data models. By recursively joining tables with themselves, developers can perform graph traversals and pathfinding operations directly within SQL queries.

2. Caching Solutions like Redis or Memcached: Another approach to handling graph-like data involves leveraging caching solutions like Redis or Memcached. These in-memory key-value stores excel at storing and retrieving data with low latency, which makes them a practical base for implementing algorithms like breadth-first search (BFS) on graph-like data structures. By caching intermediate results, developers can improve the performance of graph traversal operations, especially in read-heavy scenarios. Frequently changing data is harder to serve this way, since cached results must be invalidated whenever the underlying data changes.

3. Apache AGE: Apache AGE is a PostgreSQL extension that adds graph database functionality on top of PostgreSQL. It leverages the capabilities of PostgreSQL as a relational database while providing support for graph data structures and operations. With Apache AGE, users can query graph data with openCypher alongside familiar SQL, making it a compelling alternative for organizations already invested in PostgreSQL infrastructure.

4. pgRouting: pgRouting is an extension for PostgreSQL that adds routing functionality to the database, enabling users to perform complex routing and pathfinding operations on spatial data. While not a full-fledged graph database, pgRouting can be used to solve graph-related problems such as finding the shortest path between two points in a network. It offers a range of routing algorithms and functions, making it a valuable tool for applications requiring geospatial analysis and routing.
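To show what a recursive traversal over relational-style data actually computes, here is a small sketch written in JavaScript rather than SQL. It mimics the shape of a recursive CTE (an anchor step, a repeated join against the edge table, and a stop condition) over an in-memory array. The manages table and its rows are made up for illustration, and a real database would evaluate this very differently under the hood.

```javascript
// Edge "table": each row says `manager` directly manages `report` (made-up data).
const manages = [
  { manager: "ceo", report: "cto" },
  { manager: "ceo", report: "cfo" },
  { manager: "cto", report: "dev1" },
  { manager: "cto", report: "dev2" },
  { manager: "dev1", report: "intern" },
];

// Mirrors the shape of a recursive CTE:
//   anchor step    -> start from the given node
//   recursive step -> join the newest rows against the edge table
//   stop           -> when a step produces no new rows
function allReports(start) {
  const seen = new Map(); // report -> depth at which it was first reached
  let frontier = [start];
  let depth = 0;
  while (frontier.length > 0) {
    depth += 1;
    const next = [];
    for (const row of manages) {
      const isNew = !seen.has(row.report) && row.report !== start;
      if (frontier.includes(row.manager) && isNew) {
        seen.set(row.report, depth);
        next.push(row.report);
      }
    }
    frontier = next;
  }
  return seen;
}

console.log([...allReports("cto")]);
// [ [ 'dev1', 1 ], [ 'dev2', 1 ], [ 'intern', 2 ] ]
```

The seen check keeps the loop from revisiting nodes, so the traversal still terminates if the data contains cycles. Each pass scans the whole edge list, which hints at why this style of query can slow down as the dataset grows.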

Trade-offs and Considerations

If you're facing this same dilemma, here are several factors to consider:

Performance: Graph databases are optimized for querying and traversing graph-like data structures, offering efficient graph algorithms and indexing mechanisms out of the box. In contrast, recursive CTEs in relational databases may suffer from performance limitations as the size of the dataset grows, while caching solutions may introduce overhead due to cache invalidation and synchronization.
Scalability: Graph databases are designed to scale horizontally to handle large and highly interconnected datasets. However, they may require specialized infrastructure and tuning to achieve optimal performance at scale. On the other hand, relational databases and caching solutions can also scale horizontally, but may face limitations in handling complex graph traversals efficiently.
Ease of Use: Graph databases typically offer high-level query languages and intuitive data modeling tools tailored specifically for graph data. This makes them easy to use for developers familiar with graph concepts. In contrast, leveraging recursive CTEs or caching solutions may require more advanced SQL skills or custom code to implement and maintain.
Pricing: Graph databases often come with pricing models that consider factors such as the volume of data stored, the number of transactions processed, and the level of support provided. While some graph databases offer community editions or open-source options with no upfront costs, others may require subscription-based licensing or usage-based pricing models. In contrast, relational databases and caching solutions may have different pricing structures, such as per-instance fees or pay-as-you-go pricing for cloud-based deployments. It's essential to consider the total cost of ownership, including licensing, infrastructure, and ongoing maintenance, when evaluating the pricing of graph databases and alternatives.
Documentation: While graph database vendors strive to provide comprehensive documentation, the quality and coverage may vary depending on the specific vendor. In contrast, alternatives such as relational databases often offer consistently well-established documentation that covers a wide range of use cases. The extensive documentation, backed by a large community, makes it easy for users to find resources and support for their projects.
Community Support: While graph databases have an active and growing community, it may not be as extensive as those surrounding alternatives like relational databases. Relational databases have been around longer and are more mature, resulting in a larger and more established user base. This broad user base provides robust support, forums, and resources for users, ensuring a wealth of knowledge and assistance is readily available.

Conclusion

In the ever-evolving landscape of data management, the choice between graph databases and alternative approaches is not always straightforward. Each option offers its own set of advantages, trade-offs, and pricing considerations, making it essential to carefully evaluate your specific requirements and constraints before making a decision.

Graph databases shine in scenarios where complex relationships and graph-like data structures are prevalent, offering intuitive query languages, efficient graph traversal algorithms, and scalable infrastructure. However, it's important to note that they may come with significant upfront costs, making them a substantial investment for organizations. Additionally, utilizing graph databases effectively often requires specialized expertise, further adding to the overall cost of implementation and maintenance.

On the other hand, alternatives like recursive CTEs in relational databases and caching solutions like Redis or Memcached provide viable options for certain use cases, offering familiar SQL-based querying capabilities and low-latency data access. However, they may face performance limitations or scalability challenges when dealing with highly interconnected datasets.

When making your decision, consider factors such as performance, scalability, ease of use, and pricing, weighing the trade-offs against your specific application requirements. Whether you opt for the flexibility of graph databases, the familiarity of relational databases, or the speed of caching solutions, remember that the ultimate goal is to empower your organization with the right tools to derive insights and drive innovation from your data.

In the end, the best approach is one that aligns closely with your business objectives and enables you to extract maximum value from your data assets. So, choose wisely, and embark on your data journey with confidence, knowing that you've selected the optimal solution for your needs.

References

https://www.dylanpaulus.com/posts/postgres-is-a-graph-database/

https://www.linkedin.com/pulse/you-dont-need-graph-database-modeling-graphs-trees-viktor-qvarfordt-efzof

https://www.datacamp.com/blog/what-is-a-graph-database

https://linkurious.com/graph-query-languages/

Using SVG Icons with Pixi.JS

Abir Pal — Wed, 29 Nov 2023 13:47:21 GMT

📚 Pre-requisites

1. Basic understanding of PixiJS

PixiJS is an open-source, web-based rendering system that provides blazing-fast performance for graphics-intensive projects.

This blog assumes the user has set up a basic Pixi.js application. If not, it's highly recommended to go through the official Pixi.js tutorials

2. SVG

SVG is an XML-based vector image format for defining two-dimensional graphics, having support for interactivity and animation.

Basic understanding of the <symbol> and <use> elements
The <symbol> element is used to define graphical template objects which can be instantiated by a <use> element. Refer to MDN for reference.

3. How to use spritesheet

In layman's terms, a spritesheet is a collection of small images, like icons in this case. Instead of keeping each picture separately, you tile them on a single large image. To view an image from this spritesheet, you just need its position.

The Challenge

If you use SVG icons directly within PixiJS, they get rasterized (meaning they lose their vector data), so they blur when you scale them. To avoid this, we can:

- Either convert them into native Pixi objects using something like the pixi-svg library. The problem with this approach is that these conversions don't translate 1:1 to the original SVG because certain features are unsupported. So this option works only if your SVG uses none of the unsupported features listed by the library.
- Or set the SVG scale beforehand, so the icons are rasterized at a higher resolution before they get converted to textures.

For the scope of this blog post, we will be implementing the solution based on the second approach, while making sure that we have:

Efficient Icon Rendering Performance - Icons are loaded quickly and rendered without any hassle.
Smoother Icon Management - Adding new icons should not require significant extra effort

🔖 The Action Plan

Let's get started, with our action plan.

NodeJS Script to generate Spritesheet
1. The first step is to create a spritesheet from the SVG icons available in our workspace. The advantage of this tooling is a richer DX and a better way to manage SVG icons.
2. The assumption is that we already have a set of SVG icons with us.
3. The script will read these icons and compile them into a single spritesheet.json file, which we'll use within our code.
4. The script itself will be responsible for:
  1. Loading SVG icons as strings.
  2. Wrapping <symbol> around them so they become "reusable".
  3. Finally, using them via <use> and laying them out horizontally (with the assumption that every icon is of the same size and aspect ratio).
  4. The format for spritesheet.json is as follows:
```
// spritesheet.json
{
    "spriteSheetSVGData": "...", // Entire SVG as string
    "iconSequence": [] // Sequence of icons in the spritesheet
}
```
Using SVGs in Pixi.JS
Now coming to our main logic, we will use the generated spritesheet JSON to:
1. Create a texture from the spritesheet.
2. Create a function to crop the spritesheet based on the provided iconName and return the resultant sprite.
3. Render the icon in a PixiJS Stage.
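The cropping step boils down to simple arithmetic on the icon's index. The sketch below is framework-free and assumes 16x16 icons with an 8px gap (the values used by the generator script in this post); getIconFrame and its scale parameter are hypothetical names, and the exact numbers depend on how your sheet is generated and scaled.

```javascript
// Assumed layout: 16x16 icons placed left to right with an 8px gap.
const ICON_SIZE = 16;
const ICON_GAP = 8;

// Returns the rectangle to crop for `iconName`, or undefined if it is unknown.
// `scale` is the factor the SVG was rasterized at (1 = original size).
function getIconFrame(iconSequence, iconName, scale = 1) {
  const index = iconSequence.indexOf(iconName);
  if (index < 0) return undefined;
  return {
    x: index * (ICON_SIZE + ICON_GAP) * scale,
    y: 0,
    width: ICON_SIZE * scale,
    height: ICON_SIZE * scale,
  };
}

console.log(getIconFrame(["House", "Tasks", "Chart"], "Tasks", 3));
// { x: 72, y: 0, width: 48, height: 48 }
```

Keeping this calculation in one small function means the icon order in iconSequence is the only thing that has to stay in sync with the generated sheet.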

🚀 Code Walkthroughs

🧑🏽💻 Spritesheet generation

spritesheet.json consists of the following properties:
- spriteSheetSVGData: stringified SVG data of the spritesheet.
- iconSequence: An ordered array of distinct svg-ids used in the spritesheet. This is a critical attribute: the index of an icon in this array is used to fetch that icon in the Pixi.js app.

    /**
     * Note:
     * The script runs in a Node.js environment.
     */
    import path from "path";
    import fs from "fs-extra";

    /**
     * Converts string to PascalCase
     */
    const toPascalCase = (str) => {
      return `${str}`
        .replace(/[-_]+/g, " ")
        .replace(/[^\w\s]/g, "")
        .replace(
          /\s+(.)(\w*)/g,
          ($1, $2, $3) => `${$2.toUpperCase() + $3.toLowerCase()}`,
        )
        .replace(/\w/, (s) => s.toUpperCase());
    };

    // Directory where spritesheet.json will be generated
    const spriteSheetDirectoryPath = "";

    // Icon geometry: our icons are 16 x 16, with an 8px gap between them
    const ICON_SIZE = 16;
    const ICON_GAP = 8;

    // spritesheet object, to be loaded in JSON.
    const spriteSheet = {
      spriteSheetSVGData: "",
      iconSequence: [],
    };

    // Utility function to replace <svg> with <symbol>
    const replaceSvgWithSymbol = (svg, id) => {
      // Replace the opening <svg ...> tag with <symbol ... id="...">
      let modifiedSvg = svg.replace(/<svg([^>]*)>/, `<symbol$1 id="${id}">`);
      // Replace </svg> closing tag with </symbol> closing tag
      modifiedSvg = modifiedSvg.replace("</svg>", "</symbol>");
      // Remove xmlns property from <symbol>
      modifiedSvg = modifiedSvg.replace(' xmlns="http://www.w3.org/2000/svg"', "");
      return modifiedSvg;
    };

    export const createSpriteSheet = (svgList) => {
      /*
        Used to store <symbol>(s).
        Every root <symbol> represents a svg icon.
        For example:
        <symbol id="IconName">...</symbol>
      */
      let symbols = "";

      // conversion of <svg> to <symbol>, and
      // updating iconSequence in spritesheet
      svgList.forEach((svg) => {
        const svgCode = svg.data;
        const iconName = toPascalCase(svg.name);
        const symbolCode = replaceSvgWithSymbol(svgCode, iconName);
        symbols += symbolCode;
        spriteSheet.iconSequence.push(iconName);
      });

      /*
        Create SpriteSheet SVG Data
        A linear spritesheet having a
        height: 16 units (based on icon height).
        width: one icon plus one gap per icon, minus the trailing gap,
               so the last icon is not clipped.
        We use the entity reference method (<use>) to put images in the spritesheet.
        Finally write the spriteSheet to the respective JSON file
      */
      const pitch = ICON_SIZE + ICON_GAP;
      const sheetWidth = Math.max(0, svgList.length * pitch - ICON_GAP);
      // Assumption: the exact attributes on the root <svg> and on <use> are a reconstruction.
      spriteSheet.spriteSheetSVGData = `<svg xmlns="http://www.w3.org/2000/svg" height="${ICON_SIZE}" width="${sheetWidth}" viewBox="0 0 ${sheetWidth} ${ICON_SIZE}">${symbols}${spriteSheet.iconSequence
        .map((symbolID, index) => {
          return `<use href="#${symbolID}" x="${index * pitch}" y="0" width="${ICON_SIZE}" height="${ICON_SIZE}"/>`;
        })
        .join(" ")}</svg>`;

      // Create spritesheet.json
      fs.writeFileSync(
        path.resolve(spriteSheetDirectoryPath, "spritesheet.json"),
        JSON.stringify(spriteSheet),
      );

      // Create spritesheet.svg (written as raw SVG so the file is a valid image)
      fs.writeFileSync(
        path.resolve(spriteSheetDirectoryPath, "spritesheet.svg"),
        spriteSheet.spriteSheetSVGData,
      );
    };

Using SVGs in Pixi.JS

Assumptions
- spritesheet.svg is generated from the above script.
- iconSequence is extracted from the spriteSheet.json
Example App Structure
1. We will be using vanilla JavaScript as an example, for simplicity and ease of understanding.
```
app/
  index.html
  js/
    app.js
  static/
    spritesheet.json
```
As we are using vanilla JavaScript, we will use the native DOM to load Pixi.JS. Here is what index.html looks like.
- Note: The script app.js is loaded as a module. This is done intentionally to prevent scope ambiguity issues with the global context. Hence, we have to explicitly mention which variables we want to add to the global context. For example: globalThis.__PIXI_APP__ = app;

    <!DOCTYPE html>
    <html lang="en">
      <head>
        <meta charset="UTF-8" />
        <meta name="viewport" content="width=device-width, initial-scale=1.0" />
        <meta http-equiv="X-UA-Compatible" content="ie=edge" />
        <title>PixiJS and Icons Integration</title>
      </head>
      <body>
        <script src="https://cdn.jsdelivr.net/npm/pixi.js@6.x/dist/browser/pixi.min.js"></script>
        <script type="module" src="js/app.js"></script>
      </body>
    </html>

app.js performs the following operations:

1. Initialize PixiJS.
2. Load the SVG resource for the spritesheet.
3. Create a texture from the SVG resource.
4. Define a function that takes an icon name as input and returns the relevant icon sprite. It will:
  1. Create a sprite from the texture.
  2. Crop the sprite based on the required index found in iconSequence.
  3. Return undefined if no such icon exists.

    // App.js

    // File where our svg is stored.
    const SVG_URL = "/images/spriteSheet.svg";
    const ICON_SEQUENCE = ["House", "Tasks", "Chart"];

    // Initializing PixiJS
    let app = new PIXI.Application({
      width: 400,
      height: 300,
      resolution: window.devicePixelRatio,
      antialias: true,
      backgroundColor: 0xeeeeee,
      autoDensity: true
    });
    globalThis.__PIXI_APP__ = app;
    document.body.appendChild(app.view);

    // Create the SVG resource from the SVG, rasterized at 3x
    const svgRes = new PIXI.SVGResource(SVG_URL, { scale: 3 });

    // Create the texture from the SVG resource
    const texture = PIXI.Texture.from(svgRes);

    /* Returns the relevant Icon Sprite
     * @param iconName: string
     * @param options: {x:number, y:number, height:number, width:number}
     */
    function getIcon(iconName, options) {
      const sprite = new PIXI.Sprite(texture);
      const iconIndex = ICON_SEQUENCE.findIndex((iName) => iName === iconName);

      if (iconIndex < 0) {
        console.error("No icon found with name:", iconName);
        return undefined;
      }

      // Crop and position the sprite once the texture has loaded
      texture.on("update", () => {
        if (texture.valid) {
          texture.frame = new PIXI.Rectangle(70 * iconIndex, 0, 50, 48);
          sprite.width = options?.width || 16;
          sprite.height = options?.height || 16;
          sprite.x = options?.x || 175;
          sprite.y = options?.y || 125;
        }
      });

      return sprite;
    }

    const iconSprite = getIcon("Tasks", { height: 30, width: 30, x: 25, y: 25 });
    app.stage.addChild(iconSprite);

👨🏼💻 CodeSandbox

Here's a live example of our above implementation.

https://codesandbox.io/s/icons-integration-with-pixi-js-t55fdn?file=/js/app.js

Conclusion

So let's look at the aspects we discussed before starting the blog.

Icon Rendering Performance
- Now, we do not calculate textures for every icon. Instead, we calculate it once for the spritesheet, and then we crop the spritesheet based on the icon we need.
Icon Management
- Spritesheets can be now easily generated and updated with new icons from our new script.
- Every generated spritesheet can be versioned via git.
- In the end, the whole spritesheet is just a big SVG file, and we need not worry about having multiple files for multiple SVG icons, providing a smoother icon management experience, without any overhead of fetching individual resources.

I hope you found the blog helpful and got to learn something new today. Also, give our detailed blog on building a layout engine in Pixi.JS a read.

We are hiring! 🎉

If solving challenging problems at scale in a fully remote team interests you, head to our careers page and apply for the position of your liking!

Third-Party Cookie Restrictions for Iframes in Safari

Malav Shah — Tue, 17 Oct 2023 07:12:13 GMT

What do you need to know before?

Introduction

Have you ever wondered why some websites ask for storage permissions or why some features don't work in certain browsers? Let's dive into the intricacies of third-party cookie restrictions in Safari and how we tackled them!

Safari, Apple's flagship browser, has always been a pioneer in the quest for user privacy and security. In recent years, it has made significant strides in protecting its users from the prying eyes of online trackers by restricting third-party cookies.

In the following discussion, we will delve into a specific challenge encountered within this privacy-focused paradigm. This blog aims to unravel the complexities of seeking storage permission in Safari, particularly when our content is loaded within an iframe and relies on third-party cookies.

The predicament arose as we endeavored to integrate our React-based web app into an iframe hosted on a different domain. This intersection of domains posed a unique challenge, prompting us to explore innovative solutions to seamlessly navigate the intricacies of Safari's third-party cookie restrictions.

💡

WebKit is an open-source web browser engine developed by Apple Inc. It is primarily used as the rendering engine for Apple's Safari web browser. In the next few sections, we have added links to Webkit's website.

Problem

Our application uses cookies for authentication. In one of the use cases, it was being rendered inside an iframe. The parent HTML document that was rendering the iframe was being hosted on an entirely different domain.

In other browsers, our application within the iframe was able to access the cookies, but not in Safari. Safari uses Intelligent Tracking Prevention (ITP) to control access to third-party cookies.

ITP blocks third-party cookies by default, making them inaccessible in iframes unless certain conditions are met. These conditions can be found in WebKit's official announcement.

Storage Access APIs

As per ITP

"Third-party cookie access can only be granted through the Storage Access API."

Let's look at parts of this API that concern our problem:

- document.hasStorageAccess: This API is used to check cookie storage access. It returns a promise that, in Safari, resolves to false for third-party cookies until access has been granted.
- document.requestStorageAccess: This API is used to explicitly ask the user for third-party storage (cookie) access.

Both of the above APIs are available in Safari as well as in other browsers.

WebKit's official documentation explains the steps to use these APIs and the rest of the user flow (which is the basis for the following solution). We recommend giving it a read before moving ahead with this post.
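Before getting into the React version, the check-then-request flow can be sketched in a framework-free way. This is a minimal sketch and not the implementation used in this post: ensureStorageAccess is a hypothetical helper, it takes a document-like object so it can be exercised outside a browser, and treating "API not available" as "nothing to request" is an assumption you may want to change.

```javascript
// `doc` is anything shaped like `document`; passing it in keeps the sketch testable.
async function ensureStorageAccess(doc) {
  // Assumption: without the Storage Access API there is nothing to request.
  if (
    typeof doc.hasStorageAccess !== "function" ||
    typeof doc.requestStorageAccess !== "function"
  ) {
    return true;
  }

  // Already granted: no prompt needed.
  if (await doc.hasStorageAccess()) {
    return true;
  }

  try {
    // Must be called from a user gesture (for example a click handler),
    // otherwise the browser rejects the request.
    await doc.requestStorageAccess();
    return true;
  } catch {
    // The user declined or the browser refused the request.
    return false;
  }
}

// Hypothetical wiring inside the iframe:
// button.addEventListener("click", async () => {
//   const granted = await ensureStorageAccess(document);
// });
```

The function resolves to a boolean instead of throwing, so the caller can decide what to show when access is denied.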

Solution

The solution described below is not the only one, but it will help you design solutions for your own use cases. You can also design a solution as per your needs by following WebKit's guide.

We are using React, so the solution below is written in terms of React concepts.

We have created separate utility functions in the helper file.

    export function isSafari(): boolean {
      const userAgent = navigator.userAgent.toLowerCase();
      return (
        userAgent.indexOf("safari") !== -1 && userAgent.indexOf("chrome") === -1
      );
    }

    function supportStorageAccessApi(): boolean {
      return "hasStorageAccess" in document && "requestStorageAccess" in document;
    }

    export function hasStorageAccess(): Promise<boolean> {
      return document.hasStorageAccess();
    }

    export function requestStorageAccess(): Promise<void> {
      return document.requestStorageAccess();
    }

    export function requiresStoragePermissions(): boolean {
      return isSafari() && supportStorageAccessApi();
    }

The above code abstracts the browser API away from the actual implementation. These are the functions we are going to call.

Next, we created a new file named useStoragePermissions.tsx and added the code below.

    import React, { useCallback } from "react";
    // Assumption: "./helper" is a placeholder path for the helper file shown above.
    import {
      hasStorageAccess,
      requestStorageAccess,
      requiresStoragePermissions,
    } from "./helper";

    export const useStoragePermissions = (): {
      needPermission: boolean;
      askForPermission: () => void;
      haveCheckedPermission: boolean;
    } => {
      const [needPermission, setNeedPermission] = React.useState(
        requiresStoragePermissions()
      );
      const [haveCheckedPermission, setHaveCheckedPermission] =
        React.useState(false);

      // Resolves to true when storage access is already granted
      const isHavingPermissionFn = useCallback(async () => {
        try {
          return await hasStorageAccess();
        } catch (e: any) {
          // Handle error gracefully and show user some message
          return false;
        }
      }, []);

      // Updates both states based on the current storage access status
      const checkPermission = useCallback(() => {
        isHavingPermissionFn().then((isHavingPerm: boolean) => {
          setNeedPermission(!isHavingPerm);
          setHaveCheckedPermission(true);
        });
      }, [isHavingPermissionFn]);

      // Prompts the user for storage access, then re-checks the status
      const askForPermission = useCallback(async () => {
        try {
          await requestStorageAccess();
          checkPermission();
        } catch (e: any) {
          // Handle error gracefully and show user some message
        }
      }, [checkPermission]);

      React.useEffect(() => {
        if (requiresStoragePermissions()) {
          checkPermission();
        }
      }, [checkPermission]);

      return {
        needPermission,
        askForPermission: requiresStoragePermissions()
          ? askForPermission
          : () => {},
        haveCheckedPermission
      };
    };

The hook above exposes three values:

- needPermission: Initially true when the browser is Safari and supports both hasStorageAccess and requestStorageAccess. Once the permission check completes, it is true only if storage access has not been granted yet.
  Hint: Use this boolean while consuming the hook to decide when to call askForPermission.
- askForPermission: The function that the consumer can call to ask the user for storage access permission.
- haveCheckedPermission: A boolean that becomes true once the storage access check has completed (on mount, and again after askForPermission succeeds) when permissions are required.

To consume the above hook, we followed these steps:
1. We mount the hook created above in the initialization part of the app. We use its needPermission state and proceed as normal when needPermission is false.
2. We created another route, user-access-flow, that shows a button with some text like Set Cookie. On click of it, we set the cookie.
  1. This is where we actually call the authentication-related APIs, and the server then sets the cookies.
  2. Once this cookie is set, we close this tab using window.close().
3. Now, when needPermission is true:
  1. We show the user some text like "Click here and click on the Set Cookie button on the newly opened tab", and on click of it, redirect the user to the route created in step 2.
  2. When the user comes back from that route, we call askForPermission on click of some button, which should ask the user to give storage permission.
  3. Once the user clicks on the Allow button, your website is authorized to store and access third-party cookies, and you can continue with your business logic.
  4. We have also kept in mind that this consent will be revoked if the user cleans up the browser history and does not visit that domain for 7 days. These constraints are described in WebKit's documentation.

Conclusion

If you have faced such issues while developing or browsing, please share them in the comments. We will be more than happy to read and comment on them.

We are hiring!

If solving challenging problems at scale in a fully remote team interests you, head to our careers page and apply for the position of your liking!

Simplifying WebGL: Building an Effective Layout Engine

Abhinav Dabral — Wed, 09 Aug 2023 08:30:16 GMT

As front-end engineers, we often don't have to think about everything that the browser is doing behind the scenes to make our lives easier.

You put two div together, they get stacked automatically.
They grow in size as you add content to them.
You can style them, change alignments and whatnot.

The logic that's handling all this is your browser's layout engine (along with a lot of other things).

But when we're using WebGL, everything has to be individually positioned, and dimensions have to be predefined as well. So, if we want an HTML-like experience within WebGL, that is, relative positioning between objects, we require a layout engine. A layout engine will help in dynamically allocating and calculating the position and dimensions of relatively positioned objects within WebGL.

What is a Layout Engine?

In this context, we're referring to a piece of logic that is responsible for handling the logistics of where something should be positioned, relative to the position of others.

Although this practical guide uses WebGL as the most common use case, the approach is not specific to WebGL. It can be applied to other use cases with similar requirements.

Option 1 - Find something that "just works"

Your best bet is to find something purpose-built for this exact use case. How well it integrates with your specific project is another matter.

Here are a couple of options:

Yoga layout - https://github.com/facebook/yoga (C++)
This is what React Native uses internally. That's how you can write CSS-like styles and they work nearly identically on iOS and Android.
Stretch layout - https://github.com/vislyhq/stretch (Rust)
Pixi Layout - https://github.com/pixijs/layout (JS)
And a few others (which have not been updated for a while)
- https://github.com/lynaghk/subform-layout (Abandoned, also only minified code is available)
- https://github.com/randrew/layout (C++)

Unfortunately, for our use case (and without getting into details), none of those worked as we expected.

Maybe you're in the same situation as us, or you just want something small and don't want to use these big libraries. Either way, I hope you will find something useful in this article.

Option 2 - Learn how a layout engine works

There's no point in putting these details here, especially because Matt Brubeck has done a far better job at highlighting all the important bits that someone building a layout engine should know.

Here's the link - https://limpet.net/mbrubeck/2014/08/08/toy-layout-engine-1.html

While his approach was more towards building an HTML rendering engine, the core design of the layout engine stays more or less the same.

Building a layout engine in Javascript

Please note that this is not a one-size-fits-all solution. This is more of a "proof-of-concept" than anything else. Here, we're merely trying to highlight the process that goes into building a very basic layout engine, so that you can learn from it and build your own implementation for your specific use case. You can make this as simple or as complicated as needed.

1. Plan

Before we start the implementation, let's just go over what we plan to achieve and how we plan to achieve it.

The process from start to finish goes somewhat like this:

Define - Have all the nodes with styles in them, arranged in a proper hierarchy. This will be the tree structure, and the top-most node will be called the rootNode.
Compute - Start computing layouts from the rootNode, and recursively calculate for the entire tree.
Paint - Start from the rootNode and recursively go over all children.

2. Implementation

These are the items we'll be implementing here

Allocator (class)
An Allocator is a singleton, responsible for handling all the operations of adding/removing nodes.
Box (class)
The Box will be a visual element. It'll just use the calculated data from the node to position and shape itself accordingly.
Test environment
To test everything.
Layout calculator (function)
A function that calculates the layout for the given node and, recursively, for its children.

2.1 - Allocator

2.1.1 - Define the shape of Node

Nodes are a logical structure that we will use to represent any element during the entire process. A node doesn't need to know what kind of element it is representing. An element can be whatever you want it to be - Box, Text, anything. But each element still has to be linked to its own individual "node", which will represent it during the layout process.

A node can contain information like:

Externally supplied style to this element
Layout parameters of the element

For our use case, let's say that the node is defined by the following TypeScript types:

    // layoutEngine.ts
    export type NodeLayout = {
      id: string; // unique ID
      width: number; // computed
      height: number; // computed
      x: number; // computed
      y: number; // computed
      style: NodeStyle; // externally supplied styles
      children: Array<NodeLayout>; // in order
      parentNode: NodeLayout | null; // because root will have `null`
    };

    export type NodeStyle = {
      width?: number;
      height?: number;
      layoutMode: LayoutMode;
      backgroundColor?: number; // hex color, read by Box when rendering
    };

    export enum LayoutMode {
      HORIZONTAL = "horizontal",
      VERTICAL = "vertical"
    }

2.1.2 - Create the Allocator

An Allocator singleton class will be responsible for managing all the operations of adding nodes to other nodes and forming a tree.

    // allocator.ts
    class Allocator {
      public rootNode = createNode(); // Wait what?

      // .. other methods will go here
    }

Oh yes, we also need a function that creates and initializes a node. Nothing too complicated.

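    // allocator.ts
    import { v4 as uuid } from "uuid"; // assumed ID generator; any unique-ID helper works
    import { LayoutMode, NodeLayout, NodeStyle } from "./layoutEngine";

    const defaultStyle: NodeStyle = {
      layoutMode: LayoutMode.VERTICAL,
      backgroundColor: 0x000000
    };

    /**
     * Creates and returns a NodeLayout object with
     * default values assigned to it.
     */
    export const createNode = () => {
      const newNode: NodeLayout = {
        id: uuid(),
        width: 0,
        height: 0,
        x: 0,
        y: 0,
        style: { ...defaultStyle }, // copy, so nodes never share one style object
        children: [],
        parentNode: null
      };
      return newNode;
    };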

Now that we've taken care of that, let's get back to our singleton.

Now we need a way to:

Attach new nodes to existing nodes

      attachChild(child: NodeLayout, parent?: NodeLayout) {
        const finalParent = parent || this.rootNode;
        finalParent.children.push(child);
        child.parentNode = finalParent;
      }

Detach nodes from existing nodes

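      detachChild(child: NodeLayout) {
        const parent = child.parentNode;
        // The root (or an already detached node) has no parent to detach from
        if (!parent) return;
        parent.children = parent.children.filter((c) => c.id !== child.id);
        child.parentNode = null;
      }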

Move nodes between nodes

      moveChild(child: NodeLayout, parent: NodeLayout) {
        this.detachChild(child);
        this.attachChild(child, parent);
      }

and Destroy the nodes, recursively.

      destroy(node: NodeLayout) {
        node.children.forEach((child) => this.destroy(child));
        this.detachChild(node);
      }

That's it. Now the only thing left to do is to actually instantiate it as a singleton and export it.

    // allocator.ts
    export const defaultAllocator = new Allocator();
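
To see the four operations working together, here is a minimal, self-contained sketch. It is not the article's exact code: `TreeNode`, `createTreeNode` and `TreeAllocator` are hypothetical stand-ins that drop the layout fields and replace `uuid()` with a simple counter, so that the snippet runs on its own.

```typescript
// Hypothetical stand-in for NodeLayout: only the tree-related fields are kept.
interface TreeNode {
  id: string;
  children: TreeNode[];
  parentNode: TreeNode | null;
}

let nextId = 0; // assumption: a counter instead of the article's uuid()
const createTreeNode = (): TreeNode => ({
  id: `node-${nextId++}`,
  children: [],
  parentNode: null,
});

class TreeAllocator {
  rootNode = createTreeNode();

  // Attach to the given parent, or to the root when no parent is passed.
  attachChild(child: TreeNode, parent: TreeNode = this.rootNode): void {
    parent.children.push(child);
    child.parentNode = parent;
  }

  detachChild(child: TreeNode): void {
    const parent = child.parentNode;
    if (!parent) return; // root or already detached
    parent.children = parent.children.filter((c) => c.id !== child.id);
    child.parentNode = null;
  }

  moveChild(child: TreeNode, parent: TreeNode): void {
    this.detachChild(child);
    this.attachChild(child, parent);
  }

  // Destroy children first, then unlink the node itself.
  destroy(node: TreeNode): void {
    node.children.forEach((c) => this.destroy(c));
    this.detachChild(node);
  }
}

const allocator = new TreeAllocator();
const panel = createTreeNode();
const sidebar = createTreeNode();
const button = createTreeNode();

allocator.attachChild(panel);          // root -> panel
allocator.attachChild(sidebar);        // root -> sidebar
allocator.attachChild(button, panel);  // panel -> button
allocator.moveChild(button, sidebar);  // button now lives under sidebar
allocator.destroy(sidebar);            // removes sidebar and button

// Only panel is left under the root.
console.log(allocator.rootNode.children.map((c) => c.id));
```

Note that `detachChild` replaces the parent's `children` array instead of mutating it, which is why `destroy` can safely iterate over `node.children` while detaching.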

2.2 - Box, the visual component

Again, this is just a POC, so we'll go as simple as possible. For any element, in this case, Box, we'll just have a class that will contain an instance of a NodeLayout, and then the rest of the implementation is only around how the box renders and everything else involved with it.

For the sake of our example, we're going with Pixi JS to create this box. But this can be whatever else it needs to be. The important part is how we consume the data of the node to render the box.

    // Box.ts
    import { createNode } from "./allocator";
    import { NodeLayout, NodeStyle } from "./layoutEngine";
    import * as PIXI from "pixi.js";

    export class Box {
      node: NodeLayout = createNode();
      container: PIXI.Graphics = new PIXI.Graphics();

      constructor(style?: Partial<NodeStyle>) {
        // merge the supplied style over the defaults
        this.node.style = { ...this.node.style, ...style };
      }

      render() {
        this.container.clear();
        this.container.beginFill(this.node.style.backgroundColor);
        this.container.lineStyle(1, 0x000000);
        this.container.drawRect(
          this.node.x,
          this.node.y,
          Number(this.node.width),
          Number(this.node.height)
        );
        this.container.endFill();
      }
    }

2.3 - Sandbox

Just to check if everything is in order so far, we'll just create a sandbox environment to test things out. We'll be using Pixi JS here.

    // app.ts or index.ts
    import * as PIXI from "pixi.js";
    import { defaultAllocator } from "./allocator";
    import { Box } from "./Box";
    import { calculateLayout, LayoutMode } from "./layoutEngine";

    function initApp(container: HTMLElement) {
      const app = new PIXI.Application({
        resizeTo: container
      });
      container.appendChild((app.view as unknown) as HTMLCanvasElement);

      //// Sample code Start ////

      // initialize a Box with Red background
      const b1 = new Box({
        backgroundColor: 0xff0000
      });

      // we need to add the PIXI component to the main stage
      app.stage.addChild(b1.container);

      // And let's just assign some values to node
      // these values will actually be computed automatically
      // when we have calculateLayout in place
      b1.node = {
        ...b1.node,
        x: 20,
        y: 20,
        width: 100,
        height: 100
      };

      // finally call the render method to paint the box
      b1.render();

      //// Sample code End ////
    }

    const htmlContainer = document.getElementById("app");
    if (htmlContainer) {
      initApp(htmlContainer);
    }

At this stage, you should get something like this:

Yay, a red rectangle on the screen.

The setup is complete and only the last piece remains: the one that binds this whole thing together (and the whole point of this article). It was necessary to have everything else in place because layout computation is something you'll likely want to experiment with, and experimentation is more fun when you can see the change happening as you make it.

2.4 - Layout computation

When we talk about layout engines, it mostly comes down to calculating the x, y, width and height of a node. It sounds simple enough but the key is doing calculations in a particular sequence to get it right.

Layout calculation process and implementation

Computation starts at the given node (rootNode for the very first iteration)

    // layoutEngine.ts
    export function calculateLayout(node: NodeLayout) {
      const parent = node.parentNode;
      // what now?
    }

Calculate width, x and y of this node

Let's check whether the parent is missing. If it is, this is likely the root node, so we just assign some defaults to it.

      ...
      if (!parent) {
        // this is a root node
        // (originLayout holds the root's default x, y and width; it is defined elsewhere)
        node.width = originLayout.width;
        node.x = originLayout.x;
        node.y = originLayout.y;
      }

If a parent is present, we need to check whether the parent lays out its children horizontally or vertically.

      ... else {
        const currentNodeIndex = parent.children.findIndex(
          (n) => n.id === node.id
        );
        // Some things are dictated by the previous sibling,
        // so let's have it ready
        const previousSibling =
          currentNodeIndex > 0
            ? parent.children[currentNodeIndex - 1]
            : null;

        // If the parent is laying out components vertically
        if (parent.style.layoutMode === LayoutMode.VERTICAL) {
          node.width = parent.width; // deduct padding/margin here
          node.x = parent.x; // add padding/margin here
          const siblingBottom = previousSibling
            ? previousSibling.y + previousSibling.height
            : 0;
          if (siblingBottom && previousSibling) {
            // if sibling starts at 0, ends at 200,
            // this starts at 201.
            node.y = siblingBottom + 1;
            // Also consider adding margins here.
          } else {
            node.y = parent.y; // add margins here
          }
        }
        // If the parent is laying out components horizontally
        else {
          const availableWidth = parent.width; // deduct paddings
          // we're going lazy here but ideally you can have the
          // logic to proportionally divide the available width among
          // the children.
          node.width = availableWidth / parent.children.length;
          node.y = parent.y; // add padding/margins here
          const siblingLeft = previousSibling
            ? previousSibling.x + previousSibling.width
            : 0;
          if (previousSibling && siblingLeft) {
            node.x = siblingLeft + 1; // add margins
          } else {
            node.x = parent.x; // add padding/margins here
          }
        }
      }

Finally, let's set the width from the style, if it is configured
```
if (node.style.width) {
  node.width = node.style.width;
}
```

Start calculating the layout for the children, recursively
(i.e. start from #1 of this process)
```
 node.children.forEach(calculateLayout);
```

Calculate height of this node
(This is done after computing the layout for the children because height is affected by the children)

      // Case 1: When height is already defined
      if (node.style.height) {
        node.height = node.style.height;
      }
      // Case 2: If the current node is laying the children vertically
      else if (node.style.layoutMode === LayoutMode.VERTICAL) {
        // then the total height is just the total height of all
        // the children (and their margins/paddings)
        /**
         * get the maximum height by computing difference between y of
         * first node and the y of last node, and add the height of
         * the last node.
         * Also add the padding of the parent node and margins of
         * first and last child
         */
        if (node.children.length >= 1) {
          const firstChild = node.children[0];
          const lastChild = node.children[node.children.length - 1];
          node.height = lastChild.y + lastChild.height - firstChild.y;
        } else {
          node.height = 0;
        }
      }
      // Case 3: If the current node is laying children horizontally
      else {
        // Just find the node that has the largest height.
        // The leading 0 keeps the result at 0 (not -Infinity) when there are no children.
        node.height = Math.max(0, ...node.children.map((c) => c.height));
      }

Done
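
To make the order of operations concrete, here is a condensed, self-contained restatement of the steps above, applied to a tiny tree with known numbers. It is a sketch under a few assumptions: `LNode`, `make`, `attach` and `origin` are hypothetical names, `origin` stands in for `originLayout` with an assumed root width of 300, and an empty horizontal node is given a height of 0.

```typescript
type Mode = "horizontal" | "vertical";

interface LNode {
  x: number; y: number; width: number; height: number;
  style: { width?: number; height?: number; layoutMode: Mode };
  children: LNode[];
  parent: LNode | null;
}

const make = (style: Partial<LNode["style"]> = {}): LNode => ({
  x: 0, y: 0, width: 0, height: 0,
  style: { layoutMode: "vertical", ...style },
  children: [],
  parent: null,
});

const attach = (parent: LNode, ...kids: LNode[]): void => {
  for (const kid of kids) {
    kid.parent = parent;
    parent.children.push(kid);
  }
};

// Assumed stand-in for originLayout: the root's default position and width.
const origin = { x: 0, y: 0, width: 300 };

function layout(node: LNode): void {
  const parent = node.parent;
  if (!parent) {
    node.x = origin.x; node.y = origin.y; node.width = origin.width;
  } else {
    const i = parent.children.indexOf(node);
    const prev = i > 0 ? parent.children[i - 1] : null;
    if (parent.style.layoutMode === "vertical") {
      node.width = parent.width;
      node.x = parent.x;
      const bottom = prev ? prev.y + prev.height : 0;
      node.y = bottom ? bottom + 1 : parent.y;
    } else {
      node.width = parent.width / parent.children.length;
      node.y = parent.y;
      const right = prev ? prev.x + prev.width : 0;
      node.x = right ? right + 1 : parent.x;
    }
  }
  if (node.style.width) node.width = node.style.width;

  // Children first: the height of this node depends on them.
  node.children.forEach(layout);

  if (node.style.height) {
    node.height = node.style.height;
  } else if (node.style.layoutMode === "vertical") {
    const first = node.children[0];
    const last = node.children[node.children.length - 1];
    node.height = first && last ? last.y + last.height - first.y : 0;
  } else {
    node.height = Math.max(0, ...node.children.map((c) => c.height));
  }
}

// root (vertical) -> row (horizontal: a, b) and c
const root = make();
const row = make({ layoutMode: "horizontal" });
const a = make({ width: 100, height: 100 });
const b = make({ width: 120, height: 120 });
const c = make({ height: 80 });
attach(root, row, c);
attach(row, a, b);

layout(root);
console.log({ bX: b.x, rowHeight: row.height, cY: c.y, rootHeight: root.height });
// { bX: 101, rowHeight: 120, cY: 121, rootHeight: 201 }
```

Reading the numbers back: `b` starts one pixel after `a` ends (100 + 1), the row is as tall as its tallest child (120), `c` starts one pixel below the row (120 + 1), and the root spans from the top of the row to the bottom of `c` (121 + 80).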

2.5 - Let's try it

To test this out, we just need to go back to the sandbox environment and set up a set of Box components.

For this test, we'll do something like this

    b1 (default vertical mode)
    |- b11 (horizontal mode) - Yellow
    |  |- b111 - Red
    |  |- b112 - Orange
    |  |- b113 - Magenta
    |- b12 (vertical mode) - Aqua
    |  |- b121 - Green
    |  |- b122 - Dark green
    |  |- b123 - Dark blue

To implement this, let's go back to our sandbox, replace all the sample code and implement this as follows:

Add the box b1
```
 const b1 = new Box();
```

Add 2 boxes b11 and b12

    const b11 = new Box({
      layoutMode: LayoutMode.HORIZONTAL,
      backgroundColor: 0xffff00 // yellow
    });
    const b12 = new Box({
      backgroundColor: 0x00ffff // aqua
    });

Add 6 boxes (3 for b11 and 3 for b12)

    // FOR b11
    const b111 = new Box({
      width: 100,
      height: 100,
      backgroundColor: 0xff0000 // red
    });
    const b112 = new Box({
      width: 120,
      height: 120,
      backgroundColor: 0xffaa00 // orange
    });
    const b113 = new Box({
      width: 80,
      height: 80,
      backgroundColor: 0xcc00ff // magenta
    });

    // FOR b12
    const b121 = new Box({
      height: 100,
      backgroundColor: 0x00ff00 // green
    });
    const b122 = new Box({
      width: 200,
      height: 120,
      backgroundColor: 0x00cc00 // dark green
    });
    const b123 = new Box({
      height: 80,
      backgroundColor: 0x0000cc // dark blue
    });

Put everything within the allocator so that it forms a tree

    // Add b1 to the root
    defaultAllocator.attachChild(b1.node, defaultAllocator.rootNode);

    // Add b11 and b12 as b1's children
    defaultAllocator.attachChild(b11.node, b1.node);
    defaultAllocator.attachChild(b12.node, b1.node);

    // add 3 children b111, b112, b113 within b11
    defaultAllocator.attachChild(b111.node, b11.node);
    defaultAllocator.attachChild(b112.node, b11.node);
    defaultAllocator.attachChild(b113.node, b11.node);

    // add 3 children b121, b122, b123 within b12
    defaultAllocator.attachChild(b121.node, b12.node);
    defaultAllocator.attachChild(b122.node, b12.node);
    defaultAllocator.attachChild(b123.node, b12.node);

Create an array of all the elements and just attach them to the Pixi Stage.
Still, nothing will be rendered at this point.

    const components = [
      b1,
      b11, b12,
      b111, b112, b113,
      b121, b122, b123
    ];
    components.forEach((c) => app.stage.addChild(c.container));

Compute the layout and render the nodes

    calculateLayout(defaultAllocator.rootNode);
    components.forEach((c) => {
      c.render();
    });

Done. If everything so far was correctly done, you should see something like this.

Wait ... it's not over

The article was more aimed at keeping things as simple as possible to understand the basics of a layout engine. But there's no need to stop here. This can be taken as an opportunity to challenge yourself and implement more things on top of this implementation, such as:

Alignment (both horizontal and vertical)
Paddings and Margins
Flexible widths of children based on proportions
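
As a starting point for the last item, here is a minimal sketch of dividing the available width proportionally. The `flex` weights are hypothetical; they are not part of the `NodeStyle` type defined earlier and would have to be added to it.

```typescript
// Hypothetical helper: splits `availableWidth` among children according to
// their flex weights. Falls back to an equal split when all weights are 0.
function distributeWidths(availableWidth: number, flexWeights: number[]): number[] {
  const total = flexWeights.reduce((sum, w) => sum + w, 0);
  if (total <= 0) {
    return flexWeights.map(() => availableWidth / flexWeights.length);
  }
  return flexWeights.map((w) => (availableWidth * w) / total);
}

const widths = distributeWidths(300, [1, 2, 3]);
console.log(widths); // [50, 100, 150]
```

In the horizontal branch of the layout computation, a helper like this could replace the equal split `availableWidth / parent.children.length`.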

CodeSandbox

This entire example is also available on CodeSandbox if you'd like to fork it there and experiment with it - https://codesandbox.io/s/layout-engine-demo-3y3hjf

Join us

We're always looking to expand and welcome talented members to our team. And the best part, it's all remote! You can work from wherever you are. Head over to our careers page and apply to any of the available positions that seem right for you.

Build React Forms using JSON

Pawan Kolhe — Mon, 15 May 2023 04:39:46 GMT

Struggling to manage forms in your project? What if you could generate your forms from a JSON schema?

Every web application makes use of forms at some point as they are vital for information gathering. These forms can get very large and complex depending on your use case. Ideally, we want to be able to create forms in the simplest way possible.

🚁 The Problem

The React ecosystem already has some very popular libraries to manage forms, such as React Hook Form, React Final Form, Formik, and many more. All these libraries provide validation, conditional logic, form submission handling, and everything else you need to create complex forms. However, all of them require a fair amount of JSX, which becomes verbose and repetitive, especially with long forms and multiple sections. If your forms are large, they often need to be divided into multiple React component files for better maintainability. I think we can all agree that doing so is tedious and time-consuming.

🚀 The Solution

Wouldn't it be great to abstract away repetitive and verbose JSX code required to render forms? Just think about it, forms do very standard and predictable tasks. Generally, we need the following:

Validation - Is a field required or not, Regex, etc? What error message to display if validation fails?
Layout - Whether to place fields horizontally or vertically. Nesting fields under collapse sections.
Conditions/Dependencies - Whether to hide or show a field/section when a certain condition is true. Deriving field A value from field B.
Form state - Providing the initial state of the form. Perform an action on submitting.
Field metadata - Name, label and description of the field. Component to render (TextField, NumberField, Switch, custom component..).

Creating a form should be as simple as defining the structure and properties of the form in JSON format and passing that to a React component that will render the form with all the complex validation, layout, and conditions for us. Data Driven Forms is a React library that enables us to do just that.

Data Driven Forms, as the name suggests, creates React forms using JSON data. Once we set up the library in our React app, all we need to do is define a JSON definition to create a unique form with its validation, layout, fields, conditions, and structure. It can be thought of as a declarative way to build forms. We don't need to specify how to build the form, but only what the user should see and how it should behave.

The Data Driven Forms library internally uses React Final Form to manage the form state, although this dependency is likely to be removed in the next version of the library.

Benefits of Data Driven approach

Build new forms significantly faster
Less source code
More readable and easy to tell what the form is doing
A well-structured and consistent way to create forms

In the future, if you choose to adopt a different framework like Svelte, the JSON form definitions can still be used. You would, however, need to create a component that can render the JSON definition in the new framework, as the Data Driven Forms library only supports React at the time of writing this blog.

Here is an example of how a JSON definition/schema is structured.

    const exampleForm = {
        fields: [
            {
                name: "firstName",
                component: "text-field",
                label: "First Name",
                validate: ...,
                condition: ...,
            },
            ...
        ]
    }

🧑🏼💻 Using Data Driven Forms

Here is a CodeSandbox for you to view the source code and run it yourself:

https://codesandbox.io/embed/data-driven-forms-d6jur?expanddevtools=1&fontsize=14&hidenavigation=1&module=%2Fsrc%2FSchemaForm.jsx&theme=dark&view=editor

Install library

To get started, install the required npm package.

yarn add @data-driven-forms/react-form-renderer

The @data-driven-forms/react-form-renderer package contains the FormRenderer component that is responsible for rendering the form based on the schema it is passed.

We'll also need a component mapper. The @data-driven-forms/ant-component-mapper provides a Component Mapper that maps string literals to Ant Design components. The mapper is passed to FormRenderer as a prop.

yarn add @data-driven-forms/ant-component-mapper antd

You could also install one of the other available component mappers, or create your own custom mapper, which lets you use the React components already available in your project. At Certa, we have built a custom mapper on top of our design system components.

FormRenderer

The FormRenderer is a React component that contains all the logic for rendering a React form according to the schema and the configuration provided to it via props.

    // SchemaForm.jsx
    import FormRenderer from "@data-driven-forms/react-form-renderer/form-renderer";
    import FormTemplate from "@data-driven-forms/ant-component-mapper/form-template";
    import componentMapper from "@data-driven-forms/ant-component-mapper/component-mapper";
    import { schema } from "./schema";

    export const SchemaForm = () => {
      return (
        <FormRenderer
          schema={schema}
          componentMapper={componentMapper}
          FormTemplate={FormTemplate}
          initialValues={{}}
          onSubmit={(values) => console.log(values)}
        />
      );
    };

The schema property takes the form JSON definition.

The FormTemplate property defines a template of the form. This is a component that wraps the rendered fields in a form element and adds the form buttons.

The initialValues property allows you to set the initial values of the fields in the form.

All the available props can be viewed here.

📜 JSON Definition

Here is what a typical form schema might look like.

    // schema.js
    import { componentTypes, validatorTypes } from "@data-driven-forms/react-form-renderer";

    export const schema = {
      fields: [
        {
          component: componentTypes.TEXT_FIELD,
          name: "name",
          label: "Your name",
          isRequired: true,
          validate: [{ type: validatorTypes.REQUIRED }]
        },
        {
          component: componentTypes.TEXT_FIELD,
          name: "email",
          label: "Email",
          isRequired: true,
          validate: [
            {
              type: validatorTypes.PATTERN,
              // "\\." matches a literal dot before the top-level domain
              pattern: "[a-z0-9._%+-]+@[a-z0-9.-]+\\.[a-z]{2,}$",
              message: "Not valid email"
            }
          ]
        },
        {
          component: componentTypes.TEXT_FIELD,
          name: "confirm-email",
          label: "Confirm email",
          type: "email",
          isRequired: true,
          validate: [{ type: "same-email" }]
        },
        {
          component: componentTypes.TEXT_FIELD,
          name: "address.street",
          label: "Street"
        },
        {
          component: componentTypes.SELECT,
          name: "address.state",
          label: "State",
          options: [
            { label: "Delhi", value: "delhi" },
            { label: "Goa", value: "goa" },
            { label: "Maharashtra", value: "maharashtra" }
          ]
        },
        {
          component: componentTypes.CHECKBOX,
          name: "newsletters",
          label: "I want to receive newsletter"
        }
      ]
    };

The name property defines the path where the field data needs to be mapped.

The component property defines the component to be used to render the field from the componentMapper.

The label property is just the label text of that field.

The validate property defines the validation parameters on the field. Regex is also supported.

Depending on the component property, you might need to pass in more properties. For example, the select component also needs dropdown options.

🎛 Conditions

Form fields may need to be hidden or shown depending on a certain condition. With Data Driven Forms you can easily declare this inside the field schema.

Let's take an example to show visibility constraint in action.

    {
        name: "whereInGoa",
        component: "text-field",
        label: "Where in Goa?",
        condition: {
            when: "state",
            is: "goa"
        }
    }

Initially this field would be hidden. Only when the field with name "state" has a value of "goa", would this field be visible.

This was a simple example. Data Driven Forms allows for more complex conditions, where you can combine multiple rules using not, and, and or logic. You can even change the value of a field when the condition is satisfied.
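
To illustrate how such conditions can be resolved against the current form values, here is a small evaluator sketch. This is not the library's implementation: `Condition`, `getByPath` and `evaluate` are hypothetical names, and the `and`, `or` and `not` keys simply mirror the logic described above (check the library's documentation for the exact schema).

```typescript
type FormValues = Record<string, unknown>;

type Condition =
  | { when: string; is: unknown }
  | { and: Condition[] }
  | { or: Condition[] }
  | { not: Condition };

// Resolve a dotted path such as "address.state" against nested values.
function getByPath(values: FormValues, path: string): unknown {
  return path.split(".").reduce<unknown>(
    (current, key) =>
      current !== null && typeof current === "object"
        ? (current as Record<string, unknown>)[key]
        : undefined,
    values
  );
}

// Returns true when the field guarded by `condition` should be visible.
function evaluate(condition: Condition, values: FormValues): boolean {
  if ("and" in condition) return condition.and.every((c) => evaluate(c, values));
  if ("or" in condition) return condition.or.some((c) => evaluate(c, values));
  if ("not" in condition) return !evaluate(condition.not, values);
  return getByPath(values, condition.when) === condition.is;
}

const showWhereInGoa: Condition = { when: "state", is: "goa" };

console.log(evaluate(showWhereInGoa, { state: "goa" }));   // true
console.log(evaluate(showWhereInGoa, { state: "delhi" })); // false
console.log(evaluate({ not: showWhereInGoa }, {}));        // true
```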

🛡 Validators

Validation is important to prevent users from submitting incorrect data.

A common way to validate a field is to use regex. You can use the "pattern" validator to achieve this.

    {
        name: "foo",
        ...
        validate: [
            {
                type: validatorTypes.PATTERN,
                pattern: /^Foo$/i,
                message: "This field doesn't match the required format"
            },
        ]
    }

The message property specifies a custom error message for when the validation fails.

Following are all the validators provided by the library:

    const validatorTypes = {
      REQUIRED: 'required',
      MIN_LENGTH: 'min-length',
      MAX_LENGTH: 'max-length',
      EXACT_LENGTH: 'exact-length',
      MIN_ITEMS: 'min-items',
      MIN_NUMBER_VALUE: 'min-number-value',
      MAX_NUMBER_VALUE: 'max-number-value',
      PATTERN: 'pattern', // regex
      URL: 'url'
    };

You can also define your own custom validator, which is essentially a function that receives the value of the field as its first argument and is expected to return the error message string if validation fails, or undefined if validation passes.

    {
        name: "foo",
        ...
        validate: [(value) => (value === "Pawan" ? "Pawan is not valid" : undefined)]
    }
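
The contract of such validator functions (return an error message string, or undefined when the value is fine) makes them easy to combine. Below is a minimal sketch of one plausible way to run several of them in order and report the first failure. `Validator`, `required`, `matches` and `runValidators` are hypothetical helpers for illustration, not part of the library's API.

```typescript
// A validator returns an error message, or undefined when the value is valid.
type Validator = (value: unknown) => string | undefined;

const required: Validator = (value) =>
  value === undefined || value === null || value === "" ? "Required" : undefined;

const matches = (pattern: RegExp, message: string): Validator => (value) =>
  typeof value === "string" && pattern.test(value) ? undefined : message;

// Runs the validators in order and returns the first error, if any.
function runValidators(value: unknown, validators: Validator[]): string | undefined {
  for (const validate of validators) {
    const error = validate(value);
    if (error !== undefined) return error;
  }
  return undefined;
}

const fooValidators: Validator[] = [
  required,
  matches(/^Foo$/i, "This field doesn't match the required format"),
];

console.log(runValidators("", fooValidators));    // "Required"
console.log(runValidators("bar", fooValidators)); // "This field doesn't match the required format"
console.log(runValidators("foo", fooValidators)); // undefined
```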

🗃 Custom Component Mapper

Using a predefined component mapper could be a great way to get started, but at some point, we want to use custom components that are already in our project.

Here is how we define a custom component mapper:

    // customComponentMapper.js
    import { TextField } from "./TextField";

    export const customComponentMapper = {
      "text-field": TextField
    };

Here the TextField component would contain the custom component prop mapping.

    // TextField.jsx
    import React from "react";
    import { Form } from "antd";
    import { useFieldApi } from "@data-driven-forms/react-form-renderer";
    import { validationError } from "@data-driven-forms/ant-component-mapper";
    import { Input } from "../components/Input";

    export const TextField = (props) => {
      const {
        input,
        isReadOnly,
        isDisabled,
        isRequired,
        label,
        helperText,
        description,
        validateOnMount,
        meta,
        ...rest
      } = useFieldApi(props);

      const invalid = validationError(meta, validateOnMount);
      const warning = (meta.touched || validateOnMount) && meta.warning;
      const help = invalid || warning || helperText || description;

      return (
        <Form.Item
          validateStatus={!invalid ? (warning ? "warning" : "") : "error"}
          help={help}
          label={label}
          required={isRequired}
        >
          <Input
            {...input}
            onChange={(e) => input.onChange(e.target.value)}
            defaultValue={input.value}
            disabled={isDisabled || isReadOnly}
            {...rest}
          />
        </Form.Item>
      );
    };

The useFieldApi hook is a wrapper around React Final Form's useField hook.

In this case, the Input component would be the custom component that you want to use as a text field.

If the component itself doesn't have a way to show the required symbol, label or description of a field, a wrapper component like Form.Item from ant design can be used.

We can now spread the customComponentMapper object while passing it into FormRenderer. This would override the predefined component mappers with our custom ones.
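
The override works because of how object spread resolves duplicate keys: later spreads win. Here is a minimal sketch of that behavior, using plain functions that return strings as stand-ins for React components. `predefinedMapper`, `AntInput`, `AntSelect` and `CustomInput` are hypothetical names for illustration.

```typescript
// Stand-in for a React component: takes props, returns a description string.
type Component = (props: { label: string }) => string;
type ComponentMapper = Record<string, Component>;

// Hypothetical predefined mapper (in practice, the one from a mapper package).
const predefinedMapper: ComponentMapper = {
  "text-field": ({ label }) => `<AntInput label="${label}" />`,
  select: ({ label }) => `<AntSelect label="${label}" />`,
};

// Our custom mapper only overrides the text field.
const customMapper: ComponentMapper = {
  "text-field": ({ label }) => `<CustomInput label="${label}" />`,
};

// Later spreads win, so custom entries replace predefined ones with the same key.
const mergedMapper: ComponentMapper = { ...predefinedMapper, ...customMapper };

console.log(mergedMapper["text-field"]({ label: "Name" })); // <CustomInput label="Name" />
console.log(mergedMapper["select"]({ label: "State" }));    // <AntSelect label="State" />
```

The merged object is what would be passed to FormRenderer as the componentMapper prop, so only the overridden keys need a custom implementation.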

📐 FormTemplate

The available component mapper packages offer a predefined FormTemplate component to fit the design language of their respective Design Systems. At Certa, we have our own design system and needed a way to customize the button styles, placements and styling of the container form component. The best way to do this is to define our own custom FormTemplate component.

A very basic custom FormTemplate would look like this:

    import { useFormApi } from "@data-driven-forms/react-form-renderer";

    const FormTemplate = ({ schema, formFields }) => {
      const { handleSubmit } = useFormApi();
      return (
        <form onSubmit={handleSubmit}>
          {formFields}
          <button type="submit">Submit</button>
        </form>
      );
    };

👋 Final Words

There is some initial heavy lifting required to set up Data Driven Forms, especially if you have your own design system components. But once the custom component mappers, validators, and FormTemplate are good to go, the form-building effort is significantly lower, which saves a lot of developer effort and time in the long run.

Hope the Data Driven approach helps you and your team focus less on forms and more on the business logic of your product.

We're hiring!

If solving challenging problems at scale in a fully-remote team interests you, head to our careers page and apply for the position of your liking!

The Ultimate Hack for Supercharging Your CircleCI Pipeline!

Martin Siby — Mon, 06 Mar 2023 19:01:45 GMT

🚀 Prerequisites

An existing CircleCI pipeline.
Basic knowledge of Pytest.
Basic knowledge of YAML files.

🌅 Background

We have a couple thousand test cases as the test suite of the Django app powering Certa. To ensure CI, we are utilizing CircleCI to validate the PRs before merging them onto the repository's development branch.

😬 The problem

For the reasons listed below, developers were being slowed down and constantly had to monitor and re-run test cases when merging their PRs.

The total duration of the test run before optimization was ~28 min.
Due to the size & history of the test suite, flaky test cases used to be a frequent occurrence.

🤔 Solutions

📊 Distributing test cases based on timing
- By default, test cases are distributed among containers based on the file names, which can lead to skewed timings like this:
  
  Splitting based on timings shaves off much of the test duration at no extra cost.
  Steps:
  1. Upload test run report
    Configure the test run to generate a report once it completes:
```
- run: # this will create the dir to store the reports
    name: Make test report dir
    command: mkdir -p test-results/reports
- run: # split the test cases by timing and generate a report at the end of the test run
    name: Run tests and create report
    command: |
      TESTFILES=$(circleci tests glob "/**/test_*.py" | circleci tests split --split-by=timings)
      pytest ${TESTFILES} --junitxml=test-results/reports/junit.xml
- store_test_results: # stores the test results in CircleCI so that timing data is available for later runs
    path: test-results
```
    Note: By default, the reports are generated in the xunit2 format, which does not contain file paths and hence can't be used to generate timing data (at the time of writing this blog). A workaround we used is to specify the following in the setup.cfg file of the Django project:
```
[tool:pytest]
python_files = test*.py
junit_family = xunit1
```
  2. Split test cases based on timings
    Once the timing data is available on CircleCI, we can add split-by-timing to the YAML file as follows (reiterating):
```
- run: # collect the test files using glob, split them by timing, and generate a report at the end of the test run
    name: Run tests and create report
    command: |
      TESTFILES=$(circleci tests glob "/**/test_*.py" | circleci tests split --split-by=timings)
      pytest ${TESTFILES} -n 4 --reuse-db --junitxml=test-results/reports/junit.xml
```
    The run command does two things: it first collects all the test files using glob, and then starts the test run with timing as the split criterion.
  3. Set timing-type
    Now that we have timing data, we can set the granularity. CircleCI supports 4 different choices: filename, classname, testname and autodetect. The right choice depends on your needs, but classname worked best for our case.
- Cost vs Impact
  - No additional cost was associated with this change, but it significantly improved the timings, reducing the overall duration by 36%.
- Caveats
  - The store_test_results command needs a directory as input. The test results need to be stored inside this directory.
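
To build some intuition for why timing data helps, here is a minimal TypeScript sketch of timing-based splitting. It is an illustration only and not CircleCI's actual algorithm: the TimedTest shape, the splitByTimings function, and the sample durations are all assumptions made up for this example. It sorts test files by their last recorded duration and always hands the next one to the container with the smallest total so far.

```typescript
// Hypothetical shape: one entry per test file with its last recorded duration.
interface TimedTest {
  file: string;
  seconds: number;
}

// Greedy "longest first" assignment: sort by duration, then always give the
// next test to the container with the smallest running total.
function splitByTimings(tests: TimedTest[], containers: number): TimedTest[][] {
  const buckets: TimedTest[][] = [];
  const totals: number[] = [];
  for (let i = 0; i < containers; i++) {
    buckets.push([]);
    totals.push(0);
  }

  const sorted = [...tests].sort((a, b) => b.seconds - a.seconds);
  for (const test of sorted) {
    // Index of the container with the least accumulated time so far.
    const target = totals.indexOf(Math.min(...totals));
    buckets[target].push(test);
    totals[target] += test.seconds;
  }
  return buckets;
}

// Made-up sample durations, for illustration only.
const sampleTimings: TimedTest[] = [
  { file: "test_workflows.py", seconds: 120 },
  { file: "test_users.py", seconds: 60 },
  { file: "test_forms.py", seconds: 50 },
  { file: "test_utils.py", seconds: 10 },
];
const timingSplit = splitByTimings(sampleTimings, 2);
```

With this sample data, one container gets the single 120-second file and the other gets the remaining three files, which also add up to 120 seconds. A split that only balanced the number of files could put the two slowest files in the same container.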

📈 Increasing container count
This is an obvious option available to you, but the problem is that it will incur higher credit usage. The right number varies from project to project; we found that 8 containers of the large class are the ideal configuration for our use case, saving time while not burning away credits. Below are the YAML file changes:
```
  working_directory: ~/repo
  resource_class: large
  parallelism: 8
  steps:
    - attach_workspace:
        at: ~/repo
    - run: # we split the test cases using glob and generate the report at the end of the test run
        name: Run tests and create report
        command: |
          TESTFILES="$(circleci tests glob "/**/test_*.py" | circleci tests split --split-by=timings)"
          pytest ${TESTFILES} -n 4 --reuse-db --junitxml=test-results/reports/junit.xml
```
- Cost vs Impact
  - The add-on cost was around 100k credits per month. It reduced the timings by around 28%, but finding the sweet spot took a lot of time, since after each change to the number we had to run the pipeline completely to measure the impact.
- Caveats
  - This will be time-consuming. To make it easier, we pushed several commits with only the job container count changed and ran the pipelines in parallel manually. By default, CircleCI cancels a job when a new commit is pushed, so you must manually rerun the suite.
  - Increasing parallelism can also lead to an increase in flaky test cases, depending on your test suite. There can be multiple reasons for this - race conditions, resource contention, or synchronization issues. Read on to learn about the solutions that worked for us.

🏎 Parallel test runs with a job
If you already have parallel containers running the jobs, then one thing to keep track of is the resource usage section of the test run. Below is the screenshot of an unoptimized job.

As the screenshot shows, jobs were utilizing at most 50% of the CPU. An easy remedy we applied is to increase the number of parallel pytest instances running in a container (the -n option, provided by the pytest-xdist plugin). Below is the config change for increasing the instances to 4:
```
    working_directory: ~/repo
    resource_class: large
    parallelism: 8
    steps:
      - attach_workspace:
          at: ~/repo
      - run: # we split the test cases using glob and generate the report at the end of the test run
          name: Run tests and create report
          command: |
            TESTFILES="$(circleci tests glob "/**/test_*.py" | circleci tests split --split-by=timings)"
            pytest ${TESTFILES} -n 4 --reuse-db --junitxml=test-results/reports/junit.xml
```
Note: Low CPU resource usage is expected if your test suite is IO-heavy. In such cases lowering the resource type can yield some savings on credits.
- Cost vs Impact
  - No additional cost was associated with this change, and the impact was not great either. On average, it saved about 1-2 mins (3.8%) per test run.
- Caveats
  - When you tune this value, stop once a new value no longer gives at least a 15% improvement over the previous one; beyond that point, your cost increase will probably be unjustifiable.

💾 Using cache and workspace storage

External libraries or files that remain relatively constant between test runs can be cached to save a significant amount of time. We used:

- cache to store our Python libraries, with a key generated from the hash of the requirement files
- workspaces to create a common test env from which different test jobs start

The config is as follows:

```
prep-test-env:
  parameters:
    resource_class:
      type: string
      description: CircleCI resource class
      default: medium
  docker:
    - image: cimg/python:3.9.16
  working_directory: ~/repo
  resource_class: large
  steps:
    - checkout
    - run: # Generate a hash based on requirements
        name: Check if requirements changed
        command: |
          echo $(find ./requirements -type f -exec md5sum {} \; | md5sum | cut -d' ' -f1)  >> REQUIREMENTS_CACHE_KEY
    - restore_cache:
        keys:
          - dependencies-{{ checksum "REQUIREMENTS_CACHE_KEY" }}
    - run: # this will install any new dependency added
        name: Install python dependencies
        command: |
          python -m venv venv
          . venv/bin/activate
          pip install --upgrade setuptools
          pip install -e .
          for file in requirements/*.txt; do pip install -r "$file"; done
    - save_cache:
        paths:
          - venv
        key: dependencies-{{ checksum "REQUIREMENTS_CACHE_KEY" }}
    - persist_to_workspace:
        root: ~/repo
        paths: ./
```

In our job for the test run, we will attach this saved workspace as follows:

```
    steps:
      - attach_workspace:
          at: ~/repo
```

To specify the order in which the jobs run, we define the following in the workflows section:

```
workflows:
  version: 2
  app-tests:
    jobs:
      - prep-test-env:
          name: Prepare the test environment
      - run-test:
          name: Test run
          requires:
            - Prepare the test environment
```

This will ensure the prep-test-env runs before the test cases are run.

- Cost vs Impact
  - This caused an additional cost of ~3k credits per month, including caches and workspace storage. This saved about 40 seconds from each test run job, which is significant since each parallel job consumes credits independently.
- Caveats
  - Do make sure to set the usage policy to avoid storing irrelevant data. We have set it as follows:

🔄 Auto-rerun failed test cases
Once the optimizations were done, flaky test cases became the bane of our existence, which was counter-productive to the main objective of lowering the load on developers. We utilized a library called pytest-rerunfailures, which, as the name suggests, re-runs failed test cases. The catch is that this library is not required in production, and it is not maintained/secured to be installed there. Hence it was added to a test.txt file in the requirements directory. Since the test environment setup runs for file in requirements/*.txt; do pip install -r "$file"; done, it is installed during test runs and not on any other server.
```
      - run: # we split the test cases using glob and generate the report at the end of the test run
          name: Run tests and create report
          command: |
            pytest ${TESTFILES} -n 4 --reuse-db --junitxml=test-results/reports/junit.xml --reruns 3 -x
```
--reruns 3 reruns a failed test case up to 3 times; if any of the reruns passes, the test case counts as passed and the run goes ahead with the rest.
-x exits the run as soon as a test case fails, that is, once it has still failed after its 3 reruns.
- Cost vs Impact
  - No cost add-on for this change, but it reduced flakiness significantly, from 2-3 flaky test case failures per job run to 1 flaky failure in every 10-15 test runs.
- Caveats
  - Make sure to include -x in the command if your pipeline is configured to fail when even one test case fails, since that will save some credits.
    You could also rerun only test cases with a specific type of failure. For example, --only-rerun AssertionError will only rerun test cases that have failed due to an AssertionError.
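
The rerun behaviour can be modelled in a few lines. The TypeScript sketch below is a simplified model of what --reruns 3 means and not the plugin's implementation; passesWithReruns and the sample tests are hypothetical names invented for this illustration. A test is attempted once and then retried up to the given number of times, and it only counts as failed if every attempt fails.

```typescript
interface RerunResult {
  passed: boolean;
  attempts: number;
}

// Simplified model of `--reruns N`: one initial attempt plus up to N reruns.
function passesWithReruns(attempt: () => boolean, reruns: number): RerunResult {
  let attempts = 0;
  while (attempts <= reruns) {
    attempts++;
    if (attempt()) {
      return { passed: true, attempts };
    }
  }
  return { passed: false, attempts };
}

// A flaky test that fails twice and then passes.
let flakyCalls = 0;
const flakyTest = () => ++flakyCalls >= 3;
const flakyResult = passesWithReruns(flakyTest, 3);

// A genuinely broken test uses all 1 + 3 attempts and still fails;
// with -x, this is the point where the whole run would stop.
const brokenResult = passesWithReruns(() => false, 3);
```

In this model the flaky test is reported as passed on its third attempt, while the broken test is reported as failed after four attempts.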

We're hiring!

If solving challenging problems at scale in a fully-remote team interests you, head to our careers page and apply for the position of your liking!

Generating React Icon Components from Figma

Pawan Kolhe — Wed, 12 Jan 2022 12:32:12 GMT

Maintaining icons in a React project can be a mess. Some amount of automation can always make the lives of Frontend devs on your team simpler and save precious time.

At Certa, our designer maintains over 150 SVG icons on a Figma board, with new ones coming every now and then. Previously we had to go through the following tedious and manual process of adding an icon from Figma to our React project:

Exporting the icon as SVG format from Figma
Creating a React component file and appropriately naming it
Copying and pasting the SVG code into the component file
Exposing props that parent component can pass (color, size, etc)
Modifying SVG attributes (e.g. setting fill to currentColor)
Cleaning up unnecessary attributes
Exporting the React icon component from the index file

This was a slow, repetitive, and error-prone process. Certainly, we could write some scripts here to automate much of the process. So we did. But first, let us explore why the above process can be a problem.

🔎 The Problem

Problem 1

Everyone has their own way of performing manual work. Not every developer on the team will follow a consistent way of performing the above steps.

Let us say Alice downloaded the loader.svg icon from Figma and created a component called Spinner.tsx in the React project because that name seems to make sense to Alice. Bob comes along and looks for the loader icon by searching for a Loader.tsx file, assuming that icons are probably named according to their names on Figma. But Bob doesn't find the icon, so he creates a new icon component named Loader.tsx from loader.svg.

Now we have two icon components that are named differently but contain the same code. Code gets duplicated without anyone noticing, which is never good.

Problem 2

Another issue was inconsistent props across icon components. In some components, certain props (e.g. color and size) were added, and in others, they weren't. In some components, the default value of a prop (e.g. color) was set to a hex value and in others to currentColor. This led to unpredictable behavior when using icons, and unexpected bugs started haunting the UI.

🚀 The Plan

Our frontend codebase is a monorepo housing various packages, each having its own special purpose. One such package is called blocks (@certa/blocks). It houses the design system/building block components of our application UI such as the button, menu, tooltip, etc. Previously it housed the icons too, but it didn't quite feel like the right home for them.

The proposed idea was to create a new package (@certa/icons) in the monorepo specifically catering to icons and automating as much of the icon component generation process as possible.

Maybe not all the icons you need in your project are on Figma. Not a problem: we can also load SVG icons into the script from a specific folder as an additional source.

The new package would contain a script that would:

Fetch SVG icons from Figma
Load SVG icons added manually into a specific folder as an additional source
Clean and optimize the SVG code
Generate React components for each icon which expose common props with the same defaults
Create an index file that exports all the icon components

🧑‍💻 The Script

Here is a CodeSandbox for you to view the source code and run it yourself:

https://codesandbox.io/embed/icons-generator-jn248?expanddevtools=1&fontsize=14&hidenavigation=1&module=%2Fsrc%2Fgenerate.ts&theme=dark&view=editor

The script has been implemented using Node.js with TypeScript and it will make use of the following libraries:

figma-api-exporter - for fetching icons data from Figma
@svgr/core - transforming SVG code into React components
ts-node - enables executing TypeScript on Node.js without precompiling
fs-extra - adds file system methods that aren't included in the native fs module
axios - for downloading SVGs from Figma
dotenv - loading environment variables into the script
chalk - printing colored output to the terminal

Firstly, we need to install the required libraries:

```
yarn add react figma-api-exporter fs-extra axios dotenv @svgr/core@5.5.0 @svgr/plugin-prettier@5.5.0 @svgr/plugin-svgo@5.5.0 chalk@4.1.2
yarn add --dev typescript ts-node @types/react @types/node @types/fs-extra
```

Folder Structure

We'll be using the directory structure given below to organize our code:

```
icons-generator/
  src/
    icons/
      components/ (all generated React icon components)
        Bell.tsx
        Gear.tsx
        Home.tsx
      svgs/ (manually added SVGs)
        Gear.svg
      index.tsx
    templates/
    generate.ts
    types.ts
    utils.ts
    index.tsx
  .env
  svgr.config.js
  package.json
```

src/generate.ts is the primary script where the majority of our logic lives.

svgr.config.js is the SVGR library configuration file.

src/icons/components/ will contain all our generated React component files.

src/icons/svgs/ contains SVG files that are not in Figma but also need to be converted to React components.

src/icons/index.tsx is the index file that exports all icon React components from the components/ directory.

Setup

The first step in the script would be to fetch the metadata of icons from Figma. The metadata will contain S3 URLs of all the icons which can be used to download their corresponding SVG code. We will use figma-api-exporter for this purpose.

To use this library, we would need to set up our Figma Personal Access token that the library can use to access our Figma account. Secondly, we need the Figma file id of the Figma board containing the icons. And lastly, the canvas name is required to locate the icons.

Getting Figma Personal Access Token, File ID, and Canvas

Go to your Figma dashboard and open Settings from the top right dropdown menu.
Scroll down to the "Personal access tokens" section and create an API token. Keep the API token handy, we'll be adding it to the .env file soon.
Now to get the canvas name, look at the sidebar on Figma. The canvas is the name of the page on a Figma board where you kept the icons. In my case, the canvas is named "Icons".
One more thing we need is the Figma file id. Go to the Figma board where your icons reside and look for the file id in the URL: it is the alphanumeric string right after /file/ (22 characters in the example below).
```
https://www.figma.com/file/atT09besy7MsrjeyUsJLfP/Example-Board
```
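
If you would rather not copy the id by hand, it can be pulled out of the URL with a small helper. This is a minimal sketch that assumes the URL has the /file/<id>/<name> shape shown above; extractFigmaFileId is a hypothetical name and not part of any library used in this post.

```typescript
// Returns the path segment that follows "/file/" in a Figma URL,
// or null if the URL does not have that shape.
function extractFigmaFileId(url: string): string | null {
  const match = /figma\.com\/file\/([A-Za-z0-9]+)/.exec(url);
  return match ? match[1] : null;
}

const exampleFileId = extractFigmaFileId(
  "https://www.figma.com/file/atT09besy7MsrjeyUsJLfP/Example-Board"
);
// exampleFileId is "atT09besy7MsrjeyUsJLfP"
```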
Anyone who has your personal access token can get full access to your Figma account. It is best to save these values to a .env file so that we don't accidentally commit these values to version control. In the .env file, add the API token and the file id as key-value pairs.
```
FIGMA_API_TOKEN=<YOUR_FIGMA_API_TOKEN>
FIGMA_FILE_ID=<FIGMA_FILE_ID>
FIGMA_CANVAS=Icons
```
Open the package.json file and add the following entry in the scripts section:
```
 "icons": "ts-node ./src/generate.ts",
```
Now we can just use the yarn icons command to run the script.

Code

Enough talk, let's dive into the code.

Firstly, we load the environment variables.

```
// generate.ts
import * as dotenv from "dotenv";

// Loads environment variables from .env
dotenv.config();

// 1. Retrieve Figma Access Token, File ID and Canvas from .env file
const FIGMA_API_TOKEN = process.env.FIGMA_API_TOKEN;
const FIGMA_FILE_ID = process.env.FIGMA_FILE_ID;
const FIGMA_CANVAS = process.env.FIGMA_CANVAS;
```
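
Each of these values is undefined when the corresponding key is missing from .env, so it can help to fail early with a clear message. The sketch below shows one possible check; readFigmaConfig, FigmaConfig and Env are hypothetical names introduced here, and the function takes the environment as a plain object so that it can be tried without a real .env file.

```typescript
interface FigmaConfig {
  token: string;
  fileId: string;
  canvas: string;
}

type Env = { [key: string]: string | undefined };

// Collects the three Figma settings and throws if any of them is missing or empty.
function readFigmaConfig(env: Env): FigmaConfig {
  const required = ["FIGMA_API_TOKEN", "FIGMA_FILE_ID", "FIGMA_CANVAS"];
  const missing = required.filter(key => !env[key]);
  if (missing.length > 0) {
    throw new Error("Missing environment variables: " + missing.join(", "));
  }
  return {
    token: env["FIGMA_API_TOKEN"] as string,
    fileId: env["FIGMA_FILE_ID"] as string,
    canvas: env["FIGMA_CANVAS"] as string,
  };
}

// Placeholder values, for illustration only.
const exampleConfig = readFigmaConfig({
  FIGMA_API_TOKEN: "placeholder-token",
  FIGMA_FILE_ID: "placeholder-file-id",
  FIGMA_CANVAS: "Icons",
});
```

In the actual script, this would presumably be called as readFigmaConfig(process.env) right after dotenv.config().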

Then we use figma-api-exporter library to fetch metadata about the icons from Figma, providing it with the token, file id and canvas name.

```
// generate.ts
// 2. Fetch icons metadata from Figma
const exporter = figmaApiExporter(FIGMA_API_TOKEN);
exporter
  .getSvgs({
    fileId: FIGMA_FILE_ID,
    canvas: FIGMA_CANVAS
  })
  .then(async svgsData => {
    console.log('SVGs DATA:', svgsData);
  })
  .catch((err: unknown) => {
    // Log the failure before exiting so it is not swallowed silently
    console.error(err);
    process.exit(1);
  });
```

The console.log would output the following object structure:

```
{
    "svgs": [
        {
            id: "1:43",
            url: "https://s3-us-west-2.amazonaws.com/...",
            name: "House"
        },
        {
            id: "2:86",
            url: "https://s3-us-west-2.amazonaws.com/...",
            name: "Chevron down"
        },
        ...
    ],
    "lastModified": "2021-11-30T19:19:08Z"
}
```

The most interesting properties here are url and name.

We can use the url property of each icon to download the SVG code using the axios library. A simple get request will do the trick.

```
// utils.ts
import axios from "axios";

// Downloads the SVG code for every item and attaches it as `data`
export const downloadSVGsData = async <T extends {}>(
  data: ({ url: string } & T)[]
) => {
  return Promise.all(
    data.map(async dataItem => {
      const downloadedSvg = await axios.get<string>(dataItem.url);
      return {
        ...dataItem,
        data: downloadedSvg.data
      };
    })
  );
};

// generate.ts
// 3. Download SVG files from Figma
const downloadedSVGsData = await downloadSVGsData(svgsData.svgs);
```

Not all your icons might be in Figma, so a second source for adding SVG icons is great to have. Next, we load the manually added SVG files located in src/icons/svgs/ and combine them with the ones downloaded from Figma into a single array named allSVGs. fs.readdirSync comes in handy here, allowing us to retrieve the files in a given directory.

```
// generate.ts
// 4. Read manually added SVGs data
let manuallyAddedSvgs: { data: string; name: string }[] = [];
const svgFiles = fs
  .readdirSync(SVG_DIRECTORY_PATH)
  // Filter out hidden files (e.g. .DS_STORE)
  .filter((item) => !/(^|\/)\.[^/.]/g.test(item));
svgFiles.forEach((fileName) => {
  const svgData = fs.readFileSync(
    path.resolve(SVG_DIRECTORY_PATH, fileName),
    "utf-8"
  );
  manuallyAddedSvgs.push({
    data: svgData,
    // Strip only the trailing ".svg" extension before naming the component
    name: toPascalCase(fileName.replace(/\.svg$/i, ""))
  });
});
const allSVGs = [...downloadedSVGsData, ...manuallyAddedSvgs];
```
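
The toPascalCase helper used above is not shown in this post. A minimal sketch of what such a helper could look like is given below; this is an assumption about its behaviour and not the version from our codebase. It splits the name on anything that is not a letter or digit and capitalises the first character of each part.

```typescript
// Hypothetical helper: "Chevron down" -> "ChevronDown", "arrow-left" -> "ArrowLeft"
function toPascalCase(value: string): string {
  return value
    .split(/[^A-Za-z0-9]+/) // split on spaces, dashes, dots, underscores, ...
    .filter(part => part.length > 0)
    .map(part => part.charAt(0).toUpperCase() + part.slice(1))
    .join("");
}

const fromFigmaName = toPascalCase("Chevron down"); // "ChevronDown"
const fromFileName = toPascalCase("arrow-left");    // "ArrowLeft"
```

Only the first character of each part is changed, so a name that is already in Pascal case (such as "ChevronDown") passes through unchanged.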

Once we have all the SVG code, it is time to convert them into React components and write the files to the src/icons/components/ directory.

```
// generate.ts
// 5. Convert SVG to React Components
allSVGs.forEach(svg => {
  const svgCode = svg.data;
  const componentName = toPascalCase(svg.name);
  const componentFileName = `${componentName}.tsx`;
  // Converts SVG code into React code using SVGR library
  const componentCode = svgr.sync(svgCode, svgrConfig, { componentName });
  // 6. Write generated component to file system
  fs.outputFileSync(
    path.resolve(ICONS_DIRECTORY_PATH, componentFileName),
    componentCode
  );
});
```

The svgr.sync function is performing some magic here, converting SVG code into a React component. We will dive deeper into how it works in a later section.

After the React components are created, the process of generating an index file can begin. The index file would export all the React components, so they can be used outside our current package. Have to say, the final generated index file looks oddly satisfying. 😄

```
// utils.ts
export const createIndex = ({
  componentsDirectoryPath,
  indexDirectoryPath,
  indexFileName
}: IndexConfigProps) => {
  let indexContent = "";
  fs.readdirSync(componentsDirectoryPath).forEach(componentFileName => {
    // Convert name to pascal case
    const componentName = toPascalCase(
      componentFileName.substr(0, componentFileName.indexOf(".")) ||
        componentFileName
    );
    // Compute relative path from index file to component file
    const relativePathToComponent = path.relative(
      indexDirectoryPath,
      path.resolve(componentsDirectoryPath, componentName)
    );
    // Export statement
    const componentExport = `export { default as ${componentName} } from "./${relativePathToComponent}";`;
    indexContent += componentExport + os.EOL;
  });
  // Write the content to file system
  fs.writeFileSync(path.resolve(indexDirectoryPath, indexFileName), indexContent);
};

// generate.ts
// 7. Generate index.tsx
createIndex({
  componentsDirectoryPath: ICONS_DIRECTORY_PATH,
  indexDirectoryPath: INDEX_DIRECTORY_PATH,
  indexFileName: "index.tsx"
});
```

The above function is simply looping over all the icon components, converting the filename to a component name, computing the relative path of the component file, forming an export style string, and concatenating it to a string that then gets written to the index.tsx file.
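
To make the result concrete, here is a stripped-down sketch of the string-building part alone, without any file system access. buildIndexContent is a hypothetical name, and the sketch assumes the components live in a components/ folder next to the index file, as in the folder structure shown earlier.

```typescript
// Builds the text of the index file from a list of component file names.
function buildIndexContent(componentFileNames: string[], componentsDir: string): string {
  return componentFileNames
    .map(fileName => {
      // "Bell.tsx" -> "Bell"
      const dot = fileName.indexOf(".");
      const componentName = dot === -1 ? fileName : fileName.slice(0, dot);
      return `export { default as ${componentName} } from "./${componentsDir}/${componentName}";\n`;
    })
    .join("");
}

const exampleIndex = buildIndexContent(["Bell.tsx", "Gear.tsx", "Home.tsx"], "components");
// exampleIndex contains:
// export { default as Bell } from "./components/Bell";
// export { default as Gear } from "./components/Gear";
// export { default as Home } from "./components/Home";
```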

SVG to React Components

When it comes to raw SVG to React conversion, the SVGR library gets the job done. It provides additional plugins like SVGO that are able to shave off some excess SVG styling and help with size reduction.

I provided SVGR with a config where I customized the output React component code as per my requirements. The svgProps property is used to set the SVG attribute values. replaceAttrValues will replace an existing attribute value with a new one, very useful for changing a hexcode color for fill attribute to currentColor which essentially tells the SVG to inherit the color property value from a parent element.

```
// svgr.config.js
const componentTemplate = require("./src/templates/componentTemplate");

module.exports = {
  typescript: true,
  icon: true,
  svgProps: {
    width: "inherit",
    height: "inherit"
  },
  replaceAttrValues: {
    "#00164E": "currentColor"
  },
  plugins: [
    // Clean SVG files using SVGO
    "@svgr/plugin-svgo",
    // Generate JSX
    "@svgr/plugin-jsx",
    // Format the result using Prettier
    "@svgr/plugin-prettier"
  ],
  svgoConfig: {},
  template: componentTemplate
};
```
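
To illustrate the effect of the replaceAttrValues mapping, here is a rough string-level sketch. SVGR itself works on a parsed representation of the SVG rather than on raw strings, so this is only an approximation of the outcome; replaceAttrValuesInSvg and the sample SVG are made up for this example.

```typescript
// Replaces every attribute value that exactly matches a key of `replacements`.
function replaceAttrValuesInSvg(svg: string, replacements: { [from: string]: string }): string {
  return Object.keys(replacements).reduce(
    (result, from) => result.split(`="${from}"`).join(`="${replacements[from]}"`),
    svg
  );
}

const rawSvg = '<svg><path fill="#00164E" d="M0 0h16v16H0z"/></svg>';
const themedSvg = replaceAttrValuesInSvg(rawSvg, { "#00164E": "currentColor" });
// themedSvg: <svg><path fill="currentColor" d="M0 0h16v16H0z"/></svg>
```

Once fill is set to currentColor, the icon picks up whatever CSS color applies to it, which is what lets a single color prop on the wrapper control the icon.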

This is cool and all, but you might be wondering how we can customize the React component code that SVGR pushes out. Here is where templates come in. SVGR allows us to pass it a template function that is internally executed by the babel plugin babel-plugin-transform-svg-component and expects Babel AST (Abstract Syntax Tree) to be returned from the function.

AST is essentially a tree representation of program source code, and in our case represents the React component source code.

```
// componentTemplate.js
function componentTemplate(
  { template },
  opts,
  { imports, componentName, props, jsx, exports }
) {
  const code = `
    %%NEWLINE%%
    %%NEWLINE%%
    import * as React from 'react';
    import { IconProps } from '../../types';
    import { IconWrapper } from '../IconWrapper';
    %%NEWLINE%%
    const %%COMPONENT_NAME%% = (allProps: IconProps) => {
      const { svgProps: props, ...restProps } = allProps;
      return <IconWrapper icon={%%JSX%%} {...restProps} />
    };
    %%EXPORTS%%
  `;
  const mapping = {
    COMPONENT_NAME: componentName,
    JSX: jsx,
    EXPORTS: exports,
    NEWLINE: "\n"
  };
  /**
   * API Docs: https://babeljs.io/docs/en/babel-template#api
   */
  const typeScriptTpl = template(code, {
    plugins: ["jsx", "typescript"],
    preserveComments: true,
    syntacticPlaceholders: true
  });
  return typeScriptTpl(mapping);
}

module.exports = componentTemplate;
```

The template API from @babel/template is provided as part of the props. It is responsible for converting our code from a string format to AST format. We pass the mappings to the placeholders in our code when invoking the function returned by the template function.

If you've noticed, in the custom template we have an IconWrapper component that gets passed the SVG code via props. The SVG code is rendered inside a span. The wrapper has the benefit of letting us modify the props' behavior in one place, as opposed to keeping the logic in each generated React component.

```
// IconWrapper.tsx
import * as React from "react";
import { IconProps } from "../types";

export const IconWrapper: React.FC<{ icon: React.ReactNode } & IconProps> = ({
  icon,
  color: colorProp,
  size: sizeProp,
  autoSize,
  ...restProps
}) => {
  const color = colorProp ? colorProp : "currentColor";
  const size = sizeProp ? `${sizeProp}px` : autoSize ? "1em" : "16px";
  return (
    <span
      role="img"
      aria-hidden="true"
      style={{
        color: color,
        width: size,
        height: size,
        display: "inline-flex",
        fontSize: "inherit"
      }}
      {...restProps}
    >
      {icon}
    </span>
  );
};
```
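
The fallback order the wrapper applies to the size can be isolated as a small pure function, which makes the defaults easy to see and to check. This is a sketch that mirrors the ternary in the wrapper; resolveIconSize is a hypothetical name, and it assumes the size prop is a number of pixels, since IconProps is not shown in this post.

```typescript
// An explicit size wins, then autoSize ("1em"), then the 16px default.
function resolveIconSize(size?: number, autoSize?: boolean): string {
  if (size) {
    return `${size}px`;
  }
  return autoSize ? "1em" : "16px";
}

const explicitSize = resolveIconSize(24);             // "24px"
const fontRelative = resolveIconSize(undefined, true); // "1em"
const defaultSize = resolveIconSize();                 // "16px"
```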

🎨 Make it Pretty

Once the script is ready, you can improve it by adding error handling and descriptive messages. We at Certa have also abstracted away a lot of dynamic values like the index file directory, index file name, icons component directory, etc. into a separate config file to be able to easily configure changes.

If you love good design like me, you might want to use additional libraries like ora and chalk to make the developer experience awesome while waiting for the icons to be generated. Ora is what adds the cool-looking spinners to the terminal while something is loading and Chalk allows you to bring the terminal to life with colors.

Look how beautiful the final CLI tool looks:

👋 Final Words

It seems like a lot of work to get the script set up, but once it works, you gain complete control over your icons. Plus it saves a ton of time and makes your code more reliable.
