Love.Law.Robots. by Ang Hou Fu

Copyright

Feature image

In 2021, I discovered something exciting — an application of machine learning that was both mind-blowing and practical.

The premise was simple. Type a description of the code you want in your editor, and GitHub Copilot will generate the code. It was terrific, and many people, including myself, were excited to use it.

🚀 I just got access to @github Copilot and it's super amazing!!! This is going to save me so much time!! Check out the short video below! #GitHubCopilot I think I'll spend more time writing function descriptions now than the code itself :D pic.twitter.com/HKXJVtGffm

— abhishek (@abhi1thakur) June 30, 2021

The idea that you can prompt a machine to generate code for you is obviously interesting for contract lawyers. I believe we are getting closer every day. I am waiting for my early access to Spellbook.

As a poorly trained and very busy programmer, it feels like I am a target of Github Copilot. The costs was also not so ridiculous. (Spellbook Legal costs $89 a month compared to Copilot's $10 a month) Even so, I haven't tried it for over a year. I wasn’t comfortable enough with the idea and I wasn’t sure how to express it.

Now I can. I recently came across a website proposing to investigate Github Copilot. The main author is Matthew Butterick. He’s the author of Typography for Lawyers and this site proudly uses the Equity typeface.

GitHub Copilot investigation · Joseph Saveri Law Firm & Matthew ButterickGitHub Copilot investigation

In short, the training of GitHub Copilot on open source repositories it hosts probably raises questions on whether such use complies with its copyright licenses. Is it fair use to use publicly accessible code for computational analysis? You might recall that Singapore recently passed an amendment to the Copyright Act providing an exception for computational data analysis. If GitHub Copilot is right that it is fair use, any code anywhere is game to be consumed by the learning machine.

Of course, the idea that it might be illegal hasn’t exactly stopped me from trying.

The key objection to GitHub Copilot is that it is not open source. By packaging the world’s open-source code in an AI model, and spitting it out to its user with no context, a user only interacts with Github Copilot. It is, in essence, a coding walled garden.

Copi­lot intro­duces what we might call a more self­ish inter­face to open-source soft­ware: just give me what I want! With Copi­lot, open-source users never have to know who made their soft­ware. They never have to inter­act with a com­mu­nity. They never have to con­tribute.

For someone who wants to learn to code, this enticing idea is probably a double-edged sword. You could probably swim around using prompts with your AI pair programmer, but without any context, you are not learning much. If I wanted to know how something works, I would like to run it, read its code and interact with its community. I am a member of a group of people with shared goals, not someone who just wants to consume other people’s work.

Matthew Butterick might end up with enough material to sue Microsoft, and the legal issues raised will be interesting for the open-source community. For now, though, I am going to stick to programming the hard way.

#OpenSource #Programming #GitHubCopilot #DataMining #Copyright #MachineLearning #News #Newsletter #tech #TechnologyLaw

Author Portrait Love.Law.Robots. – A blog by Ang Hou Fu

Feature image

There’s a movement brewing between the lines of Twitter and within the deeper reaches of GitHub. Somebody is trying to “open source” contracts. You might have come across the term “open source” when downloading your favourite web browser. Open-source software is free, and it works. Is that what “open source” would mean for contracts?

I liked how Bonterms describes the motivation behind the endeavour:

Look inside the stack of nearly any major cloud application and you’ll find open source code, and lots of it. Developers leverage any existing package they can find before writing a line of code on their own. And they spend hours happily contributing back improvements to the projects they use. Open source has fundamentally transformed software development for the benefit of the entire ecosystem. But, could lawyers do the same? Could you possibly get law firm and in-house lawyers with the relevant domain experience to come together to articulate best practices, collaborate on drafting and then give their work product away for free? Yes, it turns out, you can. You just have to ask and provide a forum for working together and engaging in friendly, detailed debate.

Could time-starved lawyers used to charging by the hour be more like programmers and give what they do for free?

Standards, standards everywhere

Previously I wrote about an “open source” contract — OneNDA. There’s been good news on that front. They transformed themselves into Claustack and came out with oneDPA, backed by PwC and ContractPodAI.

Will oneNDA rule them all?oneNDA, a crowdsourced NDA, says it has standardised the NDA. Cue the sceptic in 3... 2... 1...Love.Law.Robots.HoufuBack when OneNDA first came out, I hesitated to join the “hivemind”. My opinion has improved since.

Other “open source” contracts have sprouted out recently — check out Bonterms and Common Paper.

It’s striking how similar these efforts are — all of them use some “cover page” mechanism to contract and are written by a “committee” of lawyers.

Here’s another similarity: all of them discourage modifying their templates.

You can see this from the particular license chosen by these projects. OneNDA chose CC-BY-ND 4.0 (the ND means no derivatives), and the others chose CC-BY (You might be able to make changes, even for commercial purposes, but you must credit the project when you make changes. How do you do that in a contract? 🤷🏻).

Even if you don’t know the difference between the various Creative Commons licenses, you’d be sufficiently discouraged by the documentation. One of the answers in the OneNDA FAQ is, “Yes, you can do whatever you like with it except actively allow or encourage people to change anything in oneNDA other than the variables.”

After I thought harder about the distinctions, I realised these projects aren’t so much about open source but standardisation. If everyone uses a particular contract, there will be massive benefits to all involved. However, you must agree to its restrictions — You can only modify the variables or the cover page. To use the contract, you must agree to all the choices and tradeoffs made by the project.

Philosophically, I disagree with this sort of standardisation. It’s apropos to introduce some XKCD:

Don’t get me wrong. I’m not going to sneer if I saw a OneNDA in the wild (I haven’t).

But I won’t overestimate the impact of these competing efforts at standardisation. On the one hand, nothing is stopping me from modifying any template. On the other, I don’t get any benefit from adhering to one too.

OneNDA becomes Claustack — now a Community!

There is another aspect of “open source” that these projects might be alluding to. Open source development takes place in an open forum where anyone can contribute — on a mailing list, the GitHub issues page or some Discord server.

This idea that anyone can contribute appears to be anthemic to law. In the open source contracts I covered, all of them highlighted that they are supported or drafted by “experts” in their fields (I am a bit sceptical that someone would call themselves an NDA specialist). Both Common Paper and Bonterms have GitHub repositories for their contracts but don’t appear to accept contributions.

This brings me to Claustack. As mentioned above, it used to be OneNDA only, but now they have created a platform described as “GitHub meets StackOverflow – for lawyers”. The focus is not on the few documents that they are in charge of, but also on others including Bonterms and Common Paper. So, it is now a collection of resources, and a forum for people to provide feedback and suggestions, and at some level, be involved in its development. I liked this iteration better, so I joined up.

A contract standard might sound pointless because there are few, if any, restrictions to ensure you adhere to it. However, if there was a critical mass of users — a community — using, advocating and helping others on it, that is a recipe for conquering the world.

In “Forge your Future with Open Source”, a book about open source and how you can contribute to it, author VM Brassuer writes:

... the most important aspect of free and open source software isn’t the code; it’s the people. Contribution to [free and open source software] is about so much more than simply code, design, or documentation; it’s about participation and community. The licenses make the software available, but the people make the software, and the community supports the people. Remove one piece from this equation, and the entire system falls apart.

The quality of a contract might be important, and the licensing, the design and the cost of adoption are probably important too. But what would keep such a project going would be its people. At that point, more people have a stake in the success of the project, not just its founders or commercial backers.

Building a community wouldn’t be easy...

Although I am cautiously optimistic about how Claustack is turning out, it’s still early days for these open source contracts. More has to be done in order to persuade other folks to contribute and advocate.

My lack of faith probably stems from my experience and observation that open source projects dealing directly with law and lawyers are very few and far in between.

Open Source Legal: The Open Legal DirectoryOpen Source Legal is a central repository and review database of open and open source legal standards, applications, platforms and software libraries. It’s meant to help the legal engineering community track and develop a set of community-driven tools and standards to improve legal service delivery…Open Source LegalYou can check out other open-source projects listed here.

One such project which actually has a community is docassemble. They even have a yearly “DocaCon”. I attended my first last year (when the event was in person it was impossible for me to travel to Boston to attend it), and found a pretty weird tribe. Most of the excitement involved access to justice (A2J) implementations of docassemble, not something you would find in law firms or legal departments. I was excited at an effort to bring testing to docassemble interviews, again, I would never discuss this anywhere else.

In a recent interview on LawNext Podcast, Jonathan Pyle, benevolent dictator of docassemble, said this about his motivations for docassemble:

No, I really like to not make any money off of [docassemble]. It’s because I would really like being able to be honest to other people... I like being able to advise people not to use my code. It’s just so much easier if I could just concentrate on the technology and creating new features and not having to worry about making a living. It’s kind of nice to do something nice in the nights and weekends.

I can’t name another project like this.

Lack of opportunities is not the only problem. Culturally, lawyers seemed to be “trained” not to collaborate with each other.

Being #1 isn’t always a good thing—loneliness among lawyers (296) | Legal EvolutionSuccess as a lawyer can come at the expense of personal relationships. Is it worth the price? Few of my former partners in the global firm where I workedLegal EvolutionTom SharbaughIn this detailed narrative, associates, partners and law students confront loneliness.

Echoes of this also come from a recent interview with Mary O’Carroll on Artificial Lawyer.

If you have three lawyers in a room, and someone has information that can make someone else look good, will they help the other lawyers? Knowledge sharing between lawyers is not incentivised in training programmes. But, in a corporate setting you have to flex that muscle, i.e. collaboration and teamwork. The problem is that lawyers are trained to be the smartest person in the room. They don’t work cross-functionally in law firms. In a company however, every team has to work with every other team across the business.

Building a community for a normal open source project is really difficult. The question when it comes to open source contracts is: do lawyers even want a community?

Conclusion

An early draft of this post started by asking whether calling a contract “open source” is a PR stunt. It’s not fair to cast aspersions on an open source contract being given out for free when the usual course is not to share at all. Even so, one also has to be judicious about the way you spend your own time, something which lawyers are definitely (maybe overly) familiar with. Building an open source community is difficult, but that is what would make such a project sustainable. I’ll be keeping a lookout and hopefully there is a place for someone who wants to contribute.

#Contracts #docassemble #oneNDA #Claustack #Bonterms #CommonPaper #Law #Lawyers #Copyright

Author Portrait Love.Law.Robots. – A blog by Ang Hou Fu

I have been mulling over developing an extensive online database of free legal materials in the flavour of OpenLawNZ or an LII for the longest time. Free access to such materials is one problem to solve, but I'm also hoping to compile a dataset to develop AI solutions. I have tried and demonstrated this with PDPC's data previously, and I am itching to expand the project sustainably.

However, being a lawyer, I am concerned about the legal implications of scraping government websites. Would using these materials be a breach of copyright law? In other countries, people accept that the public should generally be allowed to use such public materials. However, I am not very sure of this here.

The text steps highlightedPhoto by Clayton Robbins / Unsplash

I was thus genuinely excited about the amendments to the Copyright Act in Singapore this year. According to the press release, they will be operational in November, so they will be here soon.

Copyright Bill – Singapore Statutes OnlineSingapore Statutes Online is provided by the Legislation Division of the Singapore Attorney-General’s ChambersSingapore Statutes OnlineThe Copyright Bill is expected to be operationalised in November 2021.

[ Update 21 November 2021: The bill has, for the most part, been operationalised.]

Two amendments are particularly relevant in my context:

Using publicly disclosed materials from the government is allowed

In sections 280 to 282 of the Bill, it is now OK to copy or communicate public materials to facilitate more convenient viewing or hearing of the material. It should be noted that this is limited to copying and communicating it. Presumably, this means that I can share the materials I collected on my website as a collection.

Computational data analysis is allowed.

The amendments expressly say that using a computer to extract data from a work is now permitted. This is great! At some level, the extraction of the material is to perform some analysis or computation on it — searching or summarising a decision etc. I think some limits are reasonable, such as not communicating the material itself or using it for any other purpose.

However, one condition stands out for me — I need “lawful access” to the material in the first place. The first illustration to explain this is circumventing paywalls, which isn’t directly relevant to me. The second illustration explains that obtaining the materials through a breach of the terms of use of a database is not “lawful access”.

That’s a bit iffy. As you will see in the section surveying terms, a website’s terms are not always clear about whether access is lawful or not. The “terms of use” of a website are usually given very little thought by its developers or implemented in a maximal way that is at once off-putting and misleading. Does trying to beat a captcha mean I did not get lawful access? Sure, it’s a barrier to thwart robots, but what does it mean? If a human helps a robot, would it still be lawful?

A recent journal article points to “fair use” as the way forward

I was amazed to find an article in the SAL Journal titled “Copying Right in Copyright Law” by Prof David Tan and Mr Thomas Lee, which focused on the issue that was bothering me. The article focuses on data mining and predictive analytics, and it substantially concerns robots and scrapers.

Singapore Academy of Law Journale-First MenuLink to the journal article on E-First at SAL Journals Online.

On the new exception for computational data analysis, the article argues that the two illustrations I mentioned earlier were “inadequate and there is significant ambiguity of what lawful access means in many situations”. Furthermore, because the illustrations were not illuminating, it might create a situation where justified uses are prohibited. With much sadness, I agree.

More interestingly, based on some mathematics and a survey, the authors argue that an open-ended general fair use defence for data mining is the best way forward. As opposed to a rule-based exception, such a defence can adapt to changes better. Stakeholders (including owners) also prefer it because it appeals to their understanding of the economic basis of data mining.

You can quibble with the survey methodology and the mathematics (which I think is very brave for a law journal article). I guess it served its purpose in showing the opinion of stakeholders in the law and the cost analysis very well. I don’t suspect it will be cited in a court judgement soon, but hopefully, it sways someone influential.

We could use a more developer-friendly approach.

Photo by Mimi Thian / Unsplash

There was a time when web scraping was dangerous for a website. In those times, websites can be inundated with requests by automated robots, leading them to crash. Since then, web infrastructure has improved, and techniques to defeat malicious actors have been developed. The great days of “slashdotting” a website has not been heard of for a while. We’ve mostly migrated to more resilient infrastructure, and any serious website on the internet understands the value of having such infrastructure.

In any case, it is possible to scrape responsibly. Scrapy, for example, allows you to queue requests regularly or identify yourself as a robot or scraper, respecting robots.txt. If I agreed not to degrade a website’s performance, which seems quite reasonable, shouldn’t I be allowed to use it?

Being more developer-friendly would also help government agencies find more uses for their works. For now, most legal resources appear to cater exclusively for lawyers. Lawyers will, of course, find them most valuable because it’s part of their job. However, others may also need such resources because they can’t afford lawyers or have a different perspective on how information can be helpful. It’s not easy catering to a broader or other audience. If a government agency doesn’t have the resources to make something more useful, shouldn’t someone else have a go? Everyone benefits.

Surveying the terms of use of government websites

RTK survey in quarryPhoto by Valeria Fursa / Unsplash

Since “lawful access” and, by extension, “terms of use” of a website will be important in considering the computational data analysis exceptions, I decided to survey the terms of use of various government agencies. After locating their treatment of the intellectual property rights of their materials, I gauge my appetite to extract them.

In all, I identified three broad categories of terms.

Totally Progressive: Singapore Statutes Online 👍👍👍

Source: https://sso.agc.gov.sg/Help/FAQ#FAQ_8 (Accessed 20 October 2021)

Things I like:

  • They expressly mention the use of “automated means”. It looks like they were prepared for robots!
  • Conditions appear reasonable. There’s a window for extraction and guidelines to help properly cite and identify the extracted materials.

Things I don’t like:

  • The Singapore Statutes Online website is painful to extract from and doesn’t feature any API.

Comments:

  • Knowing what they expect scrapers to do gives me confidence in further exploring this resource.
  • Maybe the key reason these terms of use are excellent is that it applies to a specific resource. If a resource owner wants to make things developer-friendly, they should consider their collections and specify their terms of use.

Totally Bonkers: Personal Data Protection Commission 😖😖😖

Source: https://www.pdpc.gov.sg/Terms-and-Conditions (Accessed 20 October 2021)

Things I like:

  • They expressly mention the use of “robots” and “spiders”. It looks like they were prepared!

Things I don’t like:

  • It doesn’t allow you to use a “manual process” to monitor its Contents. You can’t visit our website to see if we have any updates!
  • What is an automatic device? Like a feed reader? (Fun fact: The PDPC obliterated their news feed in the latest update to their website. The best way to keep track of their activities is to follow their LinkedIn)
  • PDPC suggests that you get written permission but doesn’t tell you what circumstances they will give you such permission.
  • I have no idea what an unreasonable or disproportionately large load is. It looks like I have to crash the server to find out! (Just kidding, I will not do that, OK.)

Comments:

  • I have no idea what happened to the PDPC, such that it had to impose such unreasonable conditions on this activity (I hope I am not involved in any way 😇). It might be possible that someone with little knowledge went a long way.
  • At around paragraph 6, there is a somewhat complex set of terms allowing a visitor to share and use the contents of the PDPC website for non-commercial purposes. This, however, still does not gel with this paragraph 20, and the confusion is not user or developer-friendly, to say the least.
  • You can’t contract out fair use or the computational data analysis exception, so forget it.
  • I’m a bit miffed when I encounter such terms. Let’s hope their technical infrastructure is as well thought out as their terms of use. (I’m being ironic.)

Totally Clueless: Strata Titles Board 🎈🎈🎈

Materials, including source code, pages, documents and online graphics, audio and video in The Website are protected by law. The intellectual property rights in the materials is owned by or licensed to us. All rights reserved. (Government of Singapore © 2006).
Apart from any fair dealings for the purposes of private study, research, criticism or review, as permitted in law, no part of The Website may be reproduced or reused for any commercial purposes whatsoever without our prior written permission.

Source: https://www.stratatb.gov.sg/terms-of-use.html# (Accessed 20 October 2021)

Things I like:

  • Mentions fair dealing as permitted by law. However, they have to update to “fair use” or “permitted use” once the new Copyright Act is effective.

Things I don’t like:

  • Not sure why it says “Government of Singapore ©️ 2006”. Maybe they copied this terms of use statement in 2006 and never updated it since?
  • You can use the information for “commercial purposes” if you get written permission. It doesn’t tell you in what circumstances they will give you such permission. (This is less upsetting than PDPC’s terms.)
  • It doesn’t mention robots, spiders or “automatic devices”.

Comments:

  • It’s less upsetting than a bonkers terms of use, but it doesn’t give me confidence or an idea of what to expect.
  • The owner probably has no idea what data mining, predictive analytics etc., are. They need to buy the new “Law and Technology” book.

Conclusion

One might be surprised to find that terms of using a website, even when supposedly managed by lawyers, feature unclear, problematic, misleading, and unreasonable terms. As I mentioned, very little thought goes into drafting such terms most of the time. However, they provide obstacles to others who may want to explore new uses of a website or resource. Hopefully, more owners will proactively clean up their sites once the new Copyright Act becomes effective. In the meantime, this area provides lots of risks for a developer.

#Law #tech #Copyright #DataScience #Government #WebScraping #scrapy #Singapore #PersonalDataProtectionCommission #StrataTitlesBoard #DataMining

Author Portrait Love.Law.Robots. – A blog by Ang Hou Fu

Feature image

I am not going to be shy about this. I really hate reading Non-Disclosure Agreements. Sometimes, they are known as NDAs, Confidentiality Agreements, and Mutual Confidentiality Agreements. Whatever guise they are in, they seem to say the same thing in many different ways. If there was a nuance, it's difficult to say what actual impact they have. Yet, we still have to figure it out and review them as necessary. Because somebody probably forgot to put in something that we need. Once they are signed, we move on. In short, a lot of time is spent on NDAs, but nobody cares. 🤢

That's what @thelawboutique_ execs Electra and Roisin found when they sat down to review the agreements they'd handled for their clients.

The #NDA effort-to-value ratio isn’t just appalling for the service providers – for in-house counsel it’s even worse. Can you relate? pic.twitter.com/CAuHQAkPQR

— oneNDA (@onendaclub) July 26, 2021

This kind of dysfunction is ripe for disruption. NDAs are probably one of the most straightforward problems in using AI for contract review. They have a certain structure and a limited set of clauses. You don't need to look far for an example — check NDAlynn. You can even enjoy NDALynn for free if you don't mind your document becoming part of the hivemind.

oneNDA is a different kind of disruptor. It suggests that life would be much easier for everyone if we sat together and agreed on one NDA. If you're curious what that crowdsourced NDA looks like, you can take a look at their website or download it here.

oneNDA Is Live! An M&A Module Is NextThe oneNDA project, which has sought to create a universal Non-Disclosure Agreement to cover confidential information, has gone live and is now available. However, the project will not stop th…Artificial Lawyerartificiallawyer

What I liked about oneNDA

  • It's short, simple and pretty. At two pages, they have really condensed the document into its finer parts.
  • The so-called “variables” only cover about half a page. Agreeing on the details of this document will probably take less than a few minutes. This is probably more substantial than many NDAs I have seen regarding time and costs saved (if that is the only thing you have to agree on).
  • It's licensed underCC BY-ND 4.0, a Creative Commons license. So yeah, I can share it here.

What I didn't like about oneNDA

  • Choices have been made, and not everyone will agree with them. You can check out their graveyard to read about what they decided to leave out in oneNDA. I can understand why they did it, but that will not stop anyone from adding it back in. You would then have to spend time reviewing it, which defeats the purpose.
  • It's licensed with ND (No Derivatives). This means you can't change and then distribute it, including sending it to your counter-party. It's a plausible controversy whether contract clauses can be copyrighted (they obviously can). I guess dictating how the NDA is used can help adoption by building a strong identity. In my view and having some background in open source, I honestly think this ND qualification is unnecessary. I would definitely think twice about these restrictions before using them.

My Conclusion

The creators of oneNDA have made a smart decision by aiming first for an NDA that is the simplest — having a commercial discussion. Logically speaking, it's like buttoning your shirt as you prepare for a business meeting. You shouldn't be spending much (if possible, any) lawyer or negotiation time on this sort of agreement. This NDA definitely helps you to reach that conclusion.

However, if you've reviewed enough NDAs, you would know that the arguments which prevent us from having a universal NDA aren't entirely rational. Somebody thinks their template is better, and we are going to have a debate over it. Sometimes it's easier to argue that we should stick to our template than selling a change which sounds like losing our freedoms. Maybe our language has been “tested”, and oneNDA hasn't. If I was conservative, having as many clauses in as possible is safer than leaving something out.

Reviewing NDAs is the express route to learning why contracts are dysfunctional.

Finally, there's a problem with NDAs. I first found out when I wrote my own NDA generator. Everyone sort of agrees that NDAs are low hanging fruit, but the reward of solving the problem isn't sweet or worth shouting to management. Nobody cares about NDAs, so nobody cares about the solution. Here lies an important lesson in innovation — some problems just aren't worth solving. The conclusion of my NDA generator was that people nodded at the “proof of concept”. Conversely, a letter generator that wasn't particularly complex or legal had a greater impact. They used it more often, and it saved them from learning how to choose a template.

On the other hand, like buttoning a business shirt, it wouldn't impress you if a button was in a different shape, or featured “magnets”.

oneNDA Is mediocreNDA: Thoughts on a Proposed Standard Nondisclosure Agreement – Adams on Contract DraftingoneNDA is a new initiative that “set out on a mission to standardise the NDA so that lawyers can spend less time on them and more time on more valuable work.” For more about oneNDA, go here. This week they released their “simple, plain English, open-source NDA,” also called, somewhat confusingly, on…Adams on Contract DraftingKen AdamsIt's great to know that I ain't alone in my conclusions.

So, while I am a firm supporter of standards, I am not excited about this one. oneNDA is great, but it is just another NDA.

Standards, the Importance of StandardsI explore the possibilities of using a taxonomy from the Legal Matter Standard Specification from SALI the Alliance.Love.Law.Robots.HoufuRead about a standard I am actually excited about.

Looking Forward

I may not be excited about oneNDA in its current state, but I am more curious about how technically they would implement modules for M&A. I hope it will be a useful tool that provides convenience and simplicity for users, and not just a choose your own NDA. Anything that keeps NDAs away from lawyers will be a boon for the whole process.

For now, count this sceptic out of the hivemind.

Enjoyed this post? You can read more about law or technology on Love.Law.Robots. by subscribing today. I post at least once a week and subscribers get a free members only newsletter. Thank you!

#Law #Contracts #Copyright #LegalTech #oneNDA

Author Portrait Love.Law.Robots. – A blog by Ang Hou Fu