Auto-generate reports via coding & AI

Updated: Feb 17

Can sector snapshots, market studies, freedom to operate searches, business plans, competition analysis etc. be automated and reports generated on demand?


Below is a healthcare sector snapshot that was auto-generated reports via code and using various AI techniques. Before glancing over it, here are some notes:


  • The report is a series of snapshots to illustrate the possibility and is not intended to be comprehensive.

  • Ignore the lack of aesthetic aspects as UX was not the focus of this report.

  • The focus was given on auto-generation and hence human editing has been avoided.

  • The only human intervention was in inserting takeaways to point out trends etc. (these could be automated too)

  • The data is focused on startups in India and for a small timeline of Dec 2017 to April 2018.

  • Patent analysis is extended to 5 years.



Healthcare report

Topics

Slide 2: What are people following in the startup ecosystem?


Nature of money flow

Healthcare startup investments

Slide 3 & 4: Which sectors raised money? Where does healthcare stack-up? Within healthcare which startups raised funding?



Characterize startup

healthcare startups

Slide 5 & 6: Expand understanding of the startups that received funding and classify them



Classification of funded startups

Slide 7: Organically come up with a new way to classify & cluster startups. This provides a great richness to understand startups and relevant competition



Deep Dive

Slide 8: Conduct a technology deep dive on each startup. An example here — understand the patent landscape


Such a report can be generated on-demand literally at that moment. The report will represent the state of the sector at that moment and can be generated multiple times. Slide generation is automated using Python libraries like “python-pptx”. This can also be used for adding logos and improving UX visual appeal.


The above report demonstrates various aspects. For example, company analysis, VC money flow, patent analysis, etc. This is just the beginning. A lot more can be added like earning transcript analysis, investor decks, founder analysis, peer-reviewed publication analytics, company image analysis etc


The cost of building this is minimal. Upfront R&D is to figure out the data, cleaning it and writing algorithms. The operational cost is primarily storage costs of data and any 3rd party API call charges. At such low cost of report generation, new business models like micro-subscription models can be opened up thereby disrupting consulting, media, publishing & report generation industries.


The above report was achieved through a mash-up of technologies as can be seen in the picture below.



Report generation work flow

The opportunity is to structure various reports at different depths of detail. To illustrate this, find below 2 charts focused on AI and in specific Deep Learning. This is done by converting patents into a multi-dimensional vector space using AI and clustering them. One can go to greater depths with specific domain knowledge-based extraction like analyzing what neural network architectures are being used in the patents being filed using deep learning. Between the two charts, one can see there is a trend towards understanding deep learning neural models from the brain, applying on speech & image data-sets, and also analyzing anomalies in telecom network packets.



Deep Learning Patent Landscape

Network Architecture

Slide 9 & 10: Understand upcoming technologies and architectural trends. In this case for Deep Learning


What’s next?


Accuracy can be improved with more training and better vocabulary building. Human editing can take it to great levels. Augmenting humans to write reports I believe is a promising area that I tend to call Augmented Reporting. But what is keeping me excited is the picture below. Adding an AI layer that can take contextual input and intelligently generate the schema of reports and produce them. Multiple layers of intertwined AI like our brain. That’s the future of AI and the one that will penetrate the higher end strategic tasks.



Agumented Report


18 views0 comments

Recent Posts

See All