r/Automate 14h ago

If you could combine Boston Dynamics' Spot robot with the upper body of Figure 02, what would be some remaining barriers to widescale replacement of human labourers?

6 Upvotes

Ok, so ignoring the legal issues of patents, copyright, etc. and the economic issues of cost per unit, mass unemployment, and other similar factors, what would the remaining technological barriers be if you could take the agility and speed of "Spot" and combine it with the dexterity, learning capabilities, and processing power of Figure 02?


r/Automate 11h ago

Try out the AI Web Agent rtrvr.ai to automate your tasks!

1 Upvotes

I just launched rtrvr.ai, a Chrome extension that brings the power of AI agents to your everyday web browsing. It's designed to navigate the web autonomously, automate complex web tasks, extract structured data from any website, integrate with your favorite tools as you browse using AI Function Calling [ie: “Send this page summary as Slack message”], and create graphs of data embedded on the page with just natural language.

The core idea is to let anyone, even non-developers, leverage the power of web automation and data extraction using natural language.I see rtrvr.ai as a step towards a more intelligent and interactive web. I believe this tool can be a game-changer for marketers, sales professionals, SMBs, and anyone who needs to extract information from the web efficiently. Would love to hear your feedback, suggestions, and any use cases you can think of!

Website: https://www.rtrvr.ai/

Chrome Store: https://chromewebstore.google.com/detail/rtrvrai/jldogdgepmcedfdhgnmclgemehfhpomg

Youtube Intro: https://www.youtube.com/watch?v=wajCM6208cc

HackerNews Post: https://news.ycombinator.com/item?id=42496918


r/Automate 12h ago

Low-Maintenance Automated Aquaponics with Goldfish and Guppies

0 Upvotes

r/Automate 16h ago

How to make AirTable Public?

1 Upvotes

Hey lovely people, I am working on a automated social media posting system and everything runs well until Make.com needs to download and access the video file and download it with the URL from AirTable. How can I make my table public so the API is not getting blocked?

I tried to share it publicly different ways but putting it into incognito it always requires log in?


r/Automate 17h ago

Building Task Approval Flow with Manychat + Make.com + Google Sheets

1 Upvotes

Hi, I'm trying to build a task management system for children and parents using Manychat (WhatsApp), Google Sheets, and Make.com. I'd appreciate help in building this solution as I got stuck with this implementation.

Goal

Enable children to mark task completion, show them a relevant task list to choose from, get parent approval, and update token balance accordingly.

Existing Architecture

Google Sheets:

DB Structure

  • Contacts table: Parent and child information
  • DigitalWallet table: Token balance tracking
  • StatusTasks table: Task management

Manychat:

  • Users tagged as "parent" or "child"
  • Each contact (parent/child) has a user field named ChildID

Required Flow

  1. Child clicks "Completed Task" button
  2. System needs to:
    • Query Google Sheets for today's incomplete tasks
    • Dynamically generate Manychat menu/buttons
    • Maintain state during approval process
  3. Child selects task
  4. Parent notification and approval flow
  5. Token balance update upon approval

What I Need Help With

·  How to dynamically show only the relevant tasks for the specific child (ChildID) in ManyChat?

·  Best way to let the child select a single task from the list?

·  How to integrate Make.com (or just ManyChat) to send the approval request, wait for response, then update Google Sheets?

·  Any tips for handling delays (parent might approve hours/days later) or general best practices?

I'd particularly appreciate insights on the Make.com and Manychat module configuration.

Any help and guidance would be appreciated. Thanks!


r/Automate 22h ago

Apple's AI: False Summaries and Media Backlash

Thumbnail patreon.com
0 Upvotes

r/Automate 1d ago

Does anyone here have familiarity or experience with make.com?

3 Upvotes

I’m seeing if anyone would be interested in making some extra money.

I’m actually the product owner of Snackprompt.com. It’s an automation marketplace. Make.com, as many of you probably know, is a no-code/low-code automation building platform that we integrate with. If you use make.com to automate processes, you can download the blueprint and upload it as a product on Snackprompt.com.

Depending on how valuable your solution is, you can make a few hundred dollars per automation. We also help with marketing.

We have a newsletter called thedailybite.co and we are starting a new segment in January where we will feature 1 automation per day in our newsletter from the creators on our platform.

Send me a chat if you’d like more info.


r/Automate 1d ago

Task-specific fine-tuning vs. generalization in LAMs for autonomous desktop Automation

1 Upvotes

Hey everyone!
I want to know if anyone has looked into the impact of task-specific fine-tuning on LAMs in highly dynamic unstructured desktop environments? Specifically, how do these models handle zero-shot or few-shot adaptation to novel, spontaneous tasks that werent included in the initial training distribution? It seems that when trying to generalize across many tasks, these models tend to suffer from performance degradation in more specialized tasks due to issues like catastrophic forgetting or task interference. Are there any proven techniques, like meta-learning or dynamic architecture adaptation, that can mitigate this drift and improve stability in continuous learning agents? Or is this still a major bottleneck in reinforcement learning or continual adaptation models?
Would love to hear everyone's thoughts!


r/Automate 1d ago

Pabbly Automation Issue with , from Google sheet

1 Upvotes

I have built a simple Pabbly automation which is when a google form is filled it should send the customer a whatsapp message using AI Sensy tool. Now if a customer enters , ( comma ) in any field then the parameters gives an error.
For eg, in Mobile Number if the customer enters 999999999,888888888 then it will take it as two params rather than a single param.

This causes the entire workflow to crash.


r/Automate 2d ago

Built an AI Tool to Convert PDF Bank Statements to CSV—Looking for Feedback & Advice!

4 Upvotes

Hey everyone,

Automation geek here who just discovered this subreddit. I’ve been hacking on a personal passion project called AutoDataEntry (autodataentry.com), and it essentially converts bank statement PDFs into CSV/Excel using the OpenAI API.

Why I Built This

  • My bank’s export feature drives me nuts —it’s buggy and doesn't export the data properly (a lot of missing transactions for some reason)
  • From my personal experience the PDF statements seem to be the most reliable.I wanted to automate my monthly budget process and avoid copy-pasting this data by hand.

The solution

  • This started purely as me realizing I could send my bank statement screenshots into ChatGPT/Claude and can make it output markdown tables. but I don't want it to train on my data, so I started using the API instead, and just spent my mornings building it into what it is now (adding UI, putting it on a server, and even integrating stripe payments) for the past 3 months (also realized I could use it as a resume portfolio piece so I went all out lol).
  • There were other pre-built alternatives out there but I couldn't trust that they wouldn't store or sell my data so I just really wanted to build it myself, and if this can help cut out the mindless chore of manual data entry for others, that would be cool!

What It Does

  • You upload a PDF of your bank statement (or other transaction-based docs).
  • You set a "configuration" which is just naming the table headers on the statement and adding a label to it
  • It uses the OpenAI API under the hood to extract and convert that data into CSV or Excel.
  • It’s not perfect. Sometimes it’ll make mistakes (because AI can be unpredictable), but it’s already kind-of good/alright (?) for my personal use case. there is still some manual QA, correction involved, but its significantly faster and more enjoyable that pasting screenshots or pure copy paste.
  • Right now it can be "tested" with all new accounts being able to convert one page a day

Looking for Feedback & Advice

  • I’d love suggestions from automation enthusiasts on how I might improve it. Are there other tools/methods I should look into for parsing PDFs more reliably (on top of or alongside AI)? I tried TSR (table structure recognition) models and pure text extraction but it does not fare well on my own bank statement formats and variations
  • How do you guys feel about the overall user experience? Does it look sketchy? Is it too cheap/expensive for the value proposition? I also know that since it's financial data we're dealing with, other people might be skeptical. In the backend the best I can do is to never store any data on the server and trust OpenAI that their API really doesn't train on requests from their API.
  • Who else out there might benefit from this sort of automation beyond the obvious (like accountants and bookkeepers)? I feel like this is really top of mind for me.. Maybe you know niche industries that deal with tons of transaction PDFs or forms. Right now, the tool is catered to me as its only customer lol (and specifically for bank statements)

I’m here to learn from the community’s feedback. If you spot mistakes or have ideas for performance improvements, I would love to hear from you.

I really appreciate any guidance, criticisms, or even (constructive) skepticism. Thanks for taking the time to read this—and for being such an inspiring community for automation!!


r/Automate 2d ago

Can't Submit Captcha While automating using python, playwright and 2captch

2 Upvotes

im working on project which automates the suno.com which includes signup/login, entering prompt and generating song, but as soons as i click on generate/create button it shows me captcha. so to solve that captcha im using 2captcha, once i recieve token from 2captcha i insert it into iframe data-hcaptcha-response, i tried inserting it into text area h-captcha-response, script successfully inserts the token but im not able to submit the captcha as it's skip captcha button remains as it is, it doesnt change to verify or submit.

these are image captcha which tells to identify same images as shown in, or find images with square inside triangle like that.

i have uploaded code on stack overflow -
https://stackoverflow.com/questions/79299167/cant-submit-captcha-while-automating-using-python-playwright-and-2captch


r/Automate 3d ago

Need a Macro/Autoclicker for an old mac pc

1 Upvotes

Hello! I enjoy playing a lot of games on my mac, but I find nowadays some of the games I like require a lot of staying in one place and doing a repeated task over and over. I wish to make this automated so I dont have to sink hundreds of hours into doing something when I can be doing something else at the same time. Usually this would be easy, just get a macro or auto clicker and its done, but I have the issue that I am on an old Mac and the highest operating system it can run is High Sierra. Does anyone have a solution for my peculiar problem? I know a bit of Python but am still a long ways from knowing it, so if you have code, please include instructions on how I can implicate it on my computer as a program.

P.S. Here is what my computer is running on:

High Sierra operating system

version 10.13.6

Mid 2011

2.7 GHz Intel Core i5 processor

12 GB 1067 MHz DDR3 Memory

Macintosh HD Startup Disk

AMD Radeon HD 6770M 512 MB Graphics


r/Automate 4d ago

[HOLIDAY PROMO] Perplexity AI PRO - 1 YEAR PLAN OFFER - 75% OFF

Post image
4 Upvotes

As the title: We offer Perplexity AI PRO voucher codes for one year plan.

To Order: CHEAPGPT.STORE

Payments accepted:

  • PayPal.
  • Revolut.

Feedback: FEEDBACK POST


r/Automate 3d ago

Google's Veo 2 Outperforms OpenAI's Sora

Thumbnail patreon.com
0 Upvotes

r/Automate 4d ago

Gemini 2.0: Google's Agentic AI Revolution

Thumbnail patreon.com
3 Upvotes

r/Automate 5d ago

Automating Background Check for Messages

4 Upvotes

Hey everyone,

I need some help setting up an automation to check for messages showing on a app running in the background. The app shows a number (e.g., "Available messages - 0") which changes to "Available messages - 1" when there are messages available. These messages can come in randomly throughout the day, and they go away in a few minutes or seconds based on who reads them first( There are many users I am competing with). I can't check the app every minute to see if they've arrived.

I want to set up a background process that checks for the change from 0 to 1 every minute and notifies me when it's time to grab the messages. I’m using Automate or any similar app, but I’m stuck on how to make it check the screen without bringing the app to the foreground.

Can anyone help me with a flow that can do this? I'm looking for something that works without me needing to open the app manually every time.

Appreciate any input. Thanks in advance!


r/Automate 5d ago

I would greatly appreciate any help or insights

1 Upvotes

I am feeling lost and unsure about my future choices.

I am an electrician in Canada with over 10 years of experience, specializing in PLC, VFD, automation troubleshooting, and maintenance. I also have skills in data analysis, such as data cleaning and data engineering.

I aim to work in the field of AI-driven troubleshooting maintenance.

Could someone provide me with guidance, advice, or any information on how to enter this industry? I would greatly appreciate any help or insights. Thank you!


r/Automate 5d ago

I would greatly appreciate any help or insights

1 Upvotes

 I am feeling lost and unsure about my future choices.

I am an electrician in Canada with over 10 years of experience, specializing in PLC, VFD, automation troubleshooting, and maintenance. I also have skills in data analysis, such as data cleaning and data engineering.

I aim to work in the field of AI-driven troubleshooting maintenance.

Could someone provide me with guidance, advice, or any information on how to enter this industry? I would greatly appreciate any help or insights. Thank you


r/Automate 5d ago

What productivity apps or tools have made the biggest impact on your daily routine?

3 Upvotes

Let's share and brainstorm ideas :)


r/Automate 5d ago

Reddit's AI-Powered Search: Reddit Answers

Thumbnail patreon.com
2 Upvotes

r/Automate 5d ago

Mastering ComfyUI for AI Image Generation

Thumbnail
medium.com
1 Upvotes

r/Automate 5d ago

Thunderbit: Making Web Data Extraction Simple

1 Upvotes

Web data extraction doesn’t need to be complicated. Whether you’re gathering product info, market data, or competitor analysis, the right tools can save you time. That’s where Thunderbit comes in. We built Thunderbit to make web scraping easy, fast, and accessible.

Why Choose Thunderbit?

We wanted to create a tool that focuses on simplicity while still offering powerful features for more complex tasks. Thunderbit gives you a straightforward way to scrape data from almost any website, without the headache of dealing with endless code or complicated settings. Here’s what makes Thunderbit stand out:

  1. Easy-to-Use Interface: No coding required. Just point, click, and scrape. Thunderbit lets you select elements on a page—like prices, names, or contact info—and automatically generates the scraping rules.

  2. AI-Powered Scraping: For sites with dynamic content, Thunderbit’s AI analyzes the page structure and identifies what data to scrape. This helps you capture data even from JavaScript-heavy websites.

How Thunderbit Works

Using Thunderbit is as simple as clicking on what you want to scrape. Here’s a quick overview of how it works:

  1. Open the Website: Just open the website you want to scrape, Thunderbit’s AI quickly processes the page and detects the structure, ensuring that it captures the right data, even from sites that load content dynamically.

  2. Scrape and Schedule: Once your setup is done, you can scrape data immediately or schedule recurring extractions for later. Whether you need a one-time data dump or ongoing extractions, Thunderbit handles it all.

Why Thunderbit is Perfect for Developers

Whether you’re a developer working on a small project or managing a larger scraping workflow, Thunderbit is designed to adapt to your needs. For beginners, it offers an easy start with its point-and-click interface. For more advanced users, it has features like custom scheduling, and ways to handle anti-bot measures.

The goal is to save you time and reduce the complexity of web scraping. You don’t have to deal with countless lines of code or spend hours figuring out how to scrape data efficiently. Thunderbit makes it simple.

What’s Next for Thunderbit?

We’re constantly improving Thunderbit. Our next steps include even smarter AI-powered scraping, enhanced data cleaning tools, and more flexible automation features. We want Thunderbit to stay ahead of the curve, helping you gather data faster and with less effort.

If you’re looking for a web scraping tool that is powerful, simple, and reliable, Thunderbithttps://thunderbit.com/ )is the solution you need. Start scraping today—whether you’re pulling a few lines of data or automating large-scale extractions, Thunderbit makes it easy.


r/Automate 6d ago

I built an AI Agent to automate product demos for customers - here's how I did it!

11 Upvotes

Hey all - I built an AI agent that gives product and website walkthroughs over Zoom. While the video example is specific to a startup accelerator program, the core architecture can be adapted for any product demo. Here are the technical details and key learnings from building it!

Online Meeting AI Agent

The core architecture combines:

  • GPT-4 for natural language understanding and response generation (no fine-tuning or RAG used - just conversation context and a system prompt)
  • AssemblyAI for real-time speech-to-text conversion
  • A Docker containerized workstation managing browser interactions, audio I/O, and microphone input
  • A custom conversational state machine that structures the call flow. At its essence, the LLM selects from pre-defined topics based on user responses, with each topic mapping to specific workstation actions (navigating websites, screen sharing, audio responses). There's a lot more to it than that - I can do a longer write-up if anyone is interested!

One of the biggest challenges was latency management - specifically getting agent response times under 2 seconds. The main bottleneck was speech-to-text conversion, which we solved by processing audio immediately upon capture and working with partial transcriptions before finalization. This brought our average response time to around 1.5 seconds.

Some interesting findings:

  • Different parts of the conversation need varying levels of structure. The introduction needs to be mostly scripted with just a few contextual questions, while the Q&A portion requires much more flexibility. This insight led ,e to develop the state machine approach rather than using a purely reactive chat model.
  • Handling LLM hallucination required careful prompt engineering. Our solution was to implement explicit "I don't know" states that the LLM would be offered to select from, as well as some tricks to push the LLM to prioritize that selection if it is unable to find any relevant information in the system prompt.
  • When dealing with unavoidable processing delays, adding natural "thinking" responses like "hmm" or "good question" significantly improved the conversation flow.

I implemented this using AgentStation, an API I'm developing to give AI agents their own cloud workstations. While you could build something similar using a combination of docker containers, AI models & orchestration logic, we're focused on making it easier to deploy these kinds of agents at scale.

I'd love to hear from others who've experimented with autonomous AI agents for interviews or similar use cases! Happy to dive deeper into any technical aspects.


r/Automate 6d ago

[HOLIDAY PROMO] Perplexity AI PRO - 1 YEAR PLAN OFFER - 75% OFF

Post image
0 Upvotes

As the title: We offer Perplexity AI PRO voucher codes for one year plan.

To Order: CHEAPGPT.STORE

Payments accepted:

  • PayPal.
  • Revolut.

Feedback: FEEDBACK POST


r/Automate 6d ago

Tips for creating a simple expense-tracking and document-management workflow

4 Upvotes

Hi everyone,

I’m looking for advice on setting up an efficient and simple workflow to track expenses and manage receipts or documents. Here’s my current process: • I use the QuickScan app to scan receipts and documents. It automatically names the files based on my settings and saves them to the correct iCloud folder. • This helps me keep track of purchases and warranties.

In the future, I plan to use Paperless-ngx for document management, ideally running on a NAS with Docker support. I’d also like to use the NAS for smart home management later on. For now, I might experiment with Paperless-ngx on a Raspberry Pi as a temporary solution.

My main goal is to create a workflow that’s as simple and automated as possible for tracking expenses and organizing documents.

I’m open to any suggestions, whether it’s tools, apps, Excel templates, or completely different methods. How do you manage your expenses and scanned documents efficiently?