• AI
  • Architecture
  • Article
  • Automation
    • Browser-Use
    • ViewComfy
    • OpenManus
    • RVT-GPT
  • Text-to-Image
    • ComfyUI
    • Midjourney
    • DALL·E 2
  • Text-to-Video
    • Luma
    • RunwayML
  • LLM`s
    • Anthropic Claude 2.5 API
    • ChatGPT API
    • Ollama (local models)
Home
  • Flux Kontext + Ollama in ComfyUI

    27/06/2025

    •

    AI, ComfyUI, Flux, Workflow

    I designed a lightweight LLM-powered workflow to improve consistency and quality in image prompting. The process begins with Microsoft’s Florence 2 vision model, which analyzes and describes the content of the input image. This description is then combined with user input using Ollama to create a richer, more context-aware prompt. Finally, the refined prompt is…

    Read This Post: Flux Kontext + Ollama in ComfyUI
  • Simple ComfyUI Web App Using ViewComfy and Ngrok.

    27/03/2025

    •

    AI, Architecture, ComfyUI, Flux, Lora, Ngrok, ViewComfy

    I’ve created a simple locally run ComfyUI web app using ViewComfy and Ngrok.

    Read This Post: Simple ComfyUI Web App Using ViewComfy and Ngrok.
  • Replicate Anything with ControlNet, IPAdapter, and Florence2 CLIP Encoder in ComfyUI.

    27/03/2025

    •

    AI, ComfyUI, Flux, Video

    In this project, I used a custom SDXL checkpoint, ControlNet with DepthAnythingV2, IPAdapter with style transfer, and the Florence2 open-source vision model to replicate the original photo. By integrating these three methods, the final image aligns with the original in terms of style, depth map, and descriptive prompt. How It Works𝐂𝐨𝐧𝐭𝐫𝐨𝐥𝐍𝐞𝐭 𝐰𝐢𝐭𝐡 𝐃𝐞𝐩𝐭𝐡𝐀𝐧𝐲𝐭𝐡𝐢𝐧𝐠: This tool…

    Read This Post: Replicate Anything with ControlNet, IPAdapter, and Florence2 CLIP Encoder in ComfyUI.
  • Living Room Study with Flux-Dev (NF4) in ComfyUI.

    27/03/2025

    •

    AI, Architecture, ComfyUI, Flux, Video

    Read This Post: Living Room Study with Flux-Dev (NF4) in ComfyUI.
  • Smart Flux Inpainting Workflow With Automated Visual Recognition and Masking in ComfyUI.

    27/03/2025

    •

    AI, Architecture, ComfyUI, ControlNet, Flux, LLM, Video

    In this workflow, I used Florence and Segment Anything vision model to automatically recognise elements in the image and create masking layer based on the text prompt. I applied SEGS Detailer and Flux to complete the inpainting.

    Read This Post: Smart Flux Inpainting Workflow With Automated Visual Recognition and Masking in ComfyUI.
  • Virtual Reality Flux Workflow With ComfyUI + Kuula.

    27/03/2025

    •

    AI, Architecture, ComfyUI, Flux, Kuula, Video, VR

    Link to VR tour in Kuula: https://kuula.co/share/collection/7ZlgS

    Read This Post: Virtual Reality Flux Workflow With ComfyUI + Kuula.
  • Mystical worlds with Flux NEON-MIST LoRa in ComfyUI and RunwayML.

    27/03/2025

    •

    AI, ComfyUI, Flux, Lora, RunwayML, Video, Workflow

    Text-to-image with Flux1 dev and neon-mist LoRa.Image-to-video with Runway GEN-3 Alpha Turbo.

    Read This Post: Mystical worlds with Flux NEON-MIST LoRa in ComfyUI and RunwayML.
  • LLM Generated HTML Webpages with OpenManus.

    27/03/2025

    •

    AI, Anthropic Claude 2.5, Automation, Browser-Use, HTML, LLM, OpenManus, Qwen2.5, Video

    I used OpenManus, an open-source, general-purpose autonomous AI project, alongside the Anthropic Claude 2.5 LLM model to generate responsive HTML webpages from simple prompts.

    Read This Post: LLM Generated HTML Webpages with OpenManus.
  • Locally Run LLM With Browser Use.

    27/03/2025

    •

    AI, Automation, Browser-Use, LLM, Qwen2.5, Video

    Integrating the locally run Qwen2.5:7b language model with Browser-Use through Ollama enables efficient automation of browser tasks while maintaining data privacy. While the integration offers numerous advantages, it’s important to note that smaller models may occasionally produce incorrect output structures, leading to parsing errors. To maximize Browser-Use’s potential, I recommend using ChatGPT-4o API support.

    Read This Post: Locally Run LLM With Browser Use.
  • Browser Use With ChatGPT-4o API.

    27/03/2025

    •

    AI, Browser-Use, ChatGPT, LLM, Video

    With Browser Use running locally in Docker and powered by the ChatGPT-4o API, I’m exploring how AI can automate everyday tasks using computer vision and reasoning.

    Read This Post: Browser Use With ChatGPT-4o API.
1 2 3 … 11
Next Page

Personal Blog By Marcel Žnidarič

Here, I document my experiments with Automatic1111, ComfyUI, Midjourney, Flux, Stable Diffusion, Runway, ChatGPT API, LLM`s, automation, and much, much more.

Discover, learn, and innovate along the way!

Get in touch

Manage Consent
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
Functional Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
Manage options Manage services Manage {vendor_count} vendors Read more about these purposes
View preferences
{title} {title} {title}