====== A.I. Artificial Intelligence ====== Just a place to put a few notes on some A.I. software to try out for image generation. In a few years this will probably seem kind of quaint but at the time of this writing there are only a few apps that are reasonably easy to set up to do natural language image processing / generation. ===== Source of A.I. News / Info ===== * Streamtabulous - Krita and AI - https://www.youtube.com/watch?v=suwwW9eE5qI * Bob Doyle on Youtube, intros with a lot of image AI tools - https://www.youtube.com/@BobDoyleMedia * Pixovert doing A.I. imaging tools and techniques (comfy-ai and SDXL) https://www.youtube.com/@Pixovert/videos * Matt Wolfe does news roundups - https://www.youtube.com/@mreflow/videos ===== Tools & Findings ===== ==== Chat + Organizing Information ==== * Perplexity - https://www.perplexity.ai/ * Brave Browser Leo - https://brave.com/leo-release/ * DeepSeek - Not sure exactly but looks big - https://www.deepseek.com/ * ChatGPT - https://chatgpt.com/ * Google Notebook LM - https://notebooklm.google/ * DeepAI Chat - https://deepai.org/chat * Venice AI - https://venice.ai/chat/ * Gab AI - https://gab.ai/login * Google Gemini - https://deepmind.google/technologies/gemini/#introduction * Affine - https://affine.pro/ai * Morpheos - p2p - https://mor.org/about * Undetectable AI - https://undetectable.ai/ * GPT4ALL-Run Local LLMs - https://www.nomic.ai/gpt4all * HuggingFace Chat - https://huggingface.co/chat/ * DeepL writing and translation - https://www.deepl.com/en/translator * Descript creates script from audio or video - https://www.descript.com/ * ONIT - Desktop A.I. - https://github.com/synth-inc/onit?tab=readme-ov-file * Chatbox - Desktop AI - cross-platform - https://github.com/Bin-Huang/chatbox * GPTME - https://github.com/ErikBjare/gptme * A.I. Mock Interviews - https://prepin.ai/ ==== A.I. Agents ==== * Eigent Multi-Agent Open source local - https://github.com/eigent-ai/eigent * Huginn - Open source ai agent - https://github.com/huginn/huginn * Proxy by convergence - https://convergence.ai/ * OpenAI Operator * Skyvern - https://github.com/Skyvern-AI/skyvern ==== Code / Programming Assistance ==== * Project and Process - https://www.pneumatic.app/ * Vercel v0 - Prototyping and dev - https://v0.dev/ * Cody - VSCode - https://sourcegraph.com/cody | https://github.com/sourcegraph/cody * OpenAI Canvas - https://openai.com/index/introducing-canvas/ * Phind - For developers - https://www.phind.com/about * Gitlab Duo - https://about.gitlab.com/gitlab-duo/ * Codeium - https://codeium.com/ * Cursor (non-free) - https://www.cursor.com/ * Continue - https://www.continue.dev/ * MicroAgent - https://github.com/BuilderIO/micro-agent ==== UX and Design (non-photo stuff) ==== * Paper.design - https://paper.design/pricing * Pencil.dev UX drawing with MCP - https://www.pencil.dev/ * Quant UX - https://quant-ux.com/ * Graphite - https://graphite.art/ * Grida - https://grida.co/ * Anima - https://www.animaapp.com/ * Open Pencil - https://openpencil.dev/ (opens figman files) * TL:Draw - https://www.youtube.com/watch?v=1C2TdPkj6aQ * Lunacy - https://icons8.com/lunacy * UIzard - https://uizard.io/ * Galileo - http://UseGalileo.ai * Fronty - Convert IMG to UI - https://fronty.com/ * More? - https://www.interaction-design.org/master-classes/ai-powered-ux-design-how-to-elevate-your-ux-career * More? - https://chatuxd.gumroad.com/l/aitooldatabase * Claude was better at extracing PDF data to csv - https://martinklepsch.org/posts/pdf-to-csv-with-gemini-and-claude ==== Image Generation ==== * Ideogram - https://ideogram.ai/login * Flux AI Playground - https://playground.bfl.ai/image/edit * VisionFX - https://visionfx.ai/ai-image-generator * Black Forest Labs Flux.1 Kontext - https://bfl.ai/models/flux-kontext * Napkin AI - Generates Diagrams - https://www.napkin.ai/ * Stable Diffusion - https://stability.ai/ * Stable Diffusion Reddit - https://www.reddit.com/r/StableDiffusion/ * Blender connected app (several gigabytes of space) * MidJourney - Commercial - https://www.midjourney.com/ * DALL-E - formerly Craiyon - https://en.wikipedia.org/wiki/DALL-E * NVIDIA Canvas - https://www.nvidia.com/en-us/studio/canvas/ * Adobe A.I. - https://www.youtube.com/watch?v=pp_9cGrT_Is * Open CV - https://www.opencv.ai/ * Caktus AI - Education - https://www.caktus.ai/ * Canva AI - https://www.canva.com/ai-image-generator/ * LeonardoAI - commercial - https://leonardo.ai/ * Huggingface - https://huggingface.co/spaces * AI Comic Factory - https://huggingface.co/spaces/jbilcke-hf/ai-comic-factory * Github Co-pilot inside VSCode - https://code.visualstudio.com/docs/editor/github-copilot * Webstorm AI assistant - https://www.jetbrains.com/help/webstorm/ai-assistant.html#ai-assistant-features * Virtual Teammate inside Atlassian Jira - https://www.atlassian.com/software/artificial-intelligence * GIMP AI background remover - https://www.youtube.com/watch?v=A0pCWJra4uA * Stable diffusion inside Gimp - https://www.youtube.com/watch?v=4IuIKe1sEFY * Topaz AI for images and video - https://www.topazlabs.com/shop * VisionFX for Paintshop Pro - https://www.paintshoppro.com/en/products/plugins/vision-fx/ * Krita AI Diffusion - https://github.com/Acly/krita-ai-diffusion * GANbreeder - Mixes photos with AI - https://www.joelsimon.net/ganbreeder.html * DeepDream - dreamy fantasy tool - cool but meh - https://deepdreamgenerator.com/ * AI Painter - https://www.fotor.com/features/ai-painter/ * Adobe Fresco * Mistral AI - Models - https://mistral.ai/ * Prisma - Phone App - https://prisma-ai.com/prisma * AISEO Art - https://art.aiseo.ai/ * ComfyUI - https://github.com/ComfyUI-Workflow/ComfyUI-OpenAI * OmniGen - https://github.com/VectorSpaceLab/OmniGen * ReCraft - https://www.recraft.ai/ ===== I want to do X type Graphic ===== One of the most important things to be able to do in A.I. age with image generation is to know more often than not what tool is the best choice to do a certain task. Perhaps a list of tasks and the tool assocaited is in order. This may become unwieldy after a while but you have to start somewhere. For example "I want to make studio Ghibli", or "3D craft stop motion", "Lego movie", "Summarize Content", "Art Deco", "Diagrams", "UX Design Mockup", "Page Layhout", "Renaissance Painting", "Fill in areas here" There may be a single tool that's better accepting uploads. There may be a better prompt builder which is what might be done locally. Also a tool like Krita could be the one to run locally. And it'd be good to know how good or similar certain tools are. If Grok and Gab can bother generate images. What's best UI? * A.I. Pattern Generator? * A.I. 3D model generator? - Blender to Claude - https://www.youtube.com/watch?v=r7H60u0kHRA * A.I. Create from simple sketch? * A.I. Create User Interface Wireframe? * A.I. Create Landscapes? * A.I. Create Maps? * A.I. Create? * A.I. Create Sketch Drawing? * A.I. Create Biomes? * A.I. Ghibli Effect? * A.I. Renaissance Painting? * A.I. Sculpture? * A.I. Morph or Morph Animation? * A.I. Face imposition * A.I. Swap face, aging, de-aging ==== Image Restoration ==== * Upscaler software - https://gitlab.com/TheEvilSkeleton/Upscaler * Flathub - https://flathub.org/apps/details/io.gitlab.theevilskeleton.Upscaler * Upscale.media - https://www.upscale.media/ * Orig libraries - https://github.com/xinntao/Real-ESRGAN-ncnn-vulkan * https://github.com/upscayl/upscayl * Krita AI Generative Fill - https://github.com/Acly/krita-ai-diffusion * https://www.youtube.com/watch?v=tDWzjx68KUI * GIMP ML (machine learning) - https://kritiksoman.github.io/GIMP-ML-Docs/docs-page.html#section-1 * Icons8 Upscaler (makers of lunacy) - https://icons8.com/upscaler * Waifu2x upscaler for Anime - http://waifu2x.udp.jp/index.html * https://image-upscaler.com/ * Anime4k - https://github.com/bloc97/Anime4K * SRGAN - Super Resolution - https://github.com/tensorlayer/srgan * Image Super Resolution - https://github.com/idealo/image-super-resolution ==== Audio, Text-to-Speech, Voice Cloning ==== * ElevenLabs - create voice clones + multitrack audio - https://elevenlabs.io/ * InvideoAI - https://invideo.io/tools/ai-voice-cloning/ * Synthesia - https://www.synthesia.io/features/ai-voice-generator * BigVU - https://bigvu.tv/voice-generator/ai-voice-cloning * What is voice cloning? - https://podcastle.ai/blog/what-is-voice-cloning/ * Suno - Text prompt to background music - http://suno.com * UDIO ==== A.I. Companions ==== Lot's of work being done and people discussing A.I. companions. This is a sad near-reality. A.I. assistants would be much better to help people without replacing real people. But there's really no stopping the rise of A.I. relationships. A trajectory towards more interaction in the physical world is likely a great idea. Also there will be a great need for more counseling of people by professionals to help them ween off simulations and addictive dopamine tech. ===== Hardware / Cameras ===== * Arducam for Raspberry Pi 5 - https://www.arducam.com/product/presalesarducam-pinsight-12mp-vision-ai-mate-for-raspberry-pi-5/ * Luxonis Robotic AI Cameras - https://www.luxonis.com/ * NVidia NanoJet Board - https://developer.nvidia.com/embedded/jetson-nano-developer-kit ===== Video Production ===== * HeyGen - Creating AI Video Avatar - http://www.HeyGen.com * Wondershare Filmora - https://filmora.wondershare.com/ * AI features - https://filmora.wondershare.com/filmora-features.html#ai * https://www.youtube.com/watch?v=bOw5RwlZHl4 * Invideo AI - https://invideo.io/ * OpusClip - https://www.opus.pro/- Change longform to shortform * Sora - text to video generation - http://sora.com ===== Pinokio ===== A launcher for all sorts of locally installed AI modules. * https://pinokio.computer/ * https://www.youtube.com/watch?v=VXiyhl0c1gI * Text Language module (see above video) - https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca ===== Next Cloud AI Assistant ===== * https://nextcloud.com/blog/first-open-source-ai-assistant/ * https://www.youtube.com/watch?v=9_qnW5Z2xfs * https://nextcloud.com/blog/ai-in-nextcloud-what-why-and-how/ ===== Stable Diffusion ===== Deep learning text-to-image diffusion model capable to generate imagery. Stable Diffusion's code and model weights have been released publicly and can be run on most consumer hardware equipped with GPU with 8GB of VRAM. The user owns the rights to their generated output images, and is free to use them commercially. * Stable Diffusion GIMP - https://www.youtube.com/watch?v=XjBhqogM8Co * Wikipedia article - https://en.wikipedia.org/wiki/Stable_Diffusion * Main Site: https://stability.ai/ * Main Blog: https://stability.ai/blog/ * Github - https://github.com/CompVis/stable-diffusion * Stable Diffusion for Krita - https://www.reddit.com/r/StableDiffusion/comments/x37dtp/krita_addon_with_inpainting_image2image_and/ * Stable Diffusion Krita Demonstrated - https://www.youtube.com/watch?v=-1iKI_yxsLo ===== Prompts to Learn like a Genius ===== * https://www.youtube.com/watch?v=TPLPpz6dD3A * Retrieval Practice is the best form of learning * And interacting with the information * Get A.I. to force you to think about something and test you * Substack: Assigning AI: Seven Ways of using AI in Class * https://www.oneusefulthing.org/p/assigning-ai-seven-ways-of-using Act as a Socratic tutor and help me understand the concept of momentum in physics. Ask me questions to guide my understanding. Can you explain momentum using everyday analogies and provide some real-life examples? Create a set of practice questions about momentum ranging from basic to advanced levels. Give me a list of 20 key terms in this paper and break it into five categories. Make a list of propositions in this text in the format "X is a type of Y", "W is caused by X", "A explains B". Put it into a table with columns. ==== Papers about using A.I. for learning ==== * https://www.ox.ac.uk/students/academic/guidance/skills/ai-study * https://www.oneusefulthing.org/p/assigning-ai-seven-ways-of-using * https://www.sciencedirect.com/science/article/abs/pii/S0099133323000599?via%3Dihub * https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4391243 ===== Bloom's Taxonomy of Learning ===== * Remember * Understand * Apply * Analyze * Evaluate * Create ===== Roundup of various A.I. Apps (quickly will get outdated) ===== {{:pasted:20250113-051520.png}} ===== Can you have AI render a dream? ===== {{:pasted:20231227-000142.png}} {{:pasted:20231227-001657.png}}