<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Docker on Zero-Entry</title>
    <link>https://zero-entry.co.za/tags/docker/</link>
    <description>Recent content in Docker on Zero-Entry</description>
    <generator>Hugo -- 0.147.7</generator>
    <language>en-us</language>
    <lastBuildDate>Fri, 24 Apr 2026 14:26:40 +0400</lastBuildDate>
    <atom:link href="https://zero-entry.co.za/tags/docker/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>30 Days of a Honeypot at Home</title>
      <link>https://zero-entry.co.za/posts/30-days-of-a-honeypot-at-home/</link>
      <pubDate>Sat, 18 Apr 2026 20:30:00 +0200</pubDate>
      <guid>https://zero-entry.co.za/posts/30-days-of-a-honeypot-at-home/</guid>
      <description>Standing up T-Pot on a segmented VLAN behind OPNsense, opening a curated set of ports to a residential IP, and writing down what 30 days of unfiltered internet traffic actually looked like.</description>
      <content:encoded><![CDATA[<p>I finally got around to putting a honeypot on the public side of my home connection. I wasn&rsquo;t trying to catch APTs. I wanted to see what hits a random residential IP when nothing is hiding it.</p>
<p>This is a notes post about standing it up, how it&rsquo;s contained, and what actually showed up in the logs after a month.</p>
<h2 id="why-bother">Why bother</h2>
<p>Most threat intelligence I read describes the internet as a battlefield. Every unpatched device is five minutes from compromise. Every IP gets 30,000 probes a day. The numbers are usually correct. They aren&rsquo;t useful unless you can map them to what your environment looks like.</p>
<p>I wanted my own baseline. Not a vendor&rsquo;s feed, not an aggregated report. What does my ISP-assigned IP attract, right now, and what does the traffic look like when you strip out the marketing spin.</p>
<p>Secondary reason: I wanted to segment the network, and a honeypot is a good forcing function. My homelab had been flat for too long. Nothing makes you VLAN a network faster than hanging something deliberately exposed off it.</p>
<h2 id="threat-model-and-ground-rules">Threat model and ground rules</h2>
<p>Before anything went online, I wrote down what I was willing to tolerate and what I wasn&rsquo;t.</p>
<p>Willing to accept:</p>
<ul>
<li>My IP appearing on scanning blocklists.</li>
<li>My ISP sending me a polite note. (They never did.)</li>
<li>Getting buried in logs I&rsquo;d then have to process.</li>
</ul>
<p>Not willing to accept:</p>
<ul>
<li>Anything the honeypot attracts touching my real LAN.</li>
<li>A honeypot compromise turning into a pivot into anything else I own.</li>
<li>Outbound traffic that looks like I&rsquo;m participating in someone else&rsquo;s botnet.</li>
</ul>
<p>Those three constraints drove every architectural decision that followed.</p>
<h2 id="architecture">Architecture</h2>
<p>The honeypot runs as a single VM on a secondary host, isolated on its own VLAN behind OPNsense. It has one purpose, no shared storage, no shared credentials, and nothing legitimate behind it. If it gets popped, I wipe the VM from snapshot and start again.</p>
<p>The physical picture:</p>
<ul>
<li><strong>Honey-VM</strong>: 4 vCPU, 8 GB RAM, 60 GB disk. Ubuntu 22.04 base, T-Pot on top.</li>
<li><strong>VLAN 66</strong>: dedicated &ldquo;DMZ-lite&rdquo;. No inter-VLAN access. DHCP scoped tight.</li>
<li><strong>OPNsense</strong>: port-forwards a curated set of TCP ports from WAN to the honey-VM.</li>
<li><strong>Suricata</strong> on the OPNsense WAN interface logs everything hitting those ports, independent of what the honeypot itself sees.</li>
<li><strong>Outbound rules on VLAN 66</strong>: no outbound except to a specific syslog collector and to Cloudflare DoH for DNS. No SSH out, no SMB out, no SMTP out, no arbitrary outbound anything.</li>
</ul>
<p>The last point is the one that matters most. A honeypot with open outbound is a honeypot that can participate in the abuse you&rsquo;re trying to study. If Cowrie accepts a shell and the intruder tries to <code>curl</code> a second-stage payload, they should fail at the network, not at the endpoint.</p>
<p>Ports exposed to the internet: 22, 23, 80, 443, 445, 1433, 2222, 3306, 3389, 5060, 5900, 8080. Nothing else.</p>
<h2 id="stack">Stack</h2>
<p>I used T-Pot for the heavy lifting. It&rsquo;s maintained, it aggregates sensible honeypots, and its dashboards are ready out of the box. The components that mattered for me:</p>
<ul>
<li><strong>Cowrie</strong> on 22 and 2222. SSH and Telnet. Logs full session transcripts.</li>
<li><strong>Dionaea</strong> on 445, 1433, 3306, 5060. Protocol emulation for SMB, MSSQL, MySQL, SIP.</li>
<li><strong>Heralding</strong> on anything else that smells like a login prompt.</li>
<li><strong>Honeytrap</strong> as the generic catch-all TCP listener.</li>
<li><strong>Snare / Tanner</strong> serving HTTP decoy content on 80 and 8080.</li>
<li><strong>Elasticsearch + Kibana</strong> for the dashboards, with data shipped out to my own Loki instance as a backup.</li>
</ul>
<p>T-Pot&rsquo;s internal firewalling is fine, but I don&rsquo;t rely on it. The OPNsense rules are the real enforcement. If T-Pot broke tomorrow, nothing on VLAN 66 would suddenly start talking to my real network.</p>
<h2 id="standing-it-up">Standing it up</h2>
<p>Provisioning was uneventful once the segmentation was in place.</p>
<div class="highlight"><pre tabindex="0" class="chroma"><code class="language-bash" data-lang="bash"><span class="line"><span class="cl"><span class="c1"># T-Pot install on a fresh Ubuntu 22.04</span>
</span></span><span class="line"><span class="cl">env bash -c <span class="s2">&#34;</span><span class="k">$(</span>curl -sL https://ghst.ly/tpot-install<span class="k">)</span><span class="s2">&#34;</span>
</span></span></code></pre></div><p>The installer handles Docker, pulls the containers, and wires up the reverse proxy for the Kibana side. The one thing I changed was restricting the admin web interface to the management VLAN, reachable only over WireGuard.</p>
<p>SSHD for actual host management got moved to a non-standard port on the management interface. Cowrie owns 22 on the WAN side.</p>
<p>Before opening the firewall, I ran a full scan from an external VPS against the advertised ports to make sure only the intended services responded, and that the banners looked realistic enough to not scream &ldquo;honeypot&rdquo; on the first handshake. A couple of Cowrie defaults were too obvious (the default SSH version string, for one) and needed tweaking. Anyone running OpenCanary fingerprints or Shodan&rsquo;s honeypot detector will still figure it out. Most bots won&rsquo;t.</p>
<h2 id="first-24-hours">First 24 hours</h2>
<p>I expected a slow ramp while DNS caches and scanners noticed the IP. That wasn&rsquo;t the experience.</p>
<p>Within 12 minutes of opening port 22, the first SSH login attempt came in. Inside the first hour: 340 SSH attempts from 47 unique source IPs. By the end of the first day: 8,200 SSH attempts, 1,100 HTTP requests, and around 60 SMB connections.</p>
<p>There is no onboarding period for a public IP. You&rsquo;re in the database already. Opening a port just tells the scanners that something is finally listening.</p>
<h2 id="what-30-days-actually-looked-like">What 30 days actually looked like</h2>
<p>Rough totals at the 30-day mark (Suricata plus T-Pot aggregated):</p>
<ul>
<li><strong>SSH and Telnet attempts</strong>: 412,000 across 22 and 2222, from 14,300 unique source IPs.</li>
<li><strong>HTTP requests to honeypot web roots</strong>: 28,400. Mostly scanner fingerprints and path probes.</li>
<li><strong>SMB connections</strong>: 3,900.</li>
<li><strong>MSSQL login attempts</strong>: 1,100.</li>
<li><strong>RDP connections</strong>: 9,700, almost all from a handful of subnets running NLA probes.</li>
<li><strong>SIP INVITE floods</strong>: two distinct campaigns, one targeting Asterisk defaults, one targeting a specific FreePBX module.</li>
</ul>
<p>Geography is the least interesting dimension and the one vendors love to lead with. Source IPs spread across 90+ countries. That maps to compromised hosts, not operator location. Treating a GeoIP heatmap as a map of threat actors is a mistake.</p>
<p>The credential side is more useful. Top ten SSH username/password combinations over the 30 days, in order:</p>
<ol>
<li><code>root</code> / <code>root</code></li>
<li><code>admin</code> / <code>admin</code></li>
<li><code>root</code> / <code>123456</code></li>
<li><code>root</code> / <code>password</code></li>
<li><code>admin</code> / <code>password</code></li>
<li><code>user</code> / <code>user</code></li>
<li><code>ubnt</code> / <code>ubnt</code></li>
<li><code>pi</code> / <code>raspberry</code></li>
<li><code>root</code> / <code>1234</code></li>
<li><code>support</code> / <code>support</code></li>
</ol>
<p>None of those will surprise anyone who&rsquo;s spent time on this. They confirm what the big honeypot operators have been saying for years: the low end of the attack surface is stuck on the same dictionary it&rsquo;s been stuck on for a decade, because it keeps working.</p>
<h2 id="what-the-payloads-looked-like">What the payloads looked like</h2>
<p>Cowrie logs full session transcripts, which is the part worth reading. Patterns I saw repeatedly:</p>
<ul>
<li><code>uname -a; cat /proc/cpuinfo; free -m</code> as environment fingerprinting before any payload drop. The bot wants to know whether it landed on an ARM router, a MIPS camera, or an x86 box, so it can pull the right binary.</li>
<li>A <code>wget</code> or <code>curl</code> chain pointing at a staging server, usually on port 80 over a direct IP with no DNS. Almost always a short-lived URL, dead within days.</li>
<li>A <code>chmod 777</code> on the downloaded binary, followed by <code>./&lt;binary&gt;</code> and a quick <code>rm</code> to clean up.</li>
<li>Busybox-style commands with shell tricks to survive minimal environments.</li>
</ul>
<p>Three payloads I captured and detonated in an isolated environment later:</p>
<ul>
<li>A Mirai variant targeting MIPS and ARM, with the usual hardcoded C2 list.</li>
<li>A Monero miner compiled for x86_64 with an embedded pool address and worker ID.</li>
<li>An XorDDoS dropper with the &ldquo;encrypted&rdquo; strings still trivially XOR-decodable against a one-byte key.</li>
</ul>
<p>Nothing novel. That&rsquo;s the point. The mass of the internet&rsquo;s attack noise is bots spraying five-year-old payloads at anything that looks like a vulnerable edge device. A residential IP running no listening services would never see this traffic because the TCP connections would simply RST. Exposing ports makes you legible to the layer that scans for this kind of target.</p>
<h2 id="the-quieter-more-interesting-traffic">The quieter, more interesting traffic</h2>
<p>Once you filter out the SSH brute force floor, and it is a floor of roughly 300 to 600 attempts per hour, the rest gets more varied.</p>
<p>Log4Shell probes still show up on HTTP. More than two years after the advisory, JNDI probes are a standing wave. Most point at self-hosted Burp collaborators or long-dead VPS callbacks. Somebody, somewhere, is still paying for a scanner that fires these shots and never checks whether anyone answered.</p>
<p>A handful of requests were clearly scripted against specific CVEs:</p>
<ul>
<li>GPON router authentication bypass (CVE-2018-10561 / 10562). Still hitting in 2026.</li>
<li>Various Ivanti and Fortinet path traversals.</li>
<li>Confluence OGNL injection strings.</li>
<li>Generic WordPress <code>xmlrpc.php</code> pingback fishing.</li>
</ul>
<p>The unusual one: a small cluster of requests that tried to negotiate TLS with an SNI matching a real banking domain. No credentials, no follow-up. Probably a scanner doing inventory for cert transparency lookups. Possibly something less innocent. I don&rsquo;t have enough data to tell, and that&rsquo;s the honest answer.</p>
<h2 id="operational-reality">Operational reality</h2>
<p>A honeypot is not a set-and-forget box. In the first week I burned an evening chasing false positives and another fixing log rotation before a partition filled. The real costs:</p>
<ul>
<li>Disk grows fast. Cowrie session logs plus Elasticsearch indices ate about 18 GB in 30 days. That&rsquo;s cheap, but it isn&rsquo;t free.</li>
<li>Containers drift. Watchtower handles the weekly pull. I still review breaking changes in the T-Pot release notes before merging.</li>
<li>Elasticsearch is memory-hungry and will swap itself into uselessness if you underprovision. 8 GB is the practical floor.</li>
<li>Alerting is where this gets useful. A 500-per-hour SSH baseline is noise. A successful Cowrie shell that persists for more than 30 seconds, or any outbound hit blocked at the OPNsense rule, is signal. Those page me. Everything else goes to a dashboard I check when I feel like it.</li>
</ul>
<p>The alerting config is where most of my ongoing time goes. Without it, the whole thing is a pretty dashboard.</p>
<h2 id="what-id-change">What I&rsquo;d change</h2>
<p>A few things I&rsquo;ll do on the next iteration:</p>
<p>Split the honeypot across two IPs. Run Cowrie on one, everything else on the other. The SSH noise crowds the indices and makes queries slower than they need to be.</p>
<p>Move the dashboards off the honey-VM entirely. Shipping to an external Loki instance is already half the work. The remaining Kibana stack on-box is there for convenience, not necessity.</p>
<p>Add a second outbound-blocking layer inside the VM itself. Defence in depth against a container escape I&rsquo;m still not fully satisfied with.</p>
<p>Log rolling PCAPs on a 7-day window. Right now I only have what Suricata and the honeypots chose to log. Full packet captures would let me revisit sessions I under-investigated at the time.</p>
<h2 id="does-this-change-what-i-do-at-work">Does this change what I do at work</h2>
<p>Partially. It doesn&rsquo;t change how I think about APT-level adversaries. Nothing I saw in 30 days would strain a reasonable environment.</p>
<p>What it changes is how I talk about the baseline. The background radiation of the internet is real, it&rsquo;s measurable, and it doesn&rsquo;t stop. Any machine with an unpatched edge service survives hours, not days. Any default credential on a public interface is already compromised. You just haven&rsquo;t noticed yet.</p>
<p>That isn&rsquo;t a marketing line. That&rsquo;s 412,000 login attempts across 30 days on one residential IP running an obvious honeypot.</p>
<h2 id="closing">Closing</h2>
<p>Segment first. Then break something on purpose, in a place where it can&rsquo;t reach anything you care about. The logs that come back are more honest than any vendor report.</p>
]]></content:encoded>
    </item>
    <item>
      <title>Why I Still Run My Own Infrastructure at Home</title>
      <link>https://zero-entry.co.za/posts/why-i-still-run-my-own-infrastructure-at-home/</link>
      <pubDate>Sun, 11 Jan 2026 20:30:00 +0200</pubDate>
      <guid>https://zero-entry.co.za/posts/why-i-still-run-my-own-infrastructure-at-home/</guid>
      <description>A walkthrough of a self-hosted homelab built around OPNsense, Docker, and deliberate design choices — media, monitoring, VPN containment, and the lessons it took to get there.</description>
      <content:encoded><![CDATA[<h1 id="home-lab-overview-alecto-and-friends">Home Lab Overview: Alecto and Friends</h1>
<p>I&rsquo;ve always enjoyed tinkering with operating systems and finding ways they improve day-to-day life. I&rsquo;m not a cloud hater. Cloud services are useful and I still use them. I self-host because it&rsquo;s fun.</p>
<p>With most SaaS tools, you&rsquo;re limited by design choices you had no part in. My biggest self-hosted system is a Plex machine. I watch what I want, how I want, for roughly the cost of electricity. There&rsquo;s also been a serious learning component: networking, security, general IT practice. That alone has made it worth running.</p>
<h2 id="topology">Topology</h2>
<p>Starting at the internet edge and working inward:</p>
<p><strong>Router / Firewall</strong> I settled on OPNsense. It met and exceeded what I needed. The box runs intrusion detection, Unbound DNS, and a handful of other security-focused services.</p>
<p><strong>Switching</strong> Traffic hits a 24-port unmanaged gigabit switch with SFP ports. Nothing exotic, but most ports are in use.</p>
<p><strong>Flat Network</strong> The network is currently flat, so traffic flows directly to access points, servers, Raspberry Pis, NVR systems, gaming consoles, and everything else.</p>
<p>The topology is simple. The interesting part is what the devices are doing, not how complex the diagram looks.</p>
<h2 id="core-infrastructure">Core Infrastructure</h2>
<h3 id="alecto">Alecto</h3>
<p><strong>Hardware</strong></p>
<ul>
<li>Ryzen 7 1700X</li>
<li>1 TB NVMe</li>
<li>4 TB HDD</li>
<li>GTX 1050 Ti</li>
</ul>
<p>Nothing exotic, but it handles everything I need with around 85% idle time.</p>
<p><strong>Software</strong> Ubuntu LTS as the host OS, Docker for everything else: media acquisition, media consumption, networking, local services, metrics, and automation.</p>
<p>Docker makes backing up and restoring critical services significantly easier, which is the main reason I keep everything containerised.</p>
<h2 id="services">Services</h2>
<h3 id="media-acquisition">Media Acquisition</h3>
<p>The pipeline follows a simple request, acquire, process, library chain.</p>
<ul>
<li>Overseerr</li>
<li>Prowlarr</li>
<li>Sonarr / Radarr</li>
<li>qBittorrent</li>
<li>Deluge</li>
<li>Unpackerr</li>
<li>cross-seed</li>
</ul>
<p><strong>Prowlarr</strong> manages indexers. Private trackers have far less fake or malicious content than public ones, which matters later in the chain.</p>
<p><strong>Sonarr</strong> and <strong>Radarr</strong> handle TV and movies. Quality profiles are simple: HD and 4K. That has covered everything so far. Both monitor RSS feeds from configured indexers and push matched torrents to the downloader automatically.</p>
<p>I run two download clients. <strong>qBittorrent</strong> handles the entire arr stack. <strong>Deluge</strong> handles manual downloads and non-media content. Dynamic save paths split movies and TV cleanly for Plex.</p>
<p><strong>Unpackerr</strong> handles automatic extraction for downloads that arrive as archives. <strong>cross-seed</strong> finds identical or near-identical torrents across trackers and advertises that I already have the data, which improves speeds and availability for others.</p>
<p>The only recurring issue is Sonarr or Radarr occasionally grabbing a fake title. Aggressive regex-based filters have mostly resolved it.</p>
<h3 id="media-consumption">Media Consumption</h3>
<ul>
<li>Plex</li>
<li>Tautulli</li>
<li>Overseerr</li>
<li>Homepage</li>
</ul>
<p><strong>Plex</strong> is the primary player. Mature, stable, available on every device, and accessible for non-technical users. I&rsquo;ve tested Jellyfin and like it, but haven&rsquo;t switched.</p>
<p><strong>Tautulli</strong> gives visibility into Plex usage: playback activity, per-user bandwidth, transcoding load. That data makes decisions around limits and capacity easier.</p>
<p><strong>Overseerr</strong> lets users request titles themselves rather than messaging me. Requests still require approval, but that takes seconds instead of a back-and-forth conversation.</p>
<p><strong>Homepage</strong> is a single customisable dashboard with a high-level view of everything running. It doesn&rsquo;t replace Zabbix or Grafana for monitoring, but it&rsquo;s useful for day-to-day glancing.</p>
<h3 id="networking-and-vpn-containment">Networking and VPN Containment</h3>
<p>Torrent clients are routed through <strong>Gluetun</strong>, a dedicated VPN container running WireGuard. The downloaders have never touched my LAN directly and never see my public IP.</p>
<p>Gluetun runs in strict kill-switch mode. If the VPN drops, traffic stops. There&rsquo;s no fallback to my home connection. Given the provider&rsquo;s SLA, this hasn&rsquo;t been an issue in practice.</p>
<p>No inbound ports need to be open, which reduces exposure further. Speed degradation from the VPN hasn&rsquo;t been noticeable.</p>
<h2 id="observability">Observability</h2>
<p>Prometheus, Node Exporter, cAdvisor, and Grafana cover system-level metrics: CPU load, memory usage, container behaviour. Critical alerts go to Telegram. I&rsquo;m refining thresholds so only actionable issues send a notification.</p>
<h2 id="automation">Automation</h2>
<p><strong>Watchtower</strong> handles container updates on a schedule at 03:00. If an update breaks something, rolling back means redeploying from the same configuration paths. Docker&rsquo;s stateless container model makes that straightforward.</p>
<p><strong>Portainer</strong> handles anything that needs a UI.</p>
<h2 id="network-edge">Network Edge</h2>
<h3 id="opnsense">OPNsense</h3>
<p>OPNsense sits between the internet and all internal systems. UPnP is disabled. No device exposes itself automatically. Only explicitly required services are permitted outbound or inbound.</p>
<p>All traffic is statefully inspected. DNS is forced through Unbound. Suricata monitors inbound and outbound traffic for known malicious patterns. Devices can&rsquo;t quietly phone home, bypass DNS filtering, or accept unsolicited inbound connections without generating an alert.</p>
<p>Services get published deliberately, not accidentally exposed.</p>
<h3 id="boreas">Boreas</h3>
<p>A Raspberry Pi running Nginx Proxy Manager and WireGuard. This started as a workaround for a previous router that lacked VPN support. After moving to OPNsense, the separation made enough sense to keep.</p>
<p>Boreas is my remote access point back into the LAN. Nginx Proxy Manager exposes Overseerr to friends and family outside the local network. Both services sit behind Cloudflare, primarily to obscure my real IP.</p>
<p>The throughput on the Pi is better than you&rsquo;d expect for the hardware.</p>
<h3 id="chronos">Chronos</h3>
<p>A Tor middle relay running as a small contribution to online privacy. No exit node: the ISP complaints and CAPTCHA overhead aren&rsquo;t worth it. A middle relay provides value without the operational noise. It&rsquo;s low-maintenance and largely invisible once configured.</p>
<h2 id="failures-and-lessons">Failures and Lessons</h2>
<p>In the past year, two things actually broke:</p>
<ul>
<li>Disks filled up from log spam. Entirely my fault. Log rotation is properly configured now.</li>
<li>Incorrect or fake titles downloaded. Better filters and denied extension lists resolved most of it.</li>
</ul>
<p>Beyond that, issues have been minor misconfigurations and occasional reboots.</p>
<h2 id="what-id-do-differently">What I&rsquo;d Do Differently</h2>
<p>I&rsquo;d spread services across more hosts if I could go back. Some hardening measures were probably over-engineered, but I don&rsquo;t regret that trade-off. Deploying the arr stack earlier would have saved time.</p>
<h2 id="whats-next">What&rsquo;s Next</h2>
<p><strong>Hardware</strong>: managed switch, better access points, a more capable GPU for transcoding.</p>
<p><strong>Monitoring</strong>: a consolidated Zabbix dashboard with proper alerting.</p>
<p><strong>Networking</strong>: VLANs.</p>
<p>Self-hosted AI models aren&rsquo;t on the list. Cloud tools cover what I need without the overhead.</p>
<h2 id="closing">Closing</h2>
<p>Running this lab has mostly taught me patience. Getting multiple devices, containers, and services working together takes iteration. It&rsquo;s improved my understanding of networking and containerised systems more than any course I&rsquo;ve taken.</p>
<p>I keep running it because it&rsquo;s fun and I learn from it. That&rsquo;s enough reason.</p>
]]></content:encoded>
    </item>
  </channel>
</rss>
